GOS 1422010

From Metagenes
Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary!


Sequence
CAMERA AccNum : JCVI_READ_1091120681905
Annotathon code: GOS_1422010
Sample :
  • GPS :42°30'11n; 67°14'24w
  • North American East Coast: Gulf of Maine - USA
  • Coastal (-1m, 18.2°C, 0.1-0.8 microns)
Authors
Team : Algarve
Username : borboletas1
Annotated on : 2010-06-22 21:36:41
  • a33823 AnaFilipaGuerreiroLelo
  • a33849 MónicaIsabelBritoBarros
  • a33855 SandraIsabelVenturaGago

Synopsis

Genomic Sequence

>JCVI_READ_1091120681905 GOS_1422010 Genomic DNA
TTATCTTTATTACCCGTAGCACCAGCGAGGAAACTTCCTCCTTTCATCATTTCTTCCAGGCCAGAAACACCCATACCACCAGCAGCAACTGCAGCAGCAC
CTGCCATTTGTCGACCATACCCGCCTGCAGCCATTGCACCACCTGGAGCAAGAGCGGCTTTTCTTCCTGCGCTAAGTTTTTTACCAATATTCATAGCTCC
TGCTTTGATTCCTTTTCCTGCAGATTGTATGGCTCTGGTCAATGTGGGAAATTGTTTAGCTGTCGCTTTTAATCCTCTATTGAATGTTGGGAACGATGCT
GCAGTAACATTTTTTGCTAACGTTCCAAAAGATCCTGCAGTATTTGCTACCTTTTGACCAAATGTTACTACATTATTTTTGCCTGCGTTTAATGCTGAAG
CATACGCCTTAGTAACATTAGCTCCAGCAGCAAGTTCGTGTCTAAATACTTTAAGAGCTGCTTCAGAACCTTTACTTCCAGCTCCAAATTTTTTCTCGAG
ACCTGCTTCTCCAAGAGCTTTCATCCGGGTTCCAGGAGCAGCCCCAGAACCAGCTCCAGAACCAACATTAGGTCCTTGAGGACCTGCAACCTTTTTACCC
TTTTGAGCTTTCTGCATGTGAGAGTTTAAAGCAGCAGTAGATCGGGTTGTAGCTTCCTGTTCACTATCTCCAGCAAGTAAACTTTTTGCATAAGCCAACA
TAGCTGCTCTAGTTTGTTCTGTGTCAGCTCCAAATTTTAGTAAAGCCTTTTCTATTTTACCCATCGCTTGAGGATTCATAGTTGTAGCAGCACTGATAGT
CGCTGCGTCTTTCTTAGCTTTTCCTCCCTTAGCAAAACCAACAAACCCACCCTTATTATATCCTGTAATTCTATCCGCTTGGTTCATGTTGTGTAG

Translation

[1 - 894/896]   indirect strand
>GOS_1422010 Translation [1-894   indirect strand]
LHNMNQADRITGYNKGGFVGFAKGGKAKKDAATISAATTMNPQAMGKIEKALLKFGADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQKAQKG
KKVAGPQGPNVGSGAGSGAAPGTRMKALGEAGLEKKFGAGSKGSEAALKVFRHELAAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAA
SFPTFNRGLKATAKQFPTLTRAIQSAGKGIKAGAMNIGKKLSAGRKAALAPGGAMAAGGYGRQMAGAAAVAAGGMGVSGLEEMMKGGSFLAGATGNKD

[ Warning ] 5' incomplete: does not start with a Methionine
[ Warning ] 3' incomplete: following codon is not a STOP

Annotator commentaries

We can conlude that the unknown sequence is coding, since it has more than 60 amino acids, even more than 200 amino acids (298 aa) and contains sequences homologous, although no have biological significance because the e-values be high (0.006, 0.024, 0.12, 0.46). The longest ORF of this sequence shows no homologs and/or conserved protein domains and is very incomplete ("open-ended sequence") because don’t presents either initiation codon, or STOP codon. For this reason, wasn't possible to calculate the molecular weight.


Was possible to do the multiple alignments, but this gave so bad results that weren’t possible take any conclusions, since the degree of homologies was too low. The unknown sequence could not find alignments with other sequences.


Wasn’t possible to draw conclusions more about the sequence, since it wasn’t possible to do many procedures due to lack of data, which leaves us with no information to held a discussion.



ORF finding

PROTOCOL


a) SMS ORFinder / direct strand / frames 1, 2 & 3 / min 60 AA / 'any codon' initiation / 'standard' genetic code

b) SMS ORFinder / reverse strand / frames 1, 2 & 3 / min 60 AA / 'any codon' initiation / 'standard' genetic code



RESULTS ANALYSIS


b) reverse strand


No ORFs were found in reading frame 2.


No ORFs were found in reading frame 3.


For the ORFs this presents in the direct strand four ORFs, 2 in the reading frame one and 2 in the reading frame three, while in the reverse strand there is only the existence of one ORF in the reading frame one. The ORF chosen for the analyses was the reverse strand, since this is the string that possesses a greater number of amino acids, however, in the chosen sequence the initiation codon and the STOP codon wasn’t present, what makes the sequence very incomplete("open-ended sequence"). The other’s ORFs also present biologic significance. The sequence is said to be coding since it has more than 60 amino acids.

RAW RESULTS

a) direct strand

>ORF number 1 in reading frame 1 on the direct strand extends from base 1 to base 357.
TTATCTTTATTACCCGTAGCACCAGCGAGGAAACTTCCTCCTTTCATCATTTCTTCCAGG
CCAGAAACACCCATACCACCAGCAGCAACTGCAGCAGCACCTGCCATTTGTCGACCATAC
CCGCCTGCAGCCATTGCACCACCTGGAGCAAGAGCGGCTTTTCTTCCTGCGCTAAGTTTT
TTACCAATATTCATAGCTCCTGCTTTGATTCCTTTTCCTGCAGATTGTATGGCTCTGGTC
AATGTGGGAAATTGTTTAGCTGTCGCTTTTAATCCTCTATTGAATGTTGGGAACGATGCT
GCAGTAACATTTTTTGCTAACGTTCCAAAAGATCCTGCAGTATTTGCTACCTTTTGA

>Translation of ORF number 1 in reading frame 1 on the direct strand.
LSLLPVAPARKLPPFIISSRPETPIPPAATAAAPAICRPYPPAAIAPPGARAAFLPALSF
LPIFIAPALIPFPADCMALVNVGNCLAVAFNPLLNVGNDAAVTFFANVPKDPAVFATF*

>ORF number 2 in reading frame 1 on the direct strand extends from base 358 to base 579.
CCAAATGTTACTACATTATTTTTGCCTGCGTTTAATGCTGAAGCATACGCCTTAGTAACA
TTAGCTCCAGCAGCAAGTTCGTGTCTAAATACTTTAAGAGCTGCTTCAGAACCTTTACTT
CCAGCTCCAAATTTTTTCTCGAGACCTGCTTCTCCAAGAGCTTTCATCCGGGTTCCAGGA
GCAGCCCCAGAACCAGCTCCAGAACCAACATTAGGTCCTTGA

>Translation of ORF number 2 in reading frame 1 on the direct strand.
PNVTTLFLPAFNAEAYALVTLAPAASSCLNTLRAASEPLLPAPNFFSRPASPRAFIRVPG
AAPEPAPEPTLGP*

No ORFs were found in reading frame 2.

>ORF number 1 in reading frame 3 on the direct strand extends from base 3 to base 272.
ATCTTTATTACCCGTAGCACCAGCGAGGAAACTTCCTCCTTTCATCATTTCTTCCAGGCC
AGAAACACCCATACCACCAGCAGCAACTGCAGCAGCACCTGCCATTTGTCGACCATACCC
GCCTGCAGCCATTGCACCACCTGGAGCAAGAGCGGCTTTTCTTCCTGCGCTAAGTTTTTT
ACCAATATTCATAGCTCCTGCTTTGATTCCTTTTCCTGCAGATTGTATGGCTCTGGTCAA
TGTGGGAAATTGTTTAGCTGTCGCTTTTAA

>Translation of ORF number 1 in reading frame 3 on the direct strand.
IFITRSTSEETSSFHHFFQARNTHTTSSNCSSTCHLSTIPACSHCTTWSKSGFSSCAKFF
TNIHSSCFDSFSCRLYGSGQCGKLFSCRF*

>ORF number 2 in reading frame 3 on the direct strand extends from base 399 to base 629.
AGCATACGCCTTAGTAACATTAGCTCCAGCAGCAAGTTCGTGTCTAAATACTTTAAGAGC
TGCTTCAGAACCTTTACTTCCAGCTCCAAATTTTTTCTCGAGACCTGCTTCTCCAAGAGC
TTTCATCCGGGTTCCAGGAGCAGCCCCAGAACCAGCTCCAGAACCAACATTAGGTCCTTG
AGGACCTGCAACCTTTTTACCCTTTTGAGCTTTCTGCATGTGAGAGTTTAA

>Translation of ORF number 2 in reading frame 3 on the direct strand.
SIRLSNISSSSKFVSKYFKSCFRTFTSSSKFFLETCFSKSFHPGSRSSPRTSSRTNIRSL
RTCNLFTLLSFLHVRV*

-----------------------------------------------------------------------------------------------------

b) reverse strand

>ORF number 1 in reading frame 1 on the reverse strand extends from base 1 to base 894.
CTACACAACATGAACCAAGCGGATAGAATTACAGGATATAATAAGGGTGGGTTTGTTGGT
TTTGCTAAGGGAGGAAAAGCTAAGAAAGACGCAGCGACTATCAGTGCTGCTACAACTATG
AATCCTCAAGCGATGGGTAAAATAGAAAAGGCTTTACTAAAATTTGGAGCTGACACAGAA
CAAACTAGAGCAGCTATGTTGGCTTATGCAAAAAGTTTACTTGCTGGAGATAGTGAACAG
GAAGCTACAACCCGATCTACTGCTGCTTTAAACTCTCACATGCAGAAAGCTCAAAAGGGT
AAAAAGGTTGCAGGTCCTCAAGGACCTAATGTTGGTTCTGGAGCTGGTTCTGGGGCTGCT
CCTGGAACCCGGATGAAAGCTCTTGGAGAAGCAGGTCTCGAGAAAAAATTTGGAGCTGGA
AGTAAAGGTTCTGAAGCAGCTCTTAAAGTATTTAGACACGAACTTGCTGCTGGAGCTAAT
GTTACTAAGGCGTATGCTTCAGCATTAAACGCAGGCAAAAATAATGTAGTAACATTTGGT
CAAAAGGTAGCAAATACTGCAGGATCTTTTGGAACGTTAGCAAAAAATGTTACTGCAGCA
TCGTTCCCAACATTCAATAGAGGATTAAAAGCGACAGCTAAACAATTTCCCACATTGACC
AGAGCCATACAATCTGCAGGAAAAGGAATCAAAGCAGGAGCTATGAATATTGGTAAAAAA
CTTAGCGCAGGAAGAAAAGCCGCTCTTGCTCCAGGTGGTGCAATGGCTGCAGGCGGGTAT
GGTCGACAAATGGCAGGTGCTGCTGCAGTTGCTGCTGGTGGTATGGGTGTTTCTGGCCTG
GAAGAAATGATGAAAGGAGGAAGTTTCCTCGCTGGTGCTACGGGTAATAAAGAT

>Translation of ORF number 1 in reading frame 1 on the reverse strand.
LHNMNQADRITGYNKGGFVGFAKGGKAKKDAATISAATTMNPQAMGKIEKALLKFGADTE
QTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQKAQKGKKVAGPQGPNVGSGAGSGAA
PGTRMKALGEAGLEKKFGAGSKGSEAALKVFRHELAAGANVTKAYASALNAGKNNVVTFG
QKVANTAGSFGTLAKNVTAASFPTFNRGLKATAKQFPTLTRAIQSAGKGIKAGAMNIGKK
LSAGRKAALAPGGAMAAGGYGRQMAGAAAVAAGGMGVSGLEEMMKGGSFLAGATGNKD

Multiple Alignement

PROTOCOL


EBI,CLUSTAL 2.0.12 multiple sequence alignment



RESULTS ANALYSIS


The multiple alignments obtein were so bad, that wasn’t possible to proceed to the next’s steps of annotathon.


The unknown sequence could not find homologies and alignments with other sequences.

RAW RESULTS

GOS_142201      ------------------------------------------------------------
Lactobacil      ---------------------------mkGsrRRftiqaivwlvvmvAaIiflpnisslV
Roseovariu      ------------------------------------------------------------
Xanthomona      mnrafalvwnaaigncgvthelvrrrwkpGaPRRqlpalVvltvgaaApmlawgactpAi
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------mPshravkkVktrpvpfvlIasllapaala
Shigella_s      -------------------------------------------------------mnMAV
Ecoli9_IN       ---------------------------------------------------------MAV
Ecoli7_IN       ---------------------------------------------------------MAV
Ecoli13_IN      ---------------------------------------------------------MAV
Ecoli14_IN      ---------------------------------------------------------MAV
Ecoli6_IN       ---------------------------------------------------------MAV
Ecoli1_IN       ---------------------------------------------------------MtV
Ecoli2_IN       ---------------------------------------------------------MtV
Ecoli4_IN       ---------------------------------------------------------MAV
Escoli5_IN      ---------------------------------------------------------MAV
Ecoli3_IN       ---------------------------------------------------------MtV
Ecoli_10_I      ---------------------------------------------------------MtV
Ecoli11_IN      ---------------------------------------------------------MtV
Ecoli12_IN      ---------------------------------------------------------MtV
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      rdkGqTkl----------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      paaGaTvtcsglpstsnifasgannltvniasgskmtagllggtaialsgtnstlnnagt
Verminephr      ------------------------------------------------------------
Opitutus_t      qstGg-------------------------------------------------------
Shigella_s      KISG--------------------------------------------------------
Ecoli9_IN       KISG--------------------------------------------------------
Ecoli7_IN       rISG--------------------------------------------------------
Ecoli13_IN      KISG--------------------------------------------------------
Ecoli14_IN      KISG--------------------------------------------------------
Ecoli6_IN       KISG--------------------------------------------------------
Ecoli1_IN       KISG--------------------------------------------------------
Ecoli2_IN       KISG--------------------------------------------------------
Ecoli4_IN       KISG--------------------------------------------------------
Escoli5_IN      KISG--------------------------------------------------------
Ecoli3_IN       KISG--------------------------------------------------------
Ecoli_10_I      KISG--------------------------------------------------------
Ecoli11_IN      KISG--------------------------------------------------------
Ecoli12_IN      KISG--------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      idpsvlgllsvlssgvtmgnasstavtvnnlaggtiygtggvlganllnldgmalgltsa
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      -----------------------pSSaksqVaqviqnhwgrnQNnTrqvvvvFNNgdspl
Roseovariu      ------------------------------------------------------------
Xanthomona      sggtvqinnagvidsrplaglsilSSntpviavtGgGtvNAvntgTingrvsFqsSatgn
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------VLqavnnatNAaKNasraasnaaNNAahaa
Shigella_s      ------------------------------VLKDGTG-----------------------
Ecoli9_IN       ------------------------------VLKDGTG-----------------------
Ecoli7_IN       ------------------------------VLKDGaG-----------------------
Ecoli13_IN      ------------------------------VLKDGTG-----------------------
Ecoli14_IN      ------------------------------VLKDGTG-----------------------
Ecoli6_IN       ------------------------------VLKDGTG-----------------------
Ecoli1_IN       ------------------------------VLKDGTG-----------------------
Ecoli2_IN       ------------------------------VLKDGTG-----------------------
Ecoli4_IN       ------------------------------VLKDGTG-----------------------
Escoli5_IN      ------------------------------VLKDGTG-----------------------
Ecoli3_IN       ------------------------------VLKDGTG-----------------------
Ecoli_10_I      ------------------------------VLKDGTG-----------------------
Ecoli11_IN      ------------------------------VLKDGTG-----------------------
Ecoli12_IN      ------------------------------VLKDGTG-----------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      TsdqKqaidgtiqRfKvqkskyhVksmTaASdNaaakkqliskdkstQllq---------
Roseovariu      ------------------------------------------------------------
Xanthomona      TfvnagaingsvsmgagstNSfTaVttgtvnsagglAlellgpggllsfaqTgvvdgglg
Verminephr      -----------------mtslvhVlVdlenvqpPaallenlapgfanaw-----------
Opitutus_t      TnaaKhasdsaanaanqatqaannaadaaAtaasnaAntaasaatnaasnaT--------
Shigella_s      -----KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVi--------
Ecoli9_IN       -----KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVi--------
Ecoli7_IN       -----KPiQNCTIQLKARRNSTTVVVNTvASENPDEAGRYtMDVEYGQYSVs--------
Ecoli13_IN      -----KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVi--------
Ecoli14_IN      -----KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVi--------
Ecoli6_IN       -----KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVi--------
Ecoli1_IN       -----KPVQNCTIvLKARRtSsTVVVNTvASENPDEAGRYSMDVEhGQYSVT--------
Ecoli2_IN       -----KPVQNCTIvLKARRtSsTVVVNTvASENPDEAGRYSMDVEhGQYSVT--------
Ecoli4_IN       -----KPVeNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVi--------
Escoli5_IN      -----KPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVi--------
Ecoli3_IN       -----KPVQNCTIvLKARRtSsTVVVNTvASENPDEAGRYSMDVEhGQYSVT--------
Ecoli_10_I      -----KPVQNCTIvLKARRtSsTVVVNTvASENPDEAGRYSMDVEhGQYSVT--------
Ecoli11_IN      -----KPVQNCTIvLKARRtSsTVVVNTvASENPDEAGRYSMDVEhGQYSVT--------
Ecoli12_IN      -----KPVQNCTIvLKARRtSsTVVVNTvASENPDEAGRYSMDVEhGQYSVT--------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      gnntlilqnsavgtgsgstgagtlsttqyinfgnlrvnsgtwsvggannfgnsalnggvl
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ----------------------------------------lhnmnqadritGynkGg---
Lactobacil      ---------------------------------LmVgknqtvanmTknItaGaKtag---
Roseovariu      ------------------------------------------mnGTrTaivGaal-----
Xanthomona      qfanpaqlgntitanggtleataagltlaptngialgagglSlqGTnnliiGSaitgtga
Verminephr      ---------------------------------ifhgphqeklskTfktliGeRatl---
Opitutus_t      ---------------------------------naadaaanaansaVsaasnaatta---
Shigella_s      ---------------------------------LLVEGFPPSHAGTITVYEdSRPGT---
Ecoli9_IN       ---------------------------------LLVEGFPPSHAGTITVYEdSRPGT---
Ecoli7_IN       ---------------------------------LLVEGFPPSHAGiITVYEdSKPGT---
Ecoli13_IN      ---------------------------------LLVEGFPPSHAGTITVYEdSqPGT---
Ecoli14_IN      ---------------------------------LLVEGFPPSHAGTITVYEdSqPGT---
Ecoli6_IN       ---------------------------------LLVEGFPPSHAGTITVYEdSRPGT---
Ecoli1_IN       ---------------------------------LLVEGFPPSHAGTITVYEGSRPGT---
Ecoli2_IN       ---------------------------------LLVEGFPPSHAGTITVYEGSRPGT---
Ecoli4_IN       ---------------------------------LLVEGFPPSHAGTITVYEdSRPGT---
Escoli5_IN      ---------------------------------LLVEGFPPSHAGTITVYEdSqPGT---
Ecoli3_IN       ---------------------------------LLVEGFPPSHAGTITVYEGSRPGT---
Ecoli_10_I      ---------------------------------LLVEGFPPSHAGTITVYEGSRPGT---
Ecoli11_IN      ---------------------------------LLVEGFPPSHAGTITVYEGSRPGT---
Ecoli12_IN      ---------------------------------LLVEGFPPSHAGTITVYEGSRPGT---
Ecoli8_IN       ------------------------------------------mmsdrrhcaa--------

GOS_142201      -------------fvgFakggkakkdaati------------------------------
Lactobacil      -------------vktyvtgsdilnddftqeteagIqkteIitvififIvltlVfrSpit
Roseovariu      -------------valytGAiaaaDgitkl------------------------------
Xanthomona      ltktgtggltlngagtFtGgvnltDgaltvgtnsaLgtgtVtasggsvLagasVnlAnaf
Verminephr      -------------vpitrtgnnalDf----------------------------------
Opitutus_t      -------------tNaaqsAtTaasaatdAasnaanaat---------------------
Shigella_s      -------------LNDFLGAMTEDDaRPEALRRFELMVE---------------------
Ecoli9_IN       -------------LNDFLGAMTEDDaRPEALRRFELMVE---------------------
Ecoli7_IN       -------------LNDFLGAMTEDDvRPEALRRFELMVE---------------------
Ecoli13_IN      -------------LNDFLGAMTEDDaRPEALRRFELMVE---------------------
Ecoli14_IN      -------------LNDFLGAMTEDDaRPEALRRFELMVE---------------------
Ecoli6_IN       -------------LNDFLGAMTEDDaRPEALRRFELMVE---------------------
Ecoli1_IN       -------------LNDFLGAMTEDDvRPEALRRFEqMVE---------------------
Ecoli2_IN       -------------LNDFLGAMTEDDvRPEALRRFEqMVE---------------------
Ecoli4_IN       -------------LNDFLGAMTEDDaRPEALRRFELMVE---------------------
Escoli5_IN      -------------LNDFLGAMTEDDaRPEALRRFEqMVE---------------------
Ecoli3_IN       -------------LNDFLGAMTEDDvRPEALRRFEqMVE---------------------
Ecoli_10_I      -------------LNDFLGAMTEDDvRPEALRRFEqMVE---------------------
Ecoli11_IN      -------------LNDFLGAMTEDDvRPEALRRFEqMVE---------------------
Ecoli12_IN      -------------LNDFLGAMTEDDvRPEALRRFEqMVE---------------------
Ecoli8_IN       -------------LsrwwkrchvtppRlhrIR----------------------------

GOS_142201      ---------------saAttmnpqAmgkiekAllkfgadt--------------------
Lactobacil      pLvSlLsVG------vsfiislsVvmNlvnkfnfplSnftqvfmvvvlfgigtdynillf
Roseovariu      ---------------iaggyAapqlfalsaglvllmSlga--------------------
Xanthomona      sLsNtVgIGgenplriagtisgtgllNkvgtgtltlgg-n--------------------
Verminephr      ---------------hlsfylgyVAakhpdArlvvvand---------------------
Opitutus_t      ---------------naAntAStaAasaAdAAtnaAaaAt--------------------
Shigella_s      ---------------EVARNASAVAQNTA-AAKKSASDAS--------------------
Ecoli9_IN       ---------------EVARNASAVAQNTA-AAKKSAgDAg--------------------
Ecoli7_IN       ---------------EVARNASAVAQNTA-AAKKSAgDAg--------------------
Ecoli13_IN      ---------------EVARNASAVAQNTA-AAKKSASDAS--------------------
Ecoli14_IN      ---------------EVARNASAVAQNTA-AAKKSASDAS--------------------
Ecoli6_IN       ---------------EVARNASAVAQNTA-AAKKSASDAg--------------------
Ecoli1_IN       ---------------EVsRNASAVAQNTA-AAKKSASDAS--------------------
Ecoli2_IN       ---------------EVsRNASAVAQNTA-AAKKSASDAS--------------------
Ecoli4_IN       ---------------EVARNASAVAQNTA-AAKKSASDAS--------------------
Escoli5_IN      ---------------EVARNASvVAQNTA-AAKKSASDAg--------------------
Ecoli3_IN       ---------------EVsRNASAVAQNTA-AAKKSASDAS--------------------
Ecoli_10_I      ---------------EVsRNASAVAQNTA-AAKKSASDAS--------------------
Ecoli11_IN      ---------------EVsRNASAVAQNTA-AAKKSASDAS--------------------
Ecoli12_IN      ---------------EVsRNASAVAQNTA-AAKKSASDAS--------------------
Ecoli8_IN       ---------------QprKNqpAmpvhqparrqlmqpmlq--------------------

GOS_142201      eqtRaAmlayakslLa---gdSeqeAtTrstaAlnShmq---------------------
Lactobacil      dqfKEelsHgdspV-----AAtgRAlkiagrtilySgsS---------------------
Roseovariu      SrkngetlrtTcr------gAmAlrAlltvvsAlSffQa---------------------
Xanthomona      ntysggtTlgagsVLletsgAlgtgtvTaAGgsldttaplsltnnfaltntlglgasgna
Verminephr      kgyepmieHA---------vSlgfevrregynAAdkttp---------------------
Opitutus_t      naAstAtTaAsnA------AdaAttAaTaAsnAAntAtt---------------------
Shigella_s      TSAREAATHATDA------AdSARAASTSAGQAASSAQS---------------------
Ecoli9_IN       TSAREAATHATDA------AgSARAASTSAGQAAtSAQS---------------------
Ecoli7_IN       TSAREAATHATDA------AgSARAASTSAGQAASSAQS---------------------
Ecoli13_IN      TSAREAATHAaDA------AdSARAASTSAGQAASSAQS---------------------
Ecoli14_IN      TSAREAATHAaDA------AdSARAASTSAGQAASSAQS---------------------
Ecoli6_IN       TSAREAATHATDA------AgSARAASTSAGQAASSAQS---------------------
Ecoli1_IN       aSAsEAATHATDA------AASARAASTSAGQAASSAQS---------------------
Ecoli2_IN       aSAsEAATHATDA------AASARAASTSAGQAASSAQS---------------------
Ecoli4_IN       TSAREAATHATDA------AgSARAASTSAGQAASSAQS---------------------
Escoli5_IN      TSAREAATHATDA------AdSARAASTSAGQAASSAQS---------------------
Ecoli3_IN       aSAsEAATHATDA------AASARAASTSAGQAASSAQS---------------------
Ecoli_10_I      aSAsEAATHATDA------AASARAASTSAGQAASSAQS---------------------
Ecoli11_IN      aSAsEAATHATDA------AASARAASTSAGQAASSAQS---------------------
Ecoli12_IN      aSAsEAATHATDA------AASARAASTSAGQAASSAQS---------------------
Ecoli8_IN       pqh----------------vppARAASTSAGQAASSAQS---------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      -------------lligfsalglanfsiyKSAvgvAiavAilllvllTlnpffmAllgpr
Roseovariu      ------------------------------------------------------------
Xanthomona      ltlsgtlagvggvnktgAGTltlgglntysggtnlAsgtlqlgtasAlgtgalnvTgASN
Verminephr      ------------------------------------------------------------
Opitutus_t      -------------AaSnAanAaTnAaqtAtnAAsnAsdaAtnaanvATnaastaATaASN
Shigella_s      -------------ASSSAGTASTKATEAsKSAAA--------------------------
Ecoli9_IN       -------------ASSSAGTASTKATEAsKSAAA--------------------------
Ecoli7_IN       -------------ASSSAGTASaKATEAsKSAAA--------------------------
Ecoli13_IN      -------------ASSSAGTASTKATEAsKSAAA--------------------------
Ecoli14_IN      -------------ASSSAGTASTKATEAsKSAAA--------------------------
Ecoli6_IN       -------------ASSSAGTASTKATEAsKSAAA--------------------------
Ecoli1_IN       -------------ASSSAGTASTKArEAAKSAAA--------------------------
Ecoli2_IN       -------------ASSSAGTASTKArEAAKSAAA--------------------------
Ecoli4_IN       -------------ASSSAGTASTKATEAsKSAAA--------------------------
Escoli5_IN      -------------ASSSAGTASTKATEAeKSAAA--------------------------
Ecoli3_IN       -------------ASSSAGTASTKArEAAKSAAA--------------------------
Ecoli_10_I      -------------ASSSAGTASTKArEAAKSAAA--------------------------
Ecoli11_IN      -------------ASSSAGTASTKArEAAKSAAA--------------------------
Ecoli12_IN      -------------ASSSAGTASTKArEAAKSAAA--------------------------
Ecoli8_IN       -------------ASSSAGTASTKArEAAKSAAA--------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      LfwptKkfAggStSkmwhgiAAssvry---------------------------------
Roseovariu      ---------frllp------fAdvflf---------------------------------
Xanthomona      LsttapltvAnaiS------lAaalnlpstqaltltgaisgagsliksgagdltlanana
Verminephr      ------------------------------------------------------------
Opitutus_t      aAdaatnAAtaatn------AAnTatr---------------------------------
Shigella_s      -AESSKSAAATSAg------AAKTlET---------------------------------
Ecoli9_IN       -AESSKSAAATSAg------AAKTSET---------------------------------
Ecoli7_IN       -AESSKSAAATSAa------AAKTSET---------------------------------
Ecoli13_IN      -AESSKSAAATSAg------AAKTSET---------------------------------
Ecoli14_IN      -AESSKSAAATSAg------AAKTSET---------------------------------
Ecoli6_IN       -AESSKSAAATSAS------AAKTSET---------------------------------
Ecoli1_IN       -AESSKSAAATSAS------AAKTSET---------------------------------
Ecoli2_IN       -AESSKSAAATSAS------AAKTSET---------------------------------
Ecoli4_IN       -AESSKSAAATSAa------AAKTSET---------------------------------
Escoli5_IN      -AESSKSAAATSAa------AAKTSET---------------------------------
Ecoli3_IN       -AESSKSAAATSAS------AAKTSET---------------------------------
Ecoli_10_I      -AESSKSAAATSAS------AAKTSET---------------------------------
Ecoli11_IN      -AESSKSAAATSAS------AAKTSET---------------------------------
Ecoli12_IN      -AESSKSAAATSAS------AAKTSET---------------------------------
Ecoli8_IN       -AESSKSAAATSAS------AAKTSET---------------------------------

GOS_142201      -------------------------------------------kAqkgkkvAgpqgpnv-
Lactobacil      -------------------------------------------piiAlvvvlgvSlpfil
Roseovariu      -------------------------------------------iAlvpliaAAmSgpal-
Xanthomona      ytggttlsagrlvvgsnaalgtgtltasggeldattattlgnaiAltgtmgvgsSgnaln
Verminephr      -------------------------------------------kkAApdk----------
Opitutus_t      -------------------------------------------aAsnaaQtAtTaAnnAs
Shigella_s      -------------------------------------------NAAASQQSAATSASTAT
Ecoli9_IN       -------------------------------------------NAAASQkSAATSASaAT
Ecoli7_IN       -------------------------------------------NAAASQQSAATSASTAT
Ecoli13_IN      -------------------------------------------NAsASlQSAATSASTAT
Ecoli14_IN      -------------------------------------------NAsASlQSAATSASTAT
Ecoli6_IN       -------------------------------------------NAAASQkSAATSASaAT
Ecoli1_IN       -------------------------------------------NAAASQkSAATSASTAT
Ecoli2_IN       -------------------------------------------NAAASQkSAATSASTAT
Ecoli4_IN       -------------------------------------------NAAASQQSAATSASTAT
Escoli5_IN      -------------------------------------------NAAASQQSAATSASTAT
Ecoli3_IN       -------------------------------------------NAAASQkSAATSASTAT
Ecoli_10_I      -------------------------------------------NAAASQQSAATSASTAT
Ecoli11_IN      -------------------------------------------NAAASQQSAATSASTAT
Ecoli12_IN      -------------------------------------------NAAASQQSAATSASTAT
Ecoli8_IN       -------------------------------------------NAAASQQSAATSASTAT

GOS_142201      ------------------------------------------gSgagsgaap--------
Lactobacil      TynnElnydtlaelndtipAk---------------kgfqvvqdhfskgtaepStlyikt
Roseovariu      ------------------------------------------------------------
Xanthomona      ltgtisgagAlnklgtgtltlgglntysggtslnagtlqvAsgtalgtgvldvtgAAtlq
Verminephr      ------------------------------------------------------------
Opitutus_t      qaASnAAstAaDAAtnaatAatn-------------AantAttaatsaAStatnaAsnaA
Shigella_s      TKASEAATSARDAs----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli9_IN       TKASEAATSARDAA----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli7_IN       TKASEAATSARDAs----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli13_IN      TKASEAATSARDAA----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli14_IN      TKASEAATSARDAA----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli6_IN       TKASEAATSARDAA----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli1_IN       mKASEAATSARgAA----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli2_IN       mKASEAATSARgAA----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli4_IN       TKASEAATSARDAs----------------------ASKEAAKSSETNASSSASSAASSA
Escoli5_IN      TKASEAATSARDAs----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli3_IN       TKASEAATSARgAA----------------------ASKEAAKSSETNASSSASSAASSA
Ecoli_10_I      TKASEAATSARDAs----------------------ASKEAAKSSETNAaSSASSAASSA
Ecoli11_IN      TKASEAATSARDAs----------------------ASKEAAKSSETNAaSSASSAASSA
Ecoli12_IN      TKASEAATSARDAs----------------------ASKEAAKSSETNAaSSASSAASSA
Ecoli8_IN       TKASEAATSARDAs----------------------ASKEAAKSSETNAaSSASSAASSA

GOS_142201      ------------------------------------------------------------
Lactobacil      dhAlNneKdlKvidsltkqlqkvkGvktvAsvtqpG------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      ntAaatlnnAvTlsTgtltldgAqaltlggniSgAGslvkSggdltlsgnntyaggtsln
Verminephr      ------------------------------------------------------------
Opitutus_t      eAAsNtAntAaTaaTNAattaaAAasnAtnAASnAaqtatTaa-----------------
Shigella_s      TAAaNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli9_IN       TAAGNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli7_IN       TAAGNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli13_IN      TAAGNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli14_IN      TAAGNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli6_IN       TAAGNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli1_IN       TAAGNSAKAAKTSETNAkSSETAAGQSASAAA----------------------------
Ecoli2_IN       TAAGNSAKAAKTSETNAkSSETAAGQSASAAA----------------------------
Ecoli4_IN       TAAGNSAKAAKTSETNAkSSETAAGQSASAAA----------------------------
Escoli5_IN      TAAaNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli3_IN       TAAGNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli_10_I      TAAaNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli11_IN      TAAaNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli12_IN      TAAaNSAKAAKTSETNARSSETAAGQSASAAA----------------------------
Ecoli8_IN       TAAaNSAKAAKTSETNARSSETAAGQSASAAA----------------------------

GOS_142201      ---------------------Gtrmk------------------------------Alge
Lactobacil      ---------------------GSqidqly----------------vgnqlGtvtsgmksA
Roseovariu      ---------------------GepvrpAv-----------------------wmAlllgA
Xanthomona      tgkltvgsntalgsgtlaaaaGttldtAvvStlAnaiAlagqldvggsnAltltgaisgA
Verminephr      ---------------------GtpakkAv--------------------------akkkA
Opitutus_t      ---------------------snaadAATnAatAatnAantaAtAAtnaAntAtnaAsnA
Shigella_s      ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli9_IN       ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli7_IN       ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli13_IN      ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli14_IN      ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli6_IN       ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli1_IN       ---------------------GSKTAAAl------------SASAASTSAGQASASATAA
Ecoli2_IN       ---------------------GSKTAAAl------------SASAASTSAGQASASATAA
Ecoli4_IN       ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Escoli5_IN      ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli3_IN       ---------------------GSKTAAAS------------SASAASTSAGQASASATAA
Ecoli_10_I      ---------------------dSKTAAAl------------SASAASTSAGQASASATAA
Ecoli11_IN      ---------------------dSKTAAAl------------SASAASTSAGQASASATAA
Ecoli12_IN      ---------------------dSKTAAAl------------SASAASTSAGQASASATAA
Ecoli8_IN       ---------------------dSKTAAAl------------SASAASTSAGQASASATAA

GOS_142201      aglekkfga---------------------------------------------------
Lactobacil      sKgltkigSGLdsAnSklksAnmsdSAkSAkklADgAdevASgvVSytdgvyTvnngLnq
Roseovariu      Ggllfvmpa---------------------------------------------------
Xanthomona      GgldklgpasLTlAgnNtylggTrltAgSlilgSDtAlgAgSlsVSgglldsTaarvLaN
Verminephr      aKkAtpvAk---------------------------------------------------
Opitutus_t      aqtAtnAASnaTsAaSNaaaAaTnaAstaAqtattaAsnAAntaaSaatnaantattaaS
Shigella_s      GKSAESAAS---------------------------------------------------
Ecoli9_IN       GKSAESAAS---------------------------------------------------
Ecoli7_IN       GKSAESAAS---------------------------------------------------
Ecoli13_IN      GKSAESAAS---------------------------------------------------
Ecoli14_IN      GKSAESAAS---------------------------------------------------
Ecoli6_IN       GKSAESAAS---------------------------------------------------
Ecoli1_IN       GKSAESAAS---------------------------------------------------
Ecoli2_IN       GKSAESAAS---------------------------------------------------
Ecoli4_IN       GKSAESAAS---------------------------------------------------
Escoli5_IN      GKSAESAAS---------------------------------------------------
Ecoli3_IN       GKSAESAAS---------------------------------------------------
Ecoli_10_I      GKSAESAAS---------------------------------------------------
Ecoli11_IN      GKSAESAAS---------------------------------------------------
Ecoli12_IN      GKSAESAAS---------------------------------------------------
Ecoli8_IN       GKSAESAAS---------------------------------------------------

GOS_142201      ---------------------------------------------gskgsEaAlkvfRhe
Lactobacil      lnsktgTLssGvnQlv---------------------SgSgqvTsgaktvasgnstlatn
Roseovariu      ---------------------------------------------gelslDpghlwAlfA
Xanthomona      svvldAgLqlGgsQdltlnggiSgigglNkqGSAtltlgSanTfsgGvAlDagrlvlgnn
Verminephr      -----------------------------------------------ksatkkkvptkkk
Opitutus_t      naadaATtaanaatnaantaatAatnaaStaGNAasnaAqTAanaAsnAsQvASsAAttA
Shigella_s      -------------------------------------SASTATTKAGEATEQAtAAARSA
Ecoli9_IN       -------------------------------------SASTATTKAGEATEQASAAARSA
Ecoli7_IN       -------------------------------------SASTATTKAGEAavQASAAARSA
Ecoli13_IN      -------------------------------------SASTATTKAGEATEQASAAARSA
Ecoli14_IN      -------------------------------------SASTATTKAGEATEQASAAARSA
Ecoli6_IN       -------------------------------------SASTATTKAGEATEQASAAARSA
Ecoli1_IN       -------------------------------------SASTATTKAGkATEQAtAAARSA
Ecoli2_IN       -------------------------------------SASTATTKAGkATEQAtAAARSA
Ecoli4_IN       -------------------------------------SASTATTKAGEATEQASAAAsSA
Escoli5_IN      -------------------------------------SASTATTKAGEATEQASAAARSA
Ecoli3_IN       -------------------------------------SASTATTKAGkATEQASAAARSA
Ecoli_10_I      -------------------------------------SASTATTKAGEAavQASAAARSA
Ecoli11_IN      -------------------------------------SASTATTKAGEAavQASAAARSA
Ecoli12_IN      -------------------------------------SASTATTKAGEAavQASAAARSA
Ecoli8_IN       -------------------------------------SASTATTKAGEAavQASAAARSA

GOS_142201      lAAganvTkAyASaln------------------------------------AgKnnvv-
Lactobacil      lvtyTngvytvnSglnqldSktgtlsSgvnklAtgAnSlS-sglttvfsgtqslsnglse
Roseovariu      --------------------------------------------------AftgtGSm--
Xanthomona      dAlgvggl--vvSgaaeld-tnaAltvgndvllnapltlv-gSnDlAldgtisgtGSlik
Verminephr      lAvaivvapkKsnasndld-------------------------------tpAAlrrvvk
Opitutus_t      anAaTtaTaAannaTnAaSNaaqtAttaASnAAdaAttAanAatnaAntaAtAAtnaAsT
Shigella_s      SAAKTSETNAKASETrAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli9_IN       SAAKTSETNAKASETSAES-SKTAAASSASSAsSSASSAS-ASKDEATRQASAAKGSATT
Ecoli7_IN       SAAKTSETNAKASETrAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli13_IN      SAAKTSETNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKsSATT
Ecoli14_IN      SAAKTSETNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKsSATT
Ecoli6_IN       SAAKTSETNAKtSadnAaS-SKaAAASSAgSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli1_IN       SAAKTSETNAKtSadnAaS-SKaAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli2_IN       SAAKTSETNAKtSadnAaS-SKaAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli4_IN       SAAKTSETNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKsSATT
Escoli5_IN      SAAKTSETNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli3_IN       SAAKTSETNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli_10_I      SAAKTSkTNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli11_IN      SAAKTSkTNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli12_IN      SAAKTSkTNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT
Ecoli8_IN       SAAKTSkTNAKASETSAES-SKTAAASSASSAASSASSAS-ASKDEATRQASAAKGSATT

GOS_142201      ------------------------------------------------------------
Lactobacil      lnnKkTsissgtTqlttSisAlqQgsaTlsenlaalEkevsssnsaidtseLqtltDqlk
Roseovariu      ------------------------------------------------------------
Xanthomona      ngTttlsltatnsysggtsltAgtiavgAdnAlgtgplAvlgnsvlsnAVAvalAnDiAl
Verminephr      AfsK--------------------------------------------------------
Opitutus_t      ASnaAsnAAqtaTnaasNAsqvAsSaaTtaanAattaTAAannatnAasnAaqtAttaAs
Shigella_s      ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli9_IN       ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli7_IN       ASTKATEAAGS-------ATAAsQSKtTAESAATRAEaAAdRAEeIAgAVAmEDAS----
Ecoli13_IN      ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli14_IN      ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli6_IN       AtTKAlEAAGS-------ATAAsQSKvaAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli1_IN       AtTKAsEAAGS-------ATAAsQSKvaAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli2_IN       AtTKAsEAAGS-------ATAAsQSKvaAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli4_IN       ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Escoli5_IN      ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli3_IN       ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli_10_I      ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli11_IN      ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli12_IN      ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----
Ecoli8_IN       ASTKATEAAGS-------ATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDAS----

GOS_142201      -------TfgqkVan---------------------------------------------
Lactobacil      kintnlqTvadspnn------------------------------------VsTniSSve
Roseovariu      -------iaaryIar---------------------------------------------
Xanthomona      gAaltvdnaadmlasgaisgsgdliktglgtltlsgnnsytgplaiqagtvVastaaSlg
Verminephr      -------mgaerpqk---------------------------------------------
Opitutus_t      nAadaatTaanaatn-----------------------------------aanTaataAT
Shigella_s      -------TTKKGIVQ----------------------------------------LSSAT
Ecoli9_IN       -------TTKKGIVQ----------------------------------------LSSAT
Ecoli7_IN       -------lTtKGVVk----------------------------------------LSSAv
Ecoli13_IN      -------TTKKGIVQ----------------------------------------LSSAT
Ecoli14_IN      -------TTKKGIVQ----------------------------------------LSSAT
Ecoli6_IN       -------TTKKGIVQ----------------------------------------LSSAT
Ecoli1_IN       -------TTKKGIVQ---------------------------------------------
Ecoli2_IN       -------TTKKGIVQ---------------------------------------------
Ecoli4_IN       -------TTKKGIVQ----------------------------------------LSSAT
Escoli5_IN      -------TTKKGIVQ----------------------------------------LSSAT
Ecoli3_IN       -------TTKKGIVQ---------------------------------------------
Ecoli_10_I      -------TTKKGIVQ----------------------------------------LSSAT
Ecoli11_IN      -------TTKKGIVQ----------------------------------------LSSAT
Ecoli12_IN      -------TTKKGIVQ----------------------------------------LSSAT
Ecoli8_IN       -------TTKKGIVQ----------------------------------------LSSAT

GOS_142201      ------------------------------------------------------------
Lactobacil      kaaSafsAsnedisndmvtMksalssAasrdAAss-------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      NasnvdvAAgallnltnggLiNtltgAgDvdtdtgatlQLgggdfagSlggggnldkVGt
Verminephr      ------------------------------------------------------------
Opitutus_t      NaaStasnAasnAaqtttnaAsnasQAasNaAntaataaanaAntaTt---------aaS
Shigella_s      NSTSESLAATPKAVKAAYDLAN--------------------------------------
Ecoli9_IN       NSTSESLAATPKAVKAAYDLAN--------------------------------------
Ecoli7_IN       dSTSESLAATPKAVKsAYDnAekRLQkDqNGAdipD---------------------kGr
Ecoli13_IN      NSTSEtLAATPKAVKsAYDnAekRLQkDqNGAdipD---------------------kGc
Ecoli14_IN      NSTSEtLAATPKAVKsAYDnAekRLQkDqNGAdipD---------------------kGc
Ecoli6_IN       NSTSEtLAATPKAVKsAYDnAekRLQkDqNGAdipD---------------------kGr
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       NSTSESLAATPKAVKAAYDLAN--------------------------------------
Escoli5_IN      NSTSESLAATPKAVKAAYeLAN--------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      NSTSESLAATPKAVKAAYeLANGKytAqDattAqKgIVQLSNATNSTSEMLAATPKSVkA
Ecoli11_IN      NSTSESLAATPKAVKAAYeLANGKytAqDattAqKgIVQLSNATNSTSEMLAATPKSVkA
Ecoli12_IN      NSTSESLAATPKAVKAAYeLANGKytAqDattAqKgIVQLSNATNSTSEMLAATPKSVkA
Ecoli8_IN       NSTSESLAATPKAVKAAYeLANGKytAqDattAqKgIVQLSNATNSTSEMLAATPKSVkA

GOS_142201      -----------------------tagsfgtlaknv-------------------------
Lactobacil      ------sttasvNAnsvasaalaaagssatseQKaAissavtsevnaqnSsnaStsstAT
Roseovariu      ------------------------------------------------------------
Xanthomona      gtvrllGtsaiggsTqvsgGtldvvgslgtsalnvAlggtltgtgtagnvGaavnisdgg
Verminephr      ------------------------------------------------------------
Opitutus_t      naaqtasgaasyaAdaAsnaattaanAaNtaSnaasAasnvasqAastASnaaSnavdAa
Shigella_s      ------GKYTAQDATTAQKGIIQLSSATNSTSEtLAATPKAVKAAnDnAekRlqkdqnga
Ecoli9_IN       ------GKYTAQDATTAQKGIIQLSSATNSTSEtLAATPKAVKAAnDnAekRlqkdqnga
Ecoli7_IN       fLnniNavsktdfAdkrgmryVrvnapagaTSgKyypvvvmrsAgsvselasrviittAT
Ecoli13_IN      fLnniNavsktdfAdkrgmryVrvnapagaTSgKyypvvvmrsAgsvselasrviittAT
Ecoli14_IN      fLnniNavsktdfAdkrgmryVrvnapagaTSgKyypvvvmrsAgsvselasrviittAT
Ecoli6_IN       fLnniNavsktdfAdkrgmryVrvnapagaTSgKyypvvvmrsAgsvselasrviittAT
Ecoli1_IN       -----------------------LSSATNSTSEKLAATPKAVKtvkD-------------
Ecoli2_IN       -----------------------LSSATNSTSEKLAATPKAVKtvkD-------------
Ecoli4_IN       ------GKYTAQDATTAQKGIVQLSSATNSTSEmLAATPKSVKAAYDlANGKYTaqDAtT
Escoli5_IN      ------GKYTAQDATTAQKGIVQLSSATNSTSEmLAATPKSVKAAYDlANGKYTaqDAtT
Ecoli3_IN       -----------------------LSSATNSTSEtLAATPKAVKAAnDnAN----------
Ecoli_10_I      ayDlANGKYTAQDATTAQKGIVQLSSATNSTSEmLAATPKSVKAAYDlANGKYTaqDAtT
Ecoli11_IN      ayDlANGKYTAQDATTAQKGIVQLSSATNSTSEmLAATPKSVKAAYDlANGKYTaqDAtT
Ecoli12_IN      ayDlANGKYTAQDATTAQKGIVQLSSATNSTSEmLAATPKSVKAAYDlANGKYTaqDAtT
Ecoli8_IN       ayDlANGKYTAQDATTAQKGIVQLSSATNSTSEmLAATPKSVKAAYDlANGKYTaqDAtT

GOS_142201      ------------------------------------------------------------
Lactobacil      tnAGtvtaSdgltalAdmqqklstlkEAssSlddlssNlnnlaS----------------
Roseovariu      ------------------------------------------------------------
Xanthomona      rlAltsgsSlSvgalTlapgAfvdvalgApSttrllevngnltlDgtlnitdiggfgtgv
Verminephr      ------------------------------------------------------------
Opitutus_t      tnAattasnAASNaATtaanAannAatnAsNaAgsaaNaatVaSNaa-------------
Shigella_s      dipGkdtftknigacrafgg----------------------------------------
Ecoli9_IN       dipGkdtftknigacraf------------------------------------------
Ecoli7_IN       rtAGdpmnncefNgfvmpggwTdRgryAygmfwqYQNNERAIHS----------------
Ecoli13_IN      rtAGdpmnncefNgfvmpggwTdRgryAygmfwqYQNNERAIHS----------------
Ecoli14_IN      rtAGdpmnncefNgfvmpggwTdRgryAygmfwqYQNNERAIHS----------------
Ecoli6_IN       rtAGdpmnncefNgfvmpggwTdRgryAygmfwqYQNNERAIHS----------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       AqkGivQlSSATNSASetlAeTPKAvKAANDnAN--------------------------
Escoli5_IN      AqkGivQlSSATNSASetlAATPKAvKAANDnAN--------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      AqkGivQlSSATNSASetlAATPKAvKAANNnAN--------------------------
Ecoli11_IN      AqkGivQlSSATNSASetlAATPKAvKAANNnAN--------------------------
Ecoli12_IN      AqkGivQlSSATNSASetlAATPKAvKAANNnAN--------------------------
Ecoli8_IN       AqkGivQlSSATNSASetlAATPKAvKAANNnAN--------------------------

GOS_142201      -------------------------------taasfptfNr----gLkaTaKqfptLtra
Lactobacil      -------------------------vQqvaSaaikGnqaSlqAIqqIsqaaakMqqMeAa
Roseovariu      -------------------------iervplaQvfwpnLAl-----MlsmavaLpfvwqp
Xanthomona      yrlidytgaltnngmliglfpgsvdisqlalQtaiGQqLNl-----VvtapgsIvqfwdg
Verminephr      -------------------------------KtsflRhLSS-----MlgkdasqeaLtea
Opitutus_t      -------------------------anasnAasqsantaTSAAtNaaqtataaastatqa
Shigella_s      ---------------------------svStttgnwttaqf--IewLdsqgafnhpywmc
Ecoli9_IN       -------------------------ggsvStttgnwttaqf--IewLdsqgafnhpywmc
Ecoli7_IN       -------------------------imMsnkgddlrsvfyv-----dgaafpvfafIedg
Ecoli13_IN      -------------------------imMsnkgddlrsvfyv-----dgaafpvfafIedg
Ecoli14_IN      -------------------------imMsnkgddlrsvfyv-----dgaafpvfafIedg
Ecoli6_IN       -------------------------imMsnkgddlrsvfyv-----dgaafpvfafIedg
Ecoli1_IN       -------------------------ssVqktgdtmGgqLkiStINaLrifnQafGlIfrr
Ecoli2_IN       -------------------------ssVqktgdtmGgqLkiStINaLrifnQafGlIfrr
Ecoli4_IN       -------------------------gRVpSARKvnGKaLSA----dItlTpKdIGtLnSt
Escoli5_IN      -------------------------gRVpSARKvnGKaLSA----dItlTpKdIGtLnSt
Ecoli3_IN       -------------------------gRVpSnRKvnGKaLTA----dItlTpKdIGtLnSv
Ecoli_10_I      -------------------------gRVpSARKvnGKaLSA----dItlTpKdIGtLnSt
Ecoli11_IN      -------------------------gRVpSARKvnGKaLSA----dItlTpKdIGtLnSt
Ecoli12_IN      -------------------------gRVpSARKvnGKaLSA----dItlTpKdIGtLnSt
Ecoli8_IN       -------------------------gRVpSARKvnGKaLSA----dItlTpKdIGtLnSt

GOS_142201      iq--------SagkG----------------------------------ikAgaMni---
Lactobacil      va--------klSsG----------------------sseltnNlsklqtGANtLaTsln
Roseovariu      mp--------ladlG-------------------------WvvtyaVfLfaArwLsvE--
Xanthomona      vq--------TtanGtidggngtwgaatnwtgqdgsvnstWqgNfaVfsgaAgtVgvQgt
Verminephr      vs--------ylqkr---------------------------------------------
Opitutus_t      ag--------NaSna----------------------astaattaanaasnvgaaaSnas
Shigella_s      kg--------SwSyG---------------------------nNkiITdtGcgnIhlaga
Ecoli9_IN       kg--------SwSyG---------------------------nNkiITdtGcgnIhlaga
Ecoli7_IN       ls--------isapG-----------------------AdlvvNdTtykfGATnpaTEci
Ecoli13_IN      ls--------isapG-----------------------AdlvvNdTtykfGATnpaTEci
Ecoli14_IN      ls--------isapG-----------------------AdlvvNdTtykfGATnpaTEci
Ecoli6_IN       ls--------isapG-----------------------AdlvvNdTtykfGATnpaTEci
Ecoli1_IN       Se--------d-------------------------------------------------
Ecoli2_IN       SedhlhliptNegeG----------------------enGdigSlrpfsinlrSglvsig
Ecoli4_IN       Tm--------SfSgG-----------------------AGWFklaTVTMpqASSVv----
Escoli5_IN      Tm--------SfSgG-----------------------AGWFklaTVTMpqASSVv----
Ecoli3_IN       Ti--------SfSgG-----------------------AGWFklaTVTMpqASSIv----
Ecoli_10_I      Tm--------SfSgG-----------------------AGWFklaTVTMpqASSVv----
Ecoli11_IN      Tm--------SfSgG-----------------------AGWFklaTVTMpqASSVv----
Ecoli12_IN      Tm--------SfSgG-----------------------AGWFklaTVTMpqASSVv----
Ecoli8_IN       Tm--------SfSgG-----------------------AGWFklaTVTMpqASSVv----

GOS_142201      ------------------------------gkkLsaGrka--------------------
Lactobacil      qytaGvAtASaG------------------askLntGvss--------------------
Roseovariu      ------------------------------aLrLlp------------------------
Xanthomona      qnigGlqfvtdGyqltdvgAgaLAvTnplTaIrvdpGAtaSIGvaitgiggvekldtgtl
Verminephr      ------------------------------sIvaidGdkv--------------------
Opitutus_t      hAasnaAqAAtdaaaqaasAanVAaTaasTaVdaaasAGnlVtNvsstasNvasnav---
Shigella_s      ------------------------------vIevmGiksamtirittpt-----------
Ecoli9_IN       ------------------------------vIevmGiksamtirittpt-----------
Ecoli7_IN       AADV--------------------------iLdfksGrGFyeshslivNDNLSCKKL---
Ecoli13_IN      AADV--------------------------iLdfksGrGFyeshslivNDNLSCKKL---
Ecoli14_IN      AADV--------------------------iLdfksGrGFyeshslivNDNLSCKKL---
Ecoli6_IN       AADV--------------------------iLdfksGrGFyeshslivNDNLSCKKL---
Ecoli1_IN       ------------------------------hLhLip------------------------
Ecoli2_IN       n-----------------------------gLkvgGsvtgNL------------------
Ecoli4_IN       ------------------------------sItLiGGAGFNVGSPQQ-------------
Escoli5_IN      ------------------------------sItLiGGAGFNVGSPQQ-------------
Ecoli3_IN       ------------------------------yIaLiGGAGYNVGSPQQ-------------
Ecoli_10_I      ------------------------------sItLiGGAGFNVGSPQQ-------------
Ecoli11_IN      ------------------------------sItLiGGAGFNVGSPQQ-------------
Ecoli12_IN      ------------------------------sItLiGGAGFNVGSPQQ-------------
Ecoli8_IN       ------------------------------sItLiGGAGFNVGSPQQ-------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      vlsgpntytggtalsggslvlgnaqalgtgtltaaagttldtnqalavgnavvlngaltv
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      --------------------------------AlapggaMaAG-----------------
Lactobacil      --------------------------------AstgasqLssGlGslqgqiptltsaiSa
Roseovariu      --------------------------------AyVAtplMn-------------------
Xanthomona      pgsndlaltgaisgagslikngtstlaldgsnSyLggsVLnAGtlavgsnTalgagalSv
Verminephr      --------------------------------tyV-------------------------
Opitutus_t      --------------------------------SvaADsaaKAataasdaaTaAstaaTka
Shigella_s      --------------------------------tssgggttnAqftyinhgTdyspgwrrD
Ecoli9_IN       --------------------------------tstgggttnAqftyinhgTdyspgwrrD
Ecoli7_IN       --------------------------------fatdEiVaRgGNqirmiggeygalwrND
Ecoli13_IN      --------------------------------fatdEiVaRgGNqirmiggeygalwrND
Ecoli14_IN      --------------------------------fatdEiVaRgGNqirmiggeygalwrND
Ecoli6_IN       --------------------------------fatdEiVaRgGNqirmiggeygalwrND
Ecoli1_IN       --------------------------------tnegEgkM--------------------
Ecoli2_IN       --------------------------------tgnADtatKiktarkiggvafdgsadiN
Ecoli4_IN       --------------------------------AgISElVLRAGNGnpkgiTGAlWqRTSt
Escoli5_IN      --------------------------------AgISElVLRAGNGnpkgiTGAlWqRTSt
Ecoli3_IN       --------------------------------AgISElVLRAGNGkpkgiTGAlWkRTAv
Ecoli_10_I      --------------------------------AgISElVLRAGNGnpkgiTGAlWqRTSt
Ecoli11_IN      --------------------------------AgISElVLRAGNGnpkgiTGAlWqRTSt
Ecoli12_IN      --------------------------------AgISElVLRAGNGnpkgiTGAlWqRTSt
Ecoli8_IN       --------------------------------AgISElVLRAGNGnpkgiTGAlWqRTSt

GOS_142201      ------------------------------------------------------------
Lactobacil      ldagt-------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      lgTStlsnaialalgnnvnlganltiasadnlaltgvltgagaltktglgtlTlsgsnlf
Verminephr      ------------------------------------------------------------
Opitutus_t      seaat-------------------------------------------------------
Shigella_s      ynSrn-------------------------------------------------------
Ecoli9_IN       ynSrn-------------------------------------------------------
Ecoli7_IN       GaktY-------------------------------------------------------
Ecoli13_IN      GaktY-------------------------------------------------------
Ecoli14_IN      GaktY-------------------------------------------------------
Ecoli6_IN       GaktY-------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       lpgvn-------------------------------------------------------
Ecoli4_IN       GfTNF-------------------------------------------------------
Escoli5_IN      GfTNF-------------------------------------------------------
Ecoli3_IN       GlTNF---------------------------------------awintsgdTydiyvei
Ecoli_10_I      GfTNF-------------------------------------------------------
Ecoli11_IN      GfTNF-------------------------------------------------------
Ecoli12_IN      GfTNF-------------------------------------------------------
Ecoli8_IN       GfTNF-------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      -----------------------------------------nkLTNnsaalvsGanqlaS
Roseovariu      ------------------------------------------------------------
Xanthomona      sgpvSvqSgvlaTSSTgtlGttASvdvAAGaglSlgSntAlnsVTglgNVaivGgntlql
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------TAATAAsnavSSaastAGsaatqsAeIAaqaaqtastsaakaadAatA
Shigella_s      -----------------------------------------kptasEigalpsGgTAvsS
Ecoli9_IN       -----------------------------------------kptasEigalpsGgTAvsS
Ecoli7_IN       -----------------------------------------llLTNQgDVygGwNTlrPf
Ecoli13_IN      -----------------------------------------llLTNQgDVygGwNTlrPf
Ecoli14_IN      -----------------------------------------llLTNQgDVygGwNTlrPf
Ecoli6_IN       -----------------------------------------llLTNQgDVygGwNTlrPf
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       -------------------------------------------aTgnqNttgnaaTAtkl
Ecoli4_IN       -----------------------------------------------------------A
Escoli5_IN      -----------------------------------------------------------A
Ecoli3_IN       gnyaTsvNihwdcttnASvsiytSptySAskpsSvtggVvytmysshqkptpsdigAlPt
Ecoli_10_I      -----------------------------------------------------------A
Ecoli11_IN      -----------------------------------------------------------A
Ecoli12_IN      -----------------------------------------------------------A
Ecoli8_IN       -----------------------------------------------------------A

GOS_142201      ------------------------------------------------------------
Lactobacil      gaNqva-----------------------------SGssqvttgltTLkg----------
Roseovariu      ------------------------------------------------------------
Xanthomona      gVNnassTfagslsdAgNLekvgtgtlqLtGtsaIgGttqitagtLdVdgtlGSTggltv
Verminephr      ------------------------------------------------------------
Opitutus_t      aadhatkaastaTsaAaTvaaNAaTtasqaGrsasNaatqatqsatkaagSaaSTatq--
Shigella_s      vnlaSkGrVaalTdnTqgaaGleLyevynnGyptayGniihlkgmtaV------------
Ecoli9_IN       vnlsSkGrVtalTdnTqgatGleLyevynnGyptayGniihlkgmtaV------------
Ecoli7_IN       aIdnatGeLvigTKlSaSLNGNALTATkLqtpRrVSGVefdGSkdITLTAAh--------
Ecoli13_IN      aIdnatGeLvigTKlSaSLNGNALTATkLqtpRrVSGVefdGSkdITLTAAh--------
Ecoli14_IN      aIdnatGeLvigTKlSaSLNGNALTATkLqtpRrVSGVefdGSkdITLTAAh--------
Ecoli6_IN       aINnatGeLvigTKlSaSLNGNALTATkLqtpRlVSGVefdGSrdITLTAAh--------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       qaartingVsfdgsaNiTLtpSnIgAlaLtGgtlsgGLtaaGevisrsangl--------
Ecoli4_IN       wVNTSgdTyd--------------------------------------------------
Escoli5_IN      wVNTSg----dtydiyvAIgnyAtgvniqwdytsnAsVtihtSpaySankpeG-------
Ecoli3_IN       tggTisGpLsvtdgiTgALkGNAdTATkLaaapkINGVkfdGSadInLTSen--------
Ecoli_10_I      wVNTSgdTyd--------------------------------------------------
Ecoli11_IN      wVNTSgdTyd--------------------------------------------------
Ecoli12_IN      wVNTSgdTyd--------------------------------------------------
Ecoli8_IN       wVNTSgdTyd--------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      gtggtlsgsgvvdaavtvndggrlvvtsgsllttgslsfapnatldaflgipsqtgvlav
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      --------------------gYgrqmaGaaaVaaggmgvSgle-----------------
Lactobacil      --------------------qvptlvnaisaLDagtnklvanSaTlNSgaSqvAsGnEqm
Roseovariu      -------------------------lqfvwmVvIgWfgfgeip-----------------
Xanthomona      ngdltldgtlnItdiggfgTgvyrliDytgaLinngmgfgtlpgTvdptqltlqTslaqq
Verminephr      ------------------------------------------------------------
Opitutus_t      --------------------aasnaaksvdaVasnatqaAstAadaaTtatDsATGaaSn
Shigella_s      ---------------------------GegeLlIgWsgTSgAh-----------------
Ecoli9_IN       ---------------------------GegeLlIgWsgTSgAh-----------------
Ecoli7_IN       -----------VAAFARRATdtyAdaDGgvpwNaEsgayNvtrsgdsyilvNfyTGvgSC
Ecoli13_IN      -----------VAAFARRATdtyAdaDGgvpwNaEsgayNvtrsgdsyilvNfyTGvgSC
Ecoli14_IN      -----------VAAFARRATdtyAdaDGgvpwNaEsgayNvtrsgdsyilvNfyTGvgSC
Ecoli6_IN       -----------VAAFARRATdtyAdaDGgvpwNaEsgayNvtrsgdtyilvNfyTGvgSC
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       -----------riAygnygffirndgsntyfMltDsgnSlgthnSlrpfiiSnhTGnvti
Ecoli4_IN       --------------------iYvAignyatgVNIQWdyTSnASvTiHTspAysAnkpEgl
Escoli5_IN      ------------------------ltDGtvyslytpse----------------------
Ecoli3_IN       -----------IgAFARRsTgayAdsDGavpwNaEsgayNvtrsgdsyilvNfyTGvgSC
Ecoli_10_I      --------------------iYvAignyatgVNIQWdyTSnASvTiHTspAysAnkpEgl
Ecoli11_IN      --------------------iYvAignyatgVNIQWdyTSnASvTiHTspAysAnkpEgl
Ecoli12_IN      --------------------iYvAignyatgVNIQWdyTSnASvTiHTspAysAnkpEgl
Ecoli8_IN       --------------------iYvAignyatgVNIQWdyTSnASvTiHTspAysAnkpEgl

GOS_142201      ------------------------------------------------------------
Lactobacil      yTsiqslvgQmqtLqdglktasd-GtktintgVgsAnsyltglkdsaAaktyyvpks---
Roseovariu      ------------------------------------------------------------
Xanthomona      vnvvLlAsgntaqfwdgAqtvan-GiaDggtgVwTAgTtnwtsidgltnsdWqnsfavfG
Verminephr      ------------------------------------------------------------
Opitutus_t      aaevaataadqaataaaTasDaaataaDaaATaasNaTkaavavtgtAasageaastatk
Shigella_s      ----------apafirsrrdttd-anwspWAqlYTSahpPaEFYPvGAPIPWPSDTVPSG
Ecoli9_IN       ----------apafirsrrdttd-anwspWAqlYTSahpPaEFYPvGAPIPWPSDTVPSG
Ecoli7_IN       RTlQMKAHYRNGGLfyRSSRDGy-GfeEdWAeVYTSknlPpEsYPvGAPIPWPSDTVPSG
Ecoli13_IN      RTlQMKAHYRNGGLfyRSSRDGy-GfeEdWAeVYTSknlPpEsYPvGAPIPWPSDTVPSG
Ecoli14_IN      RTlQMKAHYRNGGLfyRSSRDGy-GfeEdWAeVYTSknlPpEsYPvGAPIPWPSDTVPSG
Ecoli6_IN       RTlQMKAHYRNGGLfyRSSRDGy-GfeEdWAeVYTSknlPpEsYPvGAPIPWPSDTVPSG
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       aTklnasggitGsLsgnAStatklqtartingVkfdgsaniEafPPGvPLPWPSDTpPaG
Ecoli4_IN       td----------------------------gTVYslyTpseQFYPPGAPIPWPSDTVPSG
Escoli5_IN      -----------------------------------------QFYPPGAPIPWPSDTVPSG
Ecoli3_IN       RTlQMKAHYRNrGLfyRSSRDGy-GfeEdWAeVYTSknlPpEsYPvGAPIPWPSDTVPSG
Ecoli_10_I      td----------------------------gTVYslyTpseQFYPPGAPIPWPSDTVPSG
Ecoli11_IN      td----------------------------gTVYslyTpseQFYPPGAPIPWPSDTVPSG
Ecoli12_IN      td----------------------------gTVYslyTpseQFYPPGAPIPWPSDTVPSG
Ecoli8_IN       td----------------------------gTVYslyTpseQFYPPGAPIPWPSDTVPSG

GOS_142201      ------------------------------------------------------------
Lactobacil      -------------------tlkGkTyksalkaymsgnnhatkltivlnenpssKsAmdkv
Roseovariu      ------------------------------------------------------------
Xanthomona      gtagnvgvqgtqgivgmqfitdGyvlsdagagaLsAnaPttIIrvdpGvTgtidvAiGgt
Verminephr      ------------------------------------------------------------
Opitutus_t      av-----------------aqaaeTagtaAnaasdAAssaadaaDsaakaadaaagaatv
Shigella_s      YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli9_IN       YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASdRA
Ecoli7_IN       YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDlRGWTIKGKPASGRA
Ecoli13_IN      YA-----------------LMQGQaFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli14_IN      YA-----------------LMQGQaFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli6_IN       YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       YA-----------------iMQGQTFDKaAYPKLAiAYPSGVIPDMRGWTIKGKPASGRA
Ecoli4_IN       YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Escoli5_IN      YA-----------------LMQGQTFDKSAYPKLAtAYPSGVIPDMRGWTIKGKPASGRA
Ecoli3_IN       YA-----------------LMQGQTFnKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli_10_I      YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli11_IN      YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli12_IN      YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA
Ecoli8_IN       YA-----------------LMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRA

GOS_142201      ----------------------emmkggSflaG---------------------------
Lactobacil      taiQkQatnsl---------kgtaldgatiaiGgqTsetnDtqaiassdfvrTaaimlvG
Roseovariu      --------------------avgtmlgvtlvvGsglwliFDea-----------------
Xanthomona      galQklDtgtlvlsgSntyTggtalgggSlvLGnaqalgtgtlTtaagttldTnqaltvG
Verminephr      ------------------------------------------------------------
Opitutus_t      aaSatQsGaKaatqaSqvaSnTstsSAaSahqaTqTaqvaqtarnasragttvashtass
Shigella_s      VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli9_IN       VLSQEQDGIKS---------HTHSASvSSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli7_IN       VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli13_IN      VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli14_IN      VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli6_IN       VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli4_IN       VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Escoli5_IN      VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli3_IN       VLSQEQDGIKS---------HTHSASASSTDLGTKTTSSFDYGTKSTNN-----------
Ecoli_10_I      VLSQEQDGIKS---------HTHSASASSTDLGTKnTSSFDYGTKSTNN-----------
Ecoli11_IN      VLSQEQDGIKS---------HTHSASASSTDLGTKnTSSFDYGTKSTNN-----------
Ecoli12_IN      VLSQEQDGIKS---------HTHSASASSTDLGTKnTSSFDYGTKSTNN-----------
Ecoli8_IN       VLSQEQDGIKS---------HTHSASASSTDLGTKnTSSFDYGTKSTNN-----------

GOS_142201      ------------------------------------------------------------
Lactobacil      ilIALmvitrsvl-----------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      ntVvLdgAltlpgSndlaltgaisgagslikngtstltlsgnnsyaggtvlndgtlavag
Verminephr      ------------------------------------------------------------
Opitutus_t      aptAmasApatatTv---------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      ntalgsgtlsvtgnsvlsnpvaaalsnavtlgaaltvdnpadltlagnisgsgaliktnt
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      gtltlggsntysggttltagtlvgdsaslqgaivdnatlafnqvadgvfngsitgtgtvv
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      kdgvgalllngangytggtsivagtligdtaslqgdiannaalvfnqttdgvyagtvsgt
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      gslekagagvlqllgpntytggtvisagslvgnttslqgdvvdnatlifdqatddtfagt
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      vsgtgsvikqgtgaltlvapntysggttvasgsligttaslqgdivnnavlvfdqagdgt
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------qpfyilgtlllayimslsitr----ilsnW
Roseovariu      ------------------------------------------------------------
Xanthomona      yagnvsgsgalikqgagaltlqggnsylggTtinagtLiGnTaSlqgditn----Nstlt
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------TplfTsptqGtaaltsapasagsqlaGsst
Shigella_s      ------------------------------TGAHTHSLSGSTsSAGaHqHs----qtgpr
Ecoli9_IN       ------------------------------TGAHTHSvSGtaaSAGaHTHs---------
Ecoli7_IN       ------------------------------TGAHTHSLSGSTnaAGnHsHrdgrrfnpsv
Ecoli13_IN      ------------------------------TGAHTHSvSGSTnSAGaHTHslANvNtasa
Ecoli14_IN      ------------------------------TGAHTHSvSGSTnSAGaHTHslANvNtasa
Ecoli6_IN       ------------------------------TGAHTHSLSGSTGSAGvHTHG----NGiRW
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------TGAHTHSLSGSTGSAGvHTHG----NGiRW
Ecoli4_IN       ------------------------------TGAHTHSLSGSTsSAGaHsHvdgrrfntst
Escoli5_IN      ------------------------------TGAHTHSvSGtaaSAGnHTHs---------
Ecoli3_IN       ------------------------------TGAHTHSLSGSTGSAGvHTHG----NGiRW
Ecoli_10_I      ------------------------------TGAHTHSLSGSTGSAGdHTHG----NGiRW
Ecoli11_IN      ------------------------------TGAHTHSLSGSTGSAGdHTHG----NGiRW
Ecoli12_IN      ------------------------------TGAHTHSLSGSTGSAGdHTHG----NGiRW
Ecoli8_IN       ------------------------------TGAHTHSLSGSTGSAGdHTHG----NGiRW

GOS_142201      ------------------------------------------------------------
Lactobacil      flGqsmltwntpfFGfimlIaLGvdYsiflmmkYrefdnsAatpStrivrasaviGAvvl
Roseovariu      ------------------------------------------------------------
Xanthomona      flqnsagtysgrlFGvgrfIkLG-ngTlVltAnsgsftGpTevnAgtlsVgnQanpgaal
Verminephr      ------------------------------------------------------------
Opitutus_t      fnGasSsatqqtatslspsadvaaatTFsapAtapaagvTAaapTstpaVaTptvtAtpp
Shigella_s      tnsGsqptgmFpa-----------GsTqVsgTnqvgisGsltsgTsqwvgKssSeGnHTH
Ecoli9_IN       --------------------------mtfvSggssgaPGsgSpdyskysVnTsSAGAHTH
Ecoli7_IN       fkdtyqygytss------------------gqntwgvqGsvgmsTgWla-nTstdGnHsH
Ecoli13_IN      nsGaGSAst---------------rlsvVhNqnY----------------aTsSAGAHTH
Ecoli14_IN      nsGaGSAst---------------rlsvVhNqnY----------------aTsSAGAHTH
Ecoli6_IN       pGGGGSALsFYD----------GGGFTYVqNSQYqvsPeTSSyrSYYqRIQTQSAGAHTH
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       pGGGGSALaFYD----------GGGFTYVqNSQYqvsPeTSSyrSYYqRIQTQSAGAHTH
Ecoli4_IN       fkdtyq------------------ygstavgSnsyqvqGavglgigtma-nTnSAGAHTH
Escoli5_IN      -------------------------vTgasavsqwsqnGsvhkvvsaasVnTsaAGAHTH
Ecoli3_IN       pGGGGSALaFYD----------GGGFTYVqNSQYqvsPGTSSyrSYYqRIQTQSAGAHTH
Ecoli_10_I      pGGGGSALaFYD----------GGGFTYVqDSQYqvsPGTSSrrSYYqRIQTQSAGAHTH
Ecoli11_IN      pGGGGSALaFYD----------GGGFTYVqDSQYqvsPGTSSrrSYYqRIQTQSAGAHTH
Ecoli12_IN      pGGGGSALaFYD----------GGGFTYVqDSQYqvsPGTSSrrSYYqRIQTQSAGAHTH
Ecoli8_IN       pGGGGSALaFYD----------GGGFTYVqDSQYqvsPGTSSrrSYYqRIQTQSAGAHTH

GOS_142201      ------------------------------------------------------------
Lactobacil      SaalilsgtfAalmpsGvltliqvamvviiGliIlVfAipti------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      ggpvTvgpgGtltgsgsIGsllsSgtvvigGtsgTisvgGNAtfApgstlqvtpgaggVv
Verminephr      ------------------------------------------------------------
Opitutus_t      pavdTpAStnsftvatpvapsapapsatdtssppsftggslrplAtarpgdagelgsnLd
Shigella_s      SLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli9_IN       SvSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli7_IN       SLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli13_IN      SLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli14_IN      SLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli6_IN       SLSGTAASsGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       SLSGTAASsGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli4_IN       SLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Escoli5_IN      SvSGTvASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli3_IN       SLSGTAASsGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli_10_I      SLSGTAASsGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli11_IN      SLSGTAASsGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli12_IN      SLSGTAASsGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------
Ecoli8_IN       SLSGTAASsGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNA------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      TpvTVggaatvatgttleLVratdlplfteipvisasggltgqftniisdyafidpnvst
Verminephr      ------------------------------------------------------------
Opitutus_t      ThnTVqqlvradyvrlkaILsq--------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      --------------------atgnKd----------------------------------
Lactobacil      --------------------ipslirltypltdRmedettAeKQtKRnrq----------
Roseovariu      --------------------vtqerrssaNa-----------------------------
Xanthomona      tatvlsvtfgrntvgmvdavttpnqqstaaavdgLpNtspiyQavvRlpndpeavrtAfa
Verminephr      ------------------------------------------------------------
Opitutus_t      --------------------vkqniNVtatsvaRLqNaifvqnQwKReqlkrsdelrSve
Shigella_s      --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli9_IN       --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli7_IN       --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli13_IN      --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli14_IN      --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli6_IN       --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli1_IN       --------------------EtsVh-----------------------------------
Ecoli2_IN       --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli4_IN       --------------------ENTVKNIAFNYIVRLA------------------------
Escoli5_IN      --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli3_IN       --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli_10_I      --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli11_IN      --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli12_IN      --------------------ENTVKNIAFNYIVRLA------------------------
Ecoli8_IN       --------------------ENTVKNIAFNYIVRLA------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      slsgeshAstAtAlLdSRflnsgignhlrgdSrdtlvgdtmvwItgrslpnrvdgnadav
Verminephr      ------------------------------------------------------------
Opitutus_t      pqpllrpAddAfAqVrAReailresiaaasmAppesaaaaqakVaadyeayaravtvaes
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      svRredngvMAgAerrfgessvvgialgnqdikswsrewgdradvdgthagvygrtdwsa
Verminephr      ------------------------------------------------------------
Opitutus_t      faRaasraaLAnAmp---------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      fslqgvvdyasynidstrrvmlpgvlaerlsssydatavtvsleaawnlrtkgavyspfv
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      aadytrlktdgfterggisalsvdrasdtfstatlgvrghwdldqkaglyasagwqhafg
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ------------------------------------------------------------
Lactobacil      ------------------------------------------------------------
Roseovariu      ------------------------------------------------------------
Xanthomona      drsvqrsagfvgtasefsvqsvtlaknavvgelgvslitspssrfalslqgldgdgqtay
Verminephr      ------------------------------------------------------------
Opitutus_t      ------------------------------------------------------------
Shigella_s      ------------------------------------------------------------
Ecoli9_IN       ------------------------------------------------------------
Ecoli7_IN       ------------------------------------------------------------
Ecoli13_IN      ------------------------------------------------------------
Ecoli14_IN      ------------------------------------------------------------
Ecoli6_IN       ------------------------------------------------------------
Ecoli1_IN       ------------------------------------------------------------
Ecoli2_IN       ------------------------------------------------------------
Ecoli4_IN       ------------------------------------------------------------
Escoli5_IN      ------------------------------------------------------------
Ecoli3_IN       ------------------------------------------------------------
Ecoli_10_I      ------------------------------------------------------------
Ecoli11_IN      ------------------------------------------------------------
Ecoli12_IN      ------------------------------------------------------------
Ecoli8_IN       ------------------------------------------------------------

GOS_142201      ----------
Lactobacil      ----------
Roseovariu      ----------
Xanthomona      ggqvtwevsf
Verminephr      ----------
Opitutus_t      ----------
Shigella_s      ----------
Ecoli9_IN       ----------
Ecoli7_IN       ----------
Ecoli13_IN      ----------
Ecoli14_IN      ----------
Ecoli6_IN       ----------
Ecoli1_IN       ----------
Ecoli2_IN       ----------
Ecoli4_IN       ----------
Escoli5_IN      ----------
Ecoli3_IN       ----------
Ecoli_10_I      ----------
Ecoli11_IN      ----------
Ecoli12_IN      ----------
Ecoli8_IN

Protein Domains

PROTOCOL


InterPro, default parameters at EBI



RESULTS ANALYSIS


The protein has no domains.

RAW RESULTS

Phylogeny

PROTOCOL


a) Phylogeny.fr / BioNJ method / out group: Verminephrobacter eiseniae (b-proteobacteria); Xanthomonas oryzae (g-proteobacteria); Roseovarius sp.(a-proteobacteria); Lactobacillus plantarum (firmicutes); Opitutus terrae (verrucomicrobia);


b) Phylogeny.fr / PhyML method / no bootstrap / default substitution model / out group: Verminephrobacter eiseniae (b-proteobacteria); Xanthomonas oryzae (g-proteobacteria); Roseovarius sp.(a-proteobacteria); Lactobacillus plantarum (firmicutes); Opitutus terrae (verrucomicrobia);



RESULTS ANALYSIS


Could not perform the trees, since it failed to obtain alignments between sequences by Gblocks:


Guide tree (WARNING: tree given as an indication, NOT a significant phylogenetic tree)


Warning: Gblocks returned the following warnings:

GBlocks did not find any relevant alignment site and the workflow has been stopped.

This usually means the alignment is not reliable.If you believe it is, you can either try again your phylogeny workflow without GBlocks or with less stringent GBlocks parameters. If you believe the alignment found is bad, you can try to improve it either by yourself using alignment editors or with another alignment program.


Error: BioNJ input: provided data is not a valid alignment: supposed aligned sequences have different length.


RAW RESULTS

Taxonomy report

PROTOCOL


BLASTx versus NR, NCBI default parameters apart from "Number of descriptions_1000"


RESULTS ANALYSIS


In the lineage report we can conclude that the likelihood of the unknown sequence belong to the taxonomic groups presented are relatively low, since it does not show homologies with biological significance for these sequences. In the selection of the in-groups and the out-groups, the in-groups were chosen 15 of them belonging to E. coli, enterobacteria, becuse in spite of their high e-value, in all, were the 15th lowest. The out-groups were selected in five sequences that did not belong to the group of E. coli. In multiple alignments was chosen sequences belonging to the in-group and out-group chosen previously.



In group: Enterobacteria


emb|CBG34200.1| Ecoli1 0.006 Escherichia coli (enterobacteria)

ref|ZP_06648468.1| Ecoli2 0.024 Escherichia coli (enterobacteria)

ref|YP_002412113.1| Ecoli3 0.024 Escherichia coli (enterobacteria)

ref|YP_002402491.1| Ecoli4 0.032 Escherichia coli (enterobacteria)

ref|YP_003227698.1| Ecoli5 0.054 Escherichia coli (enterobacteria)

ref|YP_003220558.1| Ecoli6 0.12 Escherichia coli (enterobacteria)

ref|YP_002412549.1| Ecoli7 0.12 Escherichia coli (enterobacteria)

ref|YP_001462390.1| Ecoli8 0.21 Escherichia coli (enterobacteria)

ref|YP_002292730.1| Ecoli9 0.35 Escherichia coli (enterobacteria)

ref|YP_311286.1| Ssonnei 0.35 Shigella sonnei (enterobacteria)

ref|ZP_04535755.1| Ecoli10 0.46 Escherichia coli (enterobacteria)

ref|YP_001725078.1| Ecoli11 0.46 Escherichia coli (enterobacteria)

ref|YP_540528.1| Ecoli12 0.46 Escherichia coli (enterobacteria)

ref|YP_852428.1| Ecoli13 0.46 Escherichia coli (enterobacteria)

ref|YP_002391150.1| Ecoli14 0.46 Escherichia coli (enterobacteria)


Out group: other bacteria


ref|YP_997212.1| Veiseniae 0.46 Verminephrobacter eiseniae (b-proteobacteria)

ref|ZP_02243355.1| Xoryzae 0.79 Xanthomonas oryzae (g-proteobacteria)

ref|ZP_01445204.1| Roseovarius sp. 1.0 Roseovarius sp. (a-proteobacteria)

ref|NP_784110.1| Lplantarum 1.3 Lactobacillus plantarum (firmicutes)

ref|YP_001818573.1| Oterrae 8.7 Opitutus terrae (verrucomicrobia)


RAW RESULTS

Lineage Report

cellular organisms
. Bacteria            [bacteria]
. . Proteobacteria      [proteobacteria]
. . . Gammaproteobacteria [g-proteobacteria]
. . . . Enterobacteriaceae  [enterobacteria]
. . . . . Escherichia         [enterobacteria]
. . . . . . Escherichia coli    [enterobacteria]
. . . . . . . Escherichia coli 042 --------------------------   45 2 hits [enterobacteria]    phage side tail fiber protein [Escherichia coli 042]
. . . . . . . Escherichia coli FVEC1412 .....................   43 2 hits [enterobacteria]    conserved hypothetical protein [Escherichia coli FVEC1412] 
. . . . . . . Escherichia coli UMN026 .......................   43 4 hits [enterobacteria]    Side tail fiber protein from bacteriophage origin [Escheric
. . . . . . . Escherichia coli 55989 ........................   43 4 hits [enterobacteria]    putative tail fiber protein [Escherichia coli 55989] >gi|21
. . . . . . . Escherichia coli O26:H11 str. 11368 ...........   42 2 hits [enterobacteria]    putative side tail fiber protein [Escherichia coli O26:H11 
. . . . . . . Escherichia coli O103:H2 str. 12009 ...........   41 2 hits [enterobacteria]    putative side tail fiber protein [Escherichia coli O103:H2 
. . . . . . . Escherichia coli E24377A ......................   40 2 hits [enterobacteria]    phage tail domain-containing protein [Escherichia coli E243
. . . . . . . Escherichia coli SE11 .........................   40 4 hits [enterobacteria]    putative phage tail fiber protein [Escherichia coli SE11] >
. . . . . . . Escherichia coli ATCC 8739 ....................   39 2 hits [enterobacteria]    prophage tail fibre domain-containing protein [Escherichia 
. . . . . . . Escherichia coli UTI89 ........................   39 2 hits [enterobacteria]    putative tail fiber protein [Escherichia coli UTI89] >gi|11
. . . . . . . Escherichia coli APEC O1 ......................   39 4 hits [enterobacteria]    putative tail fiber protein [Escherichia coli UTI89] >gi|11
. . . . . . . Escherichia coli S88 ..........................   39 4 hits [enterobacteria]    putative tail fiber protein [Escherichia coli UTI89] >gi|11
. . . . . . . Escherichia coli str. K-12 substr. W3110 ......   38 2 hits [enterobacteria]    predicted tail fiber protein [Escherichia coli str. K-12 su
. . . . . . . Escherichia coli str. K-12 substr. MG1655 .....   38 2 hits [enterobacteria]    predicted tail fiber protein [Escherichia coli str. K-12 su
. . . . . . . Escherichia coli str. K-12 substr. DH10B ......   38 2 hits [enterobacteria]    predicted tail fiber protein [Escherichia coli str. K-12 su
. . . . . . . Escherichia coli BW2952 .......................   38 2 hits [enterobacteria]    predicted tail fiber protein [Escherichia coli str. K-12 su
. . . . . . . Escherichia coli K-12 .........................   38 1 hit  [enterobacteria]    predicted tail fiber protein [Escherichia coli str. K-12 su
. . . . . . . Escherichia coli DH1 ..........................   38 1 hit  [enterobacteria]    predicted tail fiber protein [Escherichia coli str. K-12 su
. . . . . . . Escherichia coli O55:H7 str. CB9615 ...........   38 4 hits [enterobacteria]    PPE-repeat protein [Escherichia coli O55:H7 str. CB9615] >g
. . . . . . . Escherichia coli B088 .........................   36 2 hits [enterobacteria]    predicted protein [Escherichia coli B088] >gi|291325183|gb|
. . . . . . . Escherichia coli O157:H7 str. FRIK966 .........   36 1 hit  [enterobacteria]    putative tail fiber protein [Escherichia coli O157:H7 str. 
. . . . . . . Escherichia coli O157:H7 str. EC4024 ..........   36 1 hit  [enterobacteria]    putative tail fiber protein [Escherichia coli O157:H7 str. 
. . . . . . . Escherichia coli O157:H7 str. TW14359 .........   36 2 hits [enterobacteria]    putative tail fiber protein [Escherichia coli O157:H7 str. 
. . . . . . . Escherichia coli O157:H7 str. FRIK2000 ........   36 1 hit  [enterobacteria]    putative tail fiber protein [Escherichia coli O157:H7 str. 
. . . . . . . Escherichia coli O157:H7 EDL933 ...............   36 2 hits [enterobacteria]    putative membrane protein of prophage CP-933X [Escherichia 
. . . . . . . Escherichia coli O157:H7 str. Sakai ...........   36 2 hits [enterobacteria]    putative tail fiber protein [Escherichia coli O157:H7 str. 
. . . . . . . Escherichia coli SMS-3-5 ......................   35 2 hits [enterobacteria]    putative prophage side tail fiber protein [Escherichia coli
. . . . . . Escherichia sp. 3_2_53FAA -----------------------   39 2 hits [enterobacteria]    conserved hypothetical protein [Escherichia sp. 3_2_53FAA] 
. . . . . Shigella sonnei Ss046 -----------------------------   40 2 hits [enterobacteria]    phage protein-related [Shigella sonnei Ss046] >gi|73856344|
. . . . Xanthomonas oryzae pv. oryzicola BLS256 -------------   38 1 hit  [g-proteobacteria]  YapH protein [Xanthomonas oryzae pv. oryzicola BLS256]
. . . . Stenotrophomonas maltophilia K279a ..................   37 2 hits [g-proteobacteria]  putative histone H1-like protein [Stenotrophomonas maltophi
. . . . Stenotrophomonas maltophilia R551-3 .................   36 2 hits [g-proteobacteria]  hypothetical protein Smal_0268 [Stenotrophomonas maltophili
. . . . Mannheimia succiniciproducens MBEL55E ...............   36 2 hits [g-proteobacteria]  hypothetical protein MS0748 [Mannheimia succiniciproducens 
. . . Verminephrobacter eiseniae EF01-2 ---------------------   39 2 hits [b-proteobacteria]  hypothetical protein Veis_2448 [Verminephrobacter eiseniae 
. . . Burkholderia oklahomensis C6786 .......................   38 1 hit  [b-proteobacteria]  outer membrane protein, putative [Burkholderia oklahomensis
. . . Roseovarius sp. HTCC2601 ..............................   38 2 hits [a-proteobacteria]  hypothetical protein R2601_22557 [Roseovarius sp. HTCC2601]
. . . Curvibacter putative symbiont of Hydra magnipapillata .   38 1 hit  [b-proteobacteria]  hypothetical protein [Curvibacter putative symbiont of Hydr
. . Lactobacillus plantarum WCFS1 ---------------------------   38 4 hits [firmicutes]        transport protein [Lactobacillus plantarum WCFS1] >gi|28270
. . Veillonella parvula DSM 2008 ............................   37 2 hits [firmicutes]        Hemagluttinin domain protein [Veillonella parvula DSM 2008]
. . Bryantella formatexigens DSM 14469 ......................   37 2 hits [firmicutes]        conserved hypothetical protein [Bryantella formatexigens DS
. . Anaerococcus tetradius ATCC 35098 .......................   36 2 hits [firmicutes]        conserved hypothetical protein [Anaerococcus tetradius ATCC
. . Streptomyces hygroscopicus ATCC 53653 ...................   36 1 hit  [high GC Gram+]     putative glycine betaine ABC transporter substrate-binding 
. . Lactobacillus plantarum JDM1 ............................   36 2 hits [firmicutes]        penicillin binding protein 2A [Lactobacillus plantarum JDM1
. . Lactobacillus plantarum subsp. plantarum ATCC 14917 .....   36 2 hits [firmicutes]        penicillin binding protein 2A [Lactobacillus plantarum WCFS
. . Desulfotomaculum reducens MI-1 ..........................   36 2 hits [firmicutes]        TRAP transporter solute receptor TAXI family protein [Desul
. . Lactobacillus johnsonii ATCC 33200 ......................   35 2 hits [firmicutes]        conserved hypothetical protein [Lactobacillus johnsonii ATC
. . Opitutus terrae PB90-1 ..................................   35 2 hits [verrucomicrobia]   cell wall surface anchor family protein [Opitutus terrae PB
. . Rhodopirellula baltica SH 1 .............................   35 2 hits [planctomycetes]    signal peptide [Rhodopirellula baltica SH 1] >gi|32446419|e
. Drosophila virilis ----------------------------------------   36 2 hits [flies]             GJ16973 [Drosophila virilis] >gi|194146991|gb|EDW62710.1| G
. Ostreococcus lucimarinus CCE9901 ..........................   36 2 hits [green algae]       predicted protein [Ostreococcus lucimarinus CCE9901] >gi|14
. Candida albicans WO-1 .....................................   36 1 hit  [ascomycetes]       conserved hypothetical protein [Candida albicans WO-1]
. Enterocytozoon bieneusi H348 ..............................   36 8 hits [microsporidians]   phox homology domain protein [Enterocytozoon bieneusi H348]
. Mercurialis annua (herb mercury) ..........................   36 1 hit  [eudicots]          female sex protein [Mercurialis annua]
. Candida albicans SC5314 ...................................   36 4 hits [ascomycetes]       hypothetical protein CaO19.2676 [Candida albicans SC5314] >
. Tetraodon nigroviridis ....................................   35 1 hit  [bony fishes]       unnamed protein product [Tetraodon nigroviridis]

BLAST

PROTOCOL


BLASTx versus NR, NCBI default parameters apart from "Number of descriptions_1000"


RESULTS ANALYSIS


Was performed a BLASTx since with the BLASTp the results were bad, because the e-values were too high. Through the BLASTx was verified that the e-values were too high and that in terms of homologies they also weren’t good since the values of the identities were below 30%.


However, we can say that there are homologous proteins that are known but no biological significance.

RAW RESULTS


Sequences producing significant alignments:                       (Bits)  Value

emb|CBG34200.1|  phage side tail fiber protein [Escherichia co...  45.8    0.006
ref|ZP_06648468.1|  conserved hypothetical protein [Escherichi...  43.9    0.024
ref|YP_002412113.1|  Side tail fiber protein from bacteriophag...  43.9    0.024
ref|YP_002402491.1|  putative tail fiber protein [Escherichia ...  43.5    0.032
ref|YP_003227698.1|  putative side tail fiber protein [Escheri...  42.7    0.054
ref|YP_003220558.1|  putative side tail fiber protein [Escheri...  41.6    0.12 
ref|YP_002412549.1|  putative tail fiber protein [Escherichia ...  41.6    0.12 
ref|YP_001462390.1|  phage tail domain-containing protein [Esc...  40.8    0.21 
ref|YP_002292730.1|  putative phage tail fiber protein [Escher...  40.0    0.35 
ref|YP_311286.1|  phage protein-related [Shigella sonnei Ss046...  40.0    0.35 
ref|ZP_04535755.1|  conserved hypothetical protein [Escherichi...  39.7    0.46 
ref|YP_001725078.1|  prophage tail fibre domain-containing pro...  39.7    0.46 
ref|YP_997212.1|  hypothetical protein Veis_2448 [Verminephrob...  39.7    0.46 
ref|YP_540528.1|  putative tail fiber protein [Escherichia col...  39.7    0.46 
ref|ZP_02366038.1|  outer membrane protein, putative [Burkhold...  38.9    0.78 
ref|ZP_02243355.1|  YapH protein [Xanthomonas oryzae pv. oryzi...  38.9    0.78 
ref|AP_001998.1|  predicted tail fiber protein [Escherichia co...  38.9    0.78 
emb|CBG34958.1|  putative prophage tail fiber protein [Escheri...  38.5    1.0  
ref|ZP_01445204.1|  hypothetical protein R2601_22557 [Roseovar...  38.5    1.0  
ref|YP_003502625.1|  PPE-repeat protein [Escherichia coli O55:...  38.1    1.3  
ref|YP_003498320.1|  PPE-repeat protein [Escherichia coli O55:...  38.1    1.3  
emb|CBA28118.1|  hypothetical protein [Curvibacter putative sy...  38.1    1.3  
ref|YP_002401908.1|  putative tail fiber protein [Escherichia ...  38.1    1.3  
ref|NP_784110.1|  transport protein [Lactobacillus plantarum W...  38.1    1.3  
ref|YP_003311015.1|  Hemagluttinin domain protein [Veillonella...  37.4    2.3  
ref|ZP_05344342.1|  conserved hypothetical protein [Bryantella...  37.4    2.3  
ref|YP_001970298.1|  putative histone H1-like protein [Stenotr...  37.4    2.3  
ref|ZP_06661090.1|  predicted protein [Escherichia coli B088] ...  37.0    3.0  
ref|ZP_03929740.1|  conserved hypothetical protein [Anaerococc...  37.0    3.0  
ref|XP_002057224.1|  GJ16973 [Drosophila virilis] >gb|EDW62710...  37.0    3.0  
ref|ZP_05519135.1|  putative glycine betaine ABC transporter s...  36.6    3.9  
ref|XP_001421290.1|  predicted protein [Ostreococcus lucimarin...  36.6    3.9  
ref|YP_002026656.1|  hypothetical protein Smal_0268 [Stenotrop...  36.6    3.9  
ref|YP_860050.1|  hypothetical protein APECO1_2012 [Escherichi...  36.6    3.9  
ref|ZP_05950537.1|  putative tail fiber protein [Escherichia c...  36.2    5.1  
ref|YP_003062766.1|  penicillin binding protein 2A [Lactobacil...  36.2    5.1  
gb|EEQ45172.1|  conserved hypothetical protein [Candida albica...  36.2    5.1  
ref|XP_002651889.1|  phox homology domain protein [Enterocytoz...  36.2    5.1  
ref|ZP_03084192.1|  putative tail fiber protein [Escherichia c...  36.2    5.1  
gb|AAB38497.1|  female sex protein [Mercurialis annua]             36.2    5.1  
ref|YP_087940.1|  hypothetical protein MS0748 [Mannheimia succ...  36.2    5.1  
ref|NP_287395.1|  putative membrane protein of prophage CP-933...  36.2    5.1  
ref|NP_309677.1|  putative tail fiber protein [Escherichia col...  36.2    5.1  
ref|NP_785034.1|  penicillin binding protein 2A [Lactobacillus...  36.2    5.1  
ref|YP_001114540.1|  TRAP transporter solute receptor TAXI fam...  36.2    5.1  
ref|XP_720429.1|  hypothetical protein CaO19.2676 [Candida alb...  36.2    5.1  
ref|XP_002650577.1|  transcription initiation factor IIA, gamm...  35.8    6.6  
ref|XP_002650974.1|  superoxide dismutase [Enterocytozoon bien...  35.8    6.6  
ref|XP_002651976.1|  phox homology domain protein [Enterocytoz...  35.8    6.6  
ref|YP_002293321.1|  putative phage tail fiber protein [Escher...  35.8    6.6  
ref|YP_001743266.1|  putative prophage side tail fiber protein...  35.8    6.6  
emb|CAG07033.1|  unnamed protein product [Tetraodon nigroviridis]  35.8    6.6  
ref|ZP_04007962.1|  conserved hypothetical protein [Lactobacil...  35.4    8.7  
ref|YP_001818573.1|  cell wall surface anchor family protein [...  35.4    8.7  
ref|NP_868870.1|  signal peptide [Rhodopirellula baltica SH 1]...  35.4    8.7  


ALIGNMENTS
>emb|CBG34200.1| phage side tail fiber protein [Escherichia coli 042]
Length=1004

 Score = 45.8 bits (107),  Expect = 0.006
 Identities = 55/198 (27%), Positives = 75/198 (37%), Gaps = 9/198 (4%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFGADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQ  285
            AA+T   QA    + A    G  + +TR A    AKS  A +S + A   S +A  +   
Sbjct  154  AASTSAGQAASSAQSASSSAGTASTKTREA----AKSAAAAESSKSAAATSASAAKTSET  209

Query  286  KAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHEL  465
             A   +K A        + A   A       A  EA    +  A S  S AA        
Sbjct  210  NAAASQKSAATSASTATTKASEAATSARGAAASKEAAKSSETNASSSASSAASSA----T  265

Query  466  AAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAKQ  645
            AAG +   A  S  NA K++    GQ  +  AGS    A + +AAS         ATA  
Sbjct  266  AAGNSAKAAKTSETNA-KSSETAAGQSASAAAGSKTAAALSASAASTSAGQASASATAAG  324

Query  646  FPTLTRAIQSAGKGIKAG  699
                + A  ++    KAG
Sbjct  325  KSAESAASSASTATTKAG  342


>ref|ZP_06648468.1| conserved hypothetical protein [Escherichia coli FVEC1412]
 gb|EFF02085.1| conserved hypothetical protein [Escherichia coli FVEC1412]
Length=551

 Score = 43.9 bits (102),  Expect = 0.024
 Identities = 54/198 (27%), Positives = 73/198 (36%), Gaps = 9/198 (4%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFGADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQ  285
            AA+T   QA    + A    G  + + R A    AKS  A +S + A   S +A  +   
Sbjct  154  AASTSAGQAASSAQSASSSAGTASTKAREA----AKSAAAAESSKSAAATSASAAKTSET  209

Query  286  KAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHEL  465
             A   +K A          A   A       A  EA    +  A S  S AA        
Sbjct  210  NAAASQKSAATSASTATMKASEAATSARGAAASKEAAKSSETNASSSASSAASSA----T  265

Query  466  AAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAKQ  645
            AAG +   A  S  NA K++    GQ  +  AGS    A + +AAS         ATA  
Sbjct  266  AAGNSAKAAKTSETNA-KSSETAAGQSASAAAGSKTAAALSASAASTSAGQASASATAAG  324

Query  646  FPTLTRAIQSAGKGIKAG  699
                + A  ++    KAG
Sbjct  325  KSAESAASSASTATTKAG  342


>ref|YP_002412113.1| Side tail fiber protein from bacteriophage origin [Escherichia 
coli UMN026]
 emb|CAR12573.1| Side tail fiber protein from bacteriophage origin [Escherichia 
coli UMN026]
Length=1008

 Score = 43.9 bits (102),  Expect = 0.024
 Identities = 54/198 (27%), Positives = 73/198 (36%), Gaps = 9/198 (4%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFGADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQ  285
            AA+T   QA    + A    G  + + R A    AKS  A +S + A   S +A  +   
Sbjct  154  AASTSAGQAASSAQSASSSAGTASTKAREA----AKSAAAAESSKSAAATSASAAKTSET  209

Query  286  KAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHEL  465
             A   +K A          A   A       A  EA    +  A S  S AA        
Sbjct  210  NAAASQKSAATSASTATMKASEAATSARGAAASKEAAKSSETNASSSASSAASSA----T  265

Query  466  AAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAKQ  645
            AAG +   A  S  NA K++    GQ  +  AGS    A + +AAS         ATA  
Sbjct  266  AAGNSAKAAKTSETNA-KSSETAAGQSASAAAGSKTAAALSASAASTSAGQASASATAAG  324

Query  646  FPTLTRAIQSAGKGIKAG  699
                + A  ++    KAG
Sbjct  325  KSAESAASSASTATTKAG  342


>ref|YP_002402491.1| putative tail fiber protein [Escherichia coli 55989]
 emb|CAU97268.1| putative tail fiber protein [Escherichia coli 55989]
Length=1056

 Score = 43.5 bits (101),  Expect = 0.032
 Identities = 53/198 (26%), Positives = 74/198 (37%), Gaps = 9/198 (4%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFGADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQ  285
            AA+T   QA    + A    G  + + R A    AKS  A +S + A   S +A  +   
Sbjct  154  AASTSAGQAASSAQSASSSAGTASTKAREA----AKSAAAAESSKSAAATSASAAKTSET  209

Query  286  KAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHEL  465
             A   +K A        + A   A       A  EA    +  A S  S AA        
Sbjct  210  NAAASQKSAATSASTATTKASEAATSARGAAASKEAAKSSETNASSSASSAASSA----T  265

Query  466  AAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAKQ  645
            AAG +   A  S  NA +++    GQ  +  AGS    A + +AAS         ATA  
Sbjct  266  AAGNSAKAAKTSETNA-RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG  324

Query  646  FPTLTRAIQSAGKGIKAG  699
                + A  ++    KAG
Sbjct  325  KSAESAASSASTATTKAG  342


>ref|YP_003227698.1| putative side tail fiber protein [Escherichia coli O26:H11 str. 
11368]
 dbj|BAI23958.1| putative side tail fiber protein [Escherichia coli O26:H11 str. 
11368]
Length=986

 Score = 42.7 bits (99),  Expect = 0.054
 Identities = 55/199 (27%), Positives = 75/199 (37%), Gaps = 11/199 (5%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFG-ADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHM  282
            AA+T   QA    + A    G A T+ T A+     KS  A +S + A   S AA  +  
Sbjct  154  AASTSAGQAASSAQSASSSAGTASTKATEAS-----KSAAAAESSKSAAATSAAAAKTSE  208

Query  283  QKAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHE  462
              A   ++ A        + A   A       A  EA    +  A S  S AA       
Sbjct  209  TNAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNASSSASSAASSA----  264

Query  463  LAAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAK  642
             AAG +   A  S  NA K++    GQ  +  AGS    A + +AAS         ATA 
Sbjct  265  TAAGNSAKAAKTSETNA-KSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA  323

Query  643  QFPTLTRAIQSAGKGIKAG  699
                 + A  ++    KAG
Sbjct  324  GKSAESAASSASTATTKAG  342


>ref|YP_003220558.1| putative side tail fiber protein [Escherichia coli O103:H2 str. 
12009]
 dbj|BAI29424.1| putative side tail fiber protein [Escherichia coli O103:H2 str. 
12009]
Length=971

 Score = 41.6 bits (96),  Expect = 0.12
 Identities = 52/199 (26%), Positives = 73/199 (36%), Gaps = 11/199 (5%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFG-ADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHM  282
            AA+T   QA    + A    G A T+ T A      KS  A +S + A   S AA  +  
Sbjct  154  AASTSAGQAASSAQSASSSAGTASTKATEAE-----KSAAAAESSKSAAATSAAAAKTSE  208

Query  283  QKAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHE  462
              A   ++ A        + A   A       A  EA    +  A S  S AA       
Sbjct  209  TNAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNASSSASSAASSA----  264

Query  463  LAAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAK  642
              A AN  KA  ++    +++    GQ  +  AGS    A + +AAS         ATA 
Sbjct  265  -TAAANSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA  323

Query  643  QFPTLTRAIQSAGKGIKAG  699
                 + A  ++    KAG
Sbjct  324  GKSAESAASSASTATTKAG  342


>ref|YP_002412549.1| putative tail fiber protein [Escherichia coli UMN026]
 emb|CAR13015.1| putative tail fiber protein [Escherichia coli UMN026]
Length=973

 Score = 41.6 bits (96),  Expect = 0.12
 Identities = 52/199 (26%), Positives = 73/199 (36%), Gaps = 11/199 (5%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFG-ADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHM  282
            AA+T   QA    + A    G A T+ T A      KS  A +S + A   S AA  +  
Sbjct  156  AASTSAGQAASSAQSASSSAGTASTKATEAE-----KSAAAAESSKSAAATSAAAAKTSE  210

Query  283  QKAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHE  462
              A   ++ A        + A   A       A  EA    +  A S  S AA       
Sbjct  211  TNAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNASSSASSAASSA----  266

Query  463  LAAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAK  642
              A AN  KA  ++    +++    GQ  +  AGS    A + +AAS         ATA 
Sbjct  267  -TAAANSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA  325

Query  643  QFPTLTRAIQSAGKGIKAG  699
                 + A  ++    KAG
Sbjct  326  GKSAESAASSASTATTKAG  344


>ref|YP_001462390.1| phage tail domain-containing protein [Escherichia coli E24377A]
 gb|ABV21111.1| putative phage tail domain protein [Escherichia coli E24377A]
Length=1137

 Score = 40.8 bits (94),  Expect = 0.21
 Identities = 54/199 (27%), Positives = 75/199 (37%), Gaps = 11/199 (5%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFG-ADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHM  282
            AA+T   QA    + A    G A T+ T A+     KS  A +S + A   S +A  +  
Sbjct  154  AASTSAGQAASSAQSASSSAGTASTKATEAS-----KSAAAAESSKSAAATSASAAKTSE  208

Query  283  QKAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHE  462
              A   +K A        + A   A       A  EA    +  A S  S AA       
Sbjct  209  TNAAASQKSAATSASAATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA----  264

Query  463  LAAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAK  642
             AAG +   A  S  NA +++    GQ  +  AGS    A + +AAS         ATA 
Sbjct  265  TAAGNSAKAAKTSETNA-RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA  323

Query  643  QFPTLTRAIQSAGKGIKAG  699
                 + A  ++    KAG
Sbjct  324  GKSAESAASSASTATTKAG  342


>ref|YP_002292730.1| putative phage tail fiber protein [Escherichia coli SE11]
 dbj|BAG76979.1| putative phage tail fiber protein [Escherichia coli SE11]
Length=1132

 Score = 40.0 bits (92),  Expect = 0.35
 Identities = 51/198 (25%), Positives = 73/198 (36%), Gaps = 9/198 (4%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFGADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQ  285
            AA+T   QA    + A    G  + +   A    +KS  A +S + A   S AA  +   
Sbjct  154  AASTSAGQAASSAQSASSSAGTASAKATEA----SKSAAAAESSKSAAATSAAAAKTSET  209

Query  286  KAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHEL  465
             A   ++ A        + A   A       A  EA    +  A S  S AA        
Sbjct  210  NAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNASSSASSAASSA----T  265

Query  466  AAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAKQ  645
            AAG +   A  S  NA +++    GQ  +  AGS    A + +AAS         ATA  
Sbjct  266  AAGNSAKAAKTSETNA-RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAG  324

Query  646  FPTLTRAIQSAGKGIKAG  699
                + A  ++    KAG
Sbjct  325  KSAESAASSASTATTKAG  342


>ref|YP_311286.1| phage protein-related [Shigella sonnei Ss046]
 gb|AAZ89051.1| phage protein-related [Shigella sonnei Ss046]
Length=1029

 Score = 40.0 bits (92),  Expect = 0.35
 Identities = 51/199 (25%), Positives = 73/199 (36%), Gaps = 11/199 (5%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFG-ADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHM  282
            AA+T   QA    + A    G A T+ T A+     KS  A +S + A   S  A  +  
Sbjct  156  AASTSAGQAASSAQSASSSAGTASTKATEAS-----KSAAAAESSKSAAATSAGAAKTLE  210

Query  283  QKAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHE  462
              A   ++ A        + A   A       A  EA    +  A S  S AA       
Sbjct  211  TNAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNASSSASSAASSA----  266

Query  463  LAAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAK  642
              A AN  KA  ++    +++    GQ  +  AGS    A + +AAS         ATA 
Sbjct  267  -TAAANSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA  325

Query  643  QFPTLTRAIQSAGKGIKAG  699
                 + A  ++    KAG
Sbjct  326  GKSAESAASSASTATTKAG  344


>ref|ZP_04535755.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
 gb|EEH86290.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
Length=944

 Score = 39.7 bits (91),  Expect = 0.46
 Identities = 49/198 (24%), Positives = 72/198 (36%), Gaps = 9/198 (4%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFGADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHMQ  285
            AA+T   QA    + A    G  + + R A    AKS  A +S + A   S +A  +   
Sbjct  63   AASTSAGQAASSAQSASSSAGTASTKAREA----AKSAAAAESSKSAAATSASAAKTSET  118

Query  286  KAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHEL  465
             A   ++ A        + A   A       A  EA    +  A S  S AA        
Sbjct  119  NAAASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNAASSASSAASSA-----  173

Query  466  AAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAKQ  645
             A AN  KA  ++    +++    GQ  +  A S    A + +AAS         ATA  
Sbjct  174  TAAANSAKAAKTSETNARSSETAAGQSASAAADSKTAAALSASAASTSAGQASASATAAG  233

Query  646  FPTLTRAIQSAGKGIKAG  699
                + A  ++    KAG
Sbjct  234  KSAESAASSASTATTKAG  251


>ref|YP_001725078.1| prophage tail fibre domain-containing protein [Escherichia coli 
ATCC 8739]
 gb|ACA77751.1| Prophage tail fibre domain protein [Escherichia coli ATCC 8739]
Length=1007

 Score = 39.7 bits (91),  Expect = 0.46
 Identities = 54/199 (27%), Positives = 74/199 (37%), Gaps = 11/199 (5%)
 Frame = +1

Query  106  AATTMNPQAMGKIEKALLKFG-ADTEQTRAAMLAYAKSLLAGDSEQEATTRSTAALNSHM  282
            AA+T   QA    + A    G A T+ T A+     KS  A +S + A   S  A  +  
Sbjct  154  AASTSAGQAATSAQSASSSAGTASTKATEAS-----KSAAAAESSKSAAATSAGAAKTSE  208

Query  283  QKAQKGKKVAGPQGPNVgsgagsgaapgTRMKALGEAGLEKKFGAGSKGSEAALKVFRHE  462
              A   +K A        + A   A       A  EA    +  A S  S AA       
Sbjct  209  TNAAASQKSAATSASAATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSA----  264

Query  463  LAAGANVTKAYASALNAGKNNVVTFGQKVANTAGSFGTLAKNVTAASFPTFNRGLKATAK  642
             AAG +   A  S  NA +++    GQ  +  AGS    A + +AAS         ATA 
Sbjct  265  TAAGNSAKAAKTSETNA-RSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAA  323

Query  643  QFPTLTRAIQSAGKGIKAG  699
                 + A  ++    KAG
Sbjct  324  GKSAESAASSASTATTKAG  342