GOS 1318050

From Metagenes
Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary!


Sequence
CAMERA AccNum : JCVI_READ_1093025574894
Annotathon code: GOS_1318050
Sample :
  • GPS :8°24'54s; 124°14'23w
  • Tropical South Pacific: 600 miles from F. Polynesia - International
  • Open Ocean (-2m, 27.6°C, 0.1-0.8 microns)
Authors
Team : Algarve
Username : MSVMFV
Annotated on : 2010-10-16 00:42:29
  • MarianaFilipaVal a36955@ualg.pt
  • MiguelSousaViegas a29855@ualg.pt
  • TatianaMartins a36786@ualg.pt

Synopsis

Genomic Sequence

>JCVI_READ_1093025574894 GOS_1318050 Genomic DNA
ACTTTAGATTTTGATGTAGATGACTTTACAATTACTTTAGGTGGCGACCTTTCAGGTAGTGCAACTGTAACAAACTTAGGCGATGCCACTCTAACAGCCA
CAATAACTGCCAATAGTGTAGCTTTGGGTACAGACACAACAGGAAACTTTGTTGCTGACCTTACCGCAGGGGAAGGCATAGATGTAAGCGGTGGTGGATC
TGAAAACGCAACCATAACTGTATCGGCAGAGGACGCTACAAGCAGTAACAAAGGTATAGCTAGTTTTGATAGCACAGATTTCACAGTATCAAGTGGAGCT
GTCACAGTAAATGCAGAAAGAGTTCAAGATATCGTAGGGGCAATGGTTGGCTCTAACACAGAGTCAGGCATAACTGTTACCTACGAAGATTCAGATGGCA
CACTAGATTTCAATGTAGCTGACCCTGTTATAACATTAAGTGGAGATGTAGCAGGTTCGGCTACTATGACCAATCTTGGAGATGTCACAATCTCTACCAC
CATACAAGCTAATTCAATTGCATTAGGCACAGACACCACTGGAAACTATGTATCAGCTATCTCTGCAGGAGAGGGAATAGATGTTTCAGGAAGTGGGAGT
GAAACAGCTACAGTTACCATAAGTGCCGAAGATGCCACTGATTCTAACAAAGGTATTGCCTCATTTGACGCAACTGACTTTACTGTTAGTTCTGGTGATG
TAACTGTCAATGCAGAAAGAATCCAAGACATTGTGGGAGCAATGTTTTCTTCTAATACCGAAAGCGGTATATCTGTTACTTACGAAGATAGTGATGGCAC
GATTGACTTAGATGTAAGTGACCCTACGCTTTCTTTACAGGCGATGTCACAGGTTCAGGAACAATAACCAACTTAGGCAATACTTCT

Translation

[1 - 864/887]   direct strand
>GOS_1318050 Translation [1-864   direct strand]
TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGIDVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGA
VTVNAERVQDIVGAMVGSNTESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSAISAGEGIDVSGSGS
ETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTLSLQAMSQVQEQ

[ Warning ] 5' incomplete: does not start with a Methionine

Annotator commentaries

Based on the results of this analysis, we conclude that there are credible homologous sequences, given the E-values (the highest being 4e-144) and the scores obtained with BLASTp vs ENV_NR.

The status chosen was coding, because the sequence contains ORFs, all of them longer than 60 aa, and it has homologs.

The biological process and molecular function are not available, since no protein domains were found in the sequence.

This DNA fragment probably belongs to a marine metagenome, since the taxonomy report showed no other organisms in the BLASTp vs ENV_NR search, which gave the best E-values, scores and % identity.




ORF finding

PROTOCOL


a) SMS ORFinder / forward strand / frames 1, 2 & 3 / min 60 AA / 'any codon' initiation / 'standard' genetic code

b) SMS ORFinder / reverse strand / frames 1, 2 & 3 / min 60 AA / 'any codon' initiation / 'standard' genetic code
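
The search described in the protocol above can be sketched in a few lines of Python. This is only an illustrative approximation, not the SMS ORF Finder code: the names `find_orfs` and `reverse_complement` are made up here, and it is assumed that with 'any codon' initiation an ORF is simply a run of codons between two stop codons (or the ends of the read), with the minimum length counted in amino acids, stop excluded.

```python
# Standard genetic code, built from the usual TCAG-ordered amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (A/C/G/T only are swapped)."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_orfs(dna, min_aa=60):
    """Scan frames 1-3 of one strand.

    Returns (frame, start, end, protein) tuples with 1-based inclusive
    coordinates on the strand that was passed in. Assumption: any codon
    may open an ORF, so an ORF runs from the frame start (or the codon
    after the previous stop) up to and including the next stop, or to
    the end of the read if no stop is reached.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = frame          # 0-based index of the first base of the current ORF
        protein = []
        for pos in range(frame, len(dna) - 2, 3):
            aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X for ambiguous codons
            protein.append(aa)
            if aa == "*":
                if len(protein) - 1 >= min_aa:
                    orfs.append((frame + 1, start + 1, pos + 3, "".join(protein)))
                start = pos + 3
                protein = []
        # ORF still open at the end of the read (no stop codon reached)
        if len(protein) >= min_aa:
            orfs.append((frame + 1, start + 1, start + 3 * len(protein), "".join(protein)))
    return orfs
```

For the reverse strand, the same function would be called on `reverse_complement(read)`; the coordinates are then relative to the reverse-complemented read.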



RESULTS ANALYSIS


Four ORFs were found across the two strands.

The only ORF on the forward strand is in frame 1; no other ORFs were found on this strand. On the reverse strand, the largest ORF is the second one in frame 1. All the ORFs on the reverse strand are longer than 60 aa, but after checking the E-values and scores of the respective BLASTp and BLASTx searches, it is clear that none of them is significant.

The ORF under study has no start codon: its translation begins with a threonine (codon ACT), which is not a start codon. It does end with a stop codon (TAA), so it can be concluded that the ORF is incomplete at its 5' end.

Since the ORF has a stop codon and has homologs, it can be concluded that the status is coding.
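
The start/stop reasoning in this analysis can be expressed as a small check on the first and last codon of an ORF. The sketch below is illustrative only: the function name `orf_completeness` and its labels are made up for this example, and it assumes the standard genetic code with ATG as the only accepted start codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def orf_completeness(orf_dna, start_codons=("ATG",)):
    """Classify an in-frame ORF by looking at its first and last codon."""
    orf_dna = orf_dna.upper()
    if len(orf_dna) < 6 or len(orf_dna) % 3 != 0:
        raise ValueError("expected an in-frame ORF of at least two codons")
    has_start = orf_dna[:3] in start_codons
    has_stop = orf_dna[-3:] in STOP_CODONS
    if has_start and has_stop:
        return "complete"
    if has_stop:
        return "5' incomplete"
    if has_start:
        return "3' incomplete"
    return "5' and 3' incomplete"


# First 24 and last 27 bases of ORF 1 (direct strand, frame 1) from the raw
# results; the middle of the ORF is omitted because only the ends matter here.
head = "ACTTTAGATTTTGATGTAGATGAC"
tail = "GCGATGTCACAGGTTCAGGAACAATAA"
print(orf_completeness(head + tail))  # 5' incomplete
```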

RAW RESULTS

a) forward strand

>ORF number 1 in reading frame 1 on the direct strand extends from base 1 to base 867.
ACTTTAGATTTTGATGTAGATGACTTTACAATTACTTTAGGTGGCGACCTTTCAGGTAGT
GCAACTGTAACAAACTTAGGCGATGCCACTCTAACAGCCACAATAACTGCCAATAGTGTA
GCTTTGGGTACAGACACAACAGGAAACTTTGTTGCTGACCTTACCGCAGGGGAAGGCATA
GATGTAAGCGGTGGTGGATCTGAAAACGCAACCATAACTGTATCGGCAGAGGACGCTACA
AGCAGTAACAAAGGTATAGCTAGTTTTGATAGCACAGATTTCACAGTATCAAGTGGAGCT
GTCACAGTAAATGCAGAAAGAGTTCAAGATATCGTAGGGGCAATGGTTGGCTCTAACACA
GAGTCAGGCATAACTGTTACCTACGAAGATTCAGATGGCACACTAGATTTCAATGTAGCT
GACCCTGTTATAACATTAAGTGGAGATGTAGCAGGTTCGGCTACTATGACCAATCTTGGA
GATGTCACAATCTCTACCACCATACAAGCTAATTCAATTGCATTAGGCACAGACACCACT
GGAAACTATGTATCAGCTATCTCTGCAGGAGAGGGAATAGATGTTTCAGGAAGTGGGAGT
GAAACAGCTACAGTTACCATAAGTGCCGAAGATGCCACTGATTCTAACAAAGGTATTGCC
TCATTTGACGCAACTGACTTTACTGTTAGTTCTGGTGATGTAACTGTCAATGCAGAAAGA
ATCCAAGACATTGTGGGAGCAATGTTTTCTTCTAATACCGAAAGCGGTATATCTGTTACT
TACGAAGATAGTGATGGCACGATTGACTTAGATGTAAGTGACCCTACGCTTTCTTTACAG
GCGATGTCACAGGTTCAGGAACAATAA

>Translation of ORF number 1 in reading frame 1 on the direct strand.
TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI
DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT
ESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTT
GNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAER
IQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTLSLQAMSQVQEQ*

No ORFs were found in reading frame 2.

No ORFs were found in reading frame 3.

---------------------------------------------------------------------
b) reverse strand

>ORF number 1 in reading frame 1 on the reverse strand extends from base 109 to base 342.
GTAACAGATATACCGCTTTCGGTATTAGAAGAAAACATTGCTCCCACAATGTCTTGGATT
CTTTCTGCATTGACAGTTACATCACCAGAACTAACAGTAAAGTCAGTTGCGTCAAATGAG
GCAATACCTTTGTTAGAATCAGTGGCATCTTCGGCACTTATGGTAACTGTAGCTGTTTCA
CTCCCACTTCCTGAAACATCTATTCCCTCTCCTGCAGAGATAGCTGATACATAG

>Translation of ORF number 1 in reading frame 1 on the reverse strand.
VTDIPLSVLEENIAPTMSWILSALTVTSPELTVKSVASNEAIPLLESVASSALMVTVAVS
LPLPETSIPSPAEIADT*

>ORF number 2 in reading frame 1 on the reverse strand extends from base 565 to base 885.
ACTCTTTCTGCATTTACTGTGACAGCTCCACTTGATACTGTGAAATCTGTGCTATCAAAA
CTAGCTATACCTTTGTTACTGCTTGTAGCGTCCTCTGCCGATACAGTTATGGTTGCGTTT
TCAGATCCACCACCGCTTACATCTATGCCTTCCCCTGCGGTAAGGTCAGCAACAAAGTTT
CCTGTTGTGTCTGTACCCAAAGCTACACTATTGGCAGTTATTGTGGCTGTTAGAGTGGCA
TCGCCTAAGTTTGTTACAGTTGCACTACCTGAAAGGTCGCCACCTAAAGTAATTGTAAAG
TCATCTACATCAAAATCTAAA

>Translation of ORF number 2 in reading frame 1 on the reverse strand.
TLSAFTVTAPLDTVKSVLSKLAIPLLLLVASSADTVMVAFSDPPPLTSMPSPAVRSATKF
PVVSVPKATLLAVIVAVRVASPKFVTVALPERSPPKVIVKSSTSKSK

No ORFs were found in reading frame 2.

>ORF number 1 in reading frame 3 on the reverse strand extends from base 600 to base 797.
TACTGTGAAATCTGTGCTATCAAAACTAGCTATACCTTTGTTACTGCTTGTAGCGTCCTC
TGCCGATACAGTTATGGTTGCGTTTTCAGATCCACCACCGCTTACATCTATGCCTTCCCC
TGCGGTAAGGTCAGCAACAAAGTTTCCTGTTGTGTCTGTACCCAAAGCTACACTATTGGC
AGTTATTGTGGCTGTTAG

>Translation of ORF number 1 in reading frame 3 on the reverse strand.
YCEICAIKTSYTFVTACSVLCRYSYGCVFRSTTAYIYAFPCGKVSNKVSCCVCTQSYTIG
SYCGC*

Multiple Alignment

PROTOCOL


a) ClustalW2 (EBI) / default parameters / output order: input



RESULTS ANALYSIS


The MSA shows that the sequence lacks its amino terminus, which is consistent with the absence of a start codon. The carboxyl terminus is present, and the stop codon is TAA.

The sequences did not all have the same length, although an effort was made to choose ones of similar size. To improve the alignment, some sequences that were longer than GOS_1318050 were removed, namely Microcoleus chthonoplastes PCC 7420, Lyngbya sp. PCC 8106, Synechococcus sp. RS9917, Synechococcus sp. RS9916, Synechocystis sp. 6803, Synechococcus sp. WH 7805 and Chlorobium limicola DSM 245.

The E-values from BLASTx vs NR are not very reliable because they are not close to 0 (i.e., they are greater than 1e-4). With such E-values the homology is not sufficiently credible; accordingly, the multiple sequence alignment shows no fully conserved position in the ORF.
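
In ClustalW output, a fully conserved column is flagged with `*` in the line beneath each block. As a rough illustration of what "no conserved position" means, the sketch below lists the columns in which every aligned row carries the same residue and no gap. The function name `conserved_columns` and the toy rows are hypothetical and are not taken from the alignment in RAW RESULTS.

```python
def conserved_columns(rows, gap="-"):
    """Return 0-based indices of columns where all rows share one residue and none has a gap."""
    if not rows:
        return []
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")
    conserved = []
    for i in range(width):
        column = {row[i] for row in rows}
        if len(column) == 1 and gap not in column:
            conserved.append(i)
    return conserved


# Hypothetical toy alignment, for illustration only
toy = ["TLDF-", "TLEFA", "TLDFA"]
print(conserved_columns(toy))  # [0, 1, 3]
```

Applied to the aligned rows of one ClustalW block (names and trailing counters stripped), an empty result would correspond to a block with no `*` in its consensus line.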


RAW RESULTS

CLUSTAL 2.0.12 multiple sequence alignment


GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              ---MANATATQLQQLYVAYFGRAADPTGLDYWVGQGTTTKSFAASMYAQDEFESVNGSLS 57
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          -----------------------------------------------MPPLISIVIVNYN 13
T.erythraeum           ----------------------------------------------------------MS 2
Nostoc                 ------------------------------------------------------------
K.koreensis            ------------------------------------------------------------
F.varium               ------------------------------------------------------------
C.atlanticus           MFVAGLSFGQIVTNGDDAGPGSLRDAVAQANANAGADVITFNGNFTVNLTSGEILVTDDV 60
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              VELQVNQIYQNLFGRDGDTAGLTYWANQIRTGSLELASIANDLIYAVNNGSSATDLTALT 117
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          RESYLGVAIASVLAQTWQDFELLIWDDGSTDGSVAIANAYAQQDGRVRVVEAQHQGVSAA 73
T.erythraeum           KVATKNITFWNTTGYNYPGFGKTVEVPDGLGGTISITYNQIDYNDRNWKAKPDGSVPVTE 62
Nostoc                 ---------MVIRGTNNDDNLIGTTGNDIIEGLGGNDRFEGGRGNDTLTGGTGNDVFNLE 51
K.koreensis            ---MPKRSFSKLYLPVLFGLGLIPAIGLTAAPTVTATLDDQQKTDDDSDGNLDRGDTLTY 57
F.varium               ------------------------------------------------------------
C.atlanticus           TITGNGTGNTIIDGSSNSGRIFNFQMDGGIIAGASSTLDAITIQNGSITGTADGGGAIFV 120
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              NKTNAATSYTADIRESSSAFLAYQPKSSSPWVTGTNFETAKTFFKTVTATNAPSAAEVQA 177
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          CKAAIAQTSGIYIGIVDSDDILAPTALAQTATVLNRHPETGFVYTDYLNIDEKGIVIGYG 133
T.erythraeum           NRFLSGEVG------NNFSLTGQVQGITATMPLIRTLGAENNYSVTLDFSKYKASAGGSK 116
Nostoc                 QQQDNDVVTDFVRGQDKIDLRNLNINDWATLQLLISNDGQDNALITTFFNGSQSQIKLLN 111
K.koreensis            EAQLDNAAGGDEAQSVLFEAALDANTTLVPGSVKVTPVAVADSFQSYGGITLSVSVANGL 117
F.varium               ---------------------------------------------------------MGG 3
C.atlanticus           SANDIVAPSSLTIVNCDFNNNSTESDTSGDGGGGAIYASDVSLDITGTEFNNNIATAASG 180
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              SVDVIASNSSNVATSDTAFTLTTGVDEPTTTYTKFDASLTSGGTQTLGSLDKISGTTGSS 237
A.variabilis           MDP------------------ISGTNGNDNLSGTSGDDIIQG----FNGNDTLSGLGGND 38
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          HRCNIAYSQDNLLVNFMTFHFRLMRRSVYDRVGGFNESFSGCAYDYDLCLRLSEVTQVRR 193
T.erythraeum           ATDGATSGDTYLAISRFFNGSKTGYTTIKVTARTLDGSELNLGDWKLFNSGTLGQGNNAR 176
Nostoc                 INPNLLQASDFIFNTVNLNQTIDGTNFADQLFGGLGNDTLRG----FNGNDVLFGEQGDD 167
K.koreensis            LANDFDPKDANLPTNAGMTVVAETVATTEGGSATLFADGSFNYTAPANFTGTDTFSYTAN 177
F.varium               KIVMISNFSEVEKSLKRCLKEKVSITAATVVGFLIAGTVAFGGEPTNVTFTTKSEGKVTV 63
C.atlanticus           SGGAIYYESQSVLTTFSLVNSDLTGNIAGRAGGAIETNTPNVLTITNVTFDGNQATGTPG 240
O.terrae               --------------------------MAKRIKKNSKSARSLLKLEALEQRQLLAGGFTDA 34
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              DVLNATIKSSVTPASITGIETINVVADAAATLGLVNASGFNTVNAGGAAGALTISGLPTT 297
A.variabilis           RLEGGRGDDTLTGGAGNDVFNFEQLQDNDVVTDFVRGQDKIDVRN--------------- 83
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          VQEPLYLYRLHSQSMSVTRRTEQILWSQKAIAQALWRRGLAEKLQID------------- 240
T.erythraeum           IFLNAATQILSGKNSQGKPEFCPTPPNPNSSLGKGTGLGLFELASN-------------- 222
Nostoc                 RFEGGRGNDTFYGGAGNDVFNLEQQQDNDVVTDFVRGQDKIDLRN--------------- 212
K.koreensis            DDDAMTDSGVVSISVDGQIWFVDAAAAAGGDGSQALPFNDVTSLNGADG----------- 226
F.varium               QVGSGGVNDVNGATNFISVGQYETLLGEGEGKNEELAKLLTVETDE-------------- 109
C.atlanticus           NGGALHNTGISDTNVTGSTFINNIAGSEGGALWNQAGGTMTVNASTVTGNEAQGNDSSNG 300
O.terrae               QGQQWQDIVHSNGNVYDQVLLKSSSITVDADPGQVTRVSFLDQQGD-------------- 80
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              TTVTITDSAVDHTLSFKEVSGESDSATISFAQVTGGATTEINIAGIETITATSAGGSASN 357
A.variabilis           ----LNINDWATLQLLISNDGQDNALITTFSSGSQSRIKLLNINPNLLQASDFIFNTVNL 139
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          ----------VELPAGRFILRRKKPLLHKIAASVLAILPLVTVTSMGLSQAQQIVPAADG 290
T.erythraeum           ---QRYVSITLSFKQKKITGSLPNDLHDIYVASNTSSTSPIDVTPPPMDINRGLQVHLPL 279
Nostoc                 ----LNINDWATLQLLISNDGQDNALITTFFNGSQSQIKLLNINPNLLQASDFIFNTVNL 268
K.koreensis            ----VGDVDESGDIIYFAAGTYSDGLELEADQKVIGSGVALTINGDDYVNAGVAPDFGGL 282
F.varium               --------------------NGVTTFASAVNEGGDTPLGTVKATATTDGALTLIKTEKDY 149
C.atlanticus           GGGLFNNGGTLIVSGATLVNNNSATGTSGSGGGILSTDGDVTVTDPNTTVNSNIANRAGG 360
O.terrae               ----------------IVQAEFSGAGTLTIALDPDTYHGPAAAVNYNQPDVMYVQGLASF 124
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              SFELEAAALTTLTLDGSGD------LTLSSLNTGGTTSLKSIDASAMTGALDVTTGALSL 411
A.variabilis           NQTIEGTNFADQLFGGLGN------DTLRGFNGNDVLFGEQGDDRFEG------------ 181
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          TNTIVTPNGNRLDITGGITS-----------GDGANLFHSFQQFGISPEQIANFQASPAL 339
T.erythraeum           NEIVKNAESKKEVVDISGA------QVNGKVNGAKVVADSKFG----------------- 316
Nostoc                 NQTIDGTNFADQLFGGLGN------DTLRGFFGNDVLFGEQGDDRFEG------------ 310
K.koreensis            FELASNNTIEGFNFNPGTG-------------------YSISGNAASGGVITNSSASLSG 323
F.varium               KSIISELTIEATGFDADNT----------------------------------------- 168
C.atlanticus           GIEIIDGSLTLQDVSLNSNNAGVAPNASASPGNGGGLHVSGIATIIINGGNIQGNIAANE 420
O.terrae               TITGSDASTNFSVFSVGKG----------------------------------------- 143
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              GTSALTISGGSGNDSIDSSAHTGTDVSISGGAGNDTVTHVLSIADTIAGGAGTDNLITTA 471
A.variabilis           GRGDDTFYGGAGNDVFNLEQLQDNDVVIDFVRGQDKIDVRN---LNINDWATLQLLISND 238
M.aeruginosa           ----------------------------------------------MTLQEGTYIWEYGD 14
N.punctiforme          QNILGRITGGNASVINGLIQVTGGNANLFLMNPAGFIFGSNATLNVPGAFTVTTANGIGF 399
T.erythraeum           HCLSFDGVDDYLELPTATIPQTGAITISFWANGDNSLPKDNSIIAAYDQSNNRVINIHLP 376
Nostoc                 GRGNDTFYGGAGNDVFNLEQLQDNDVVTDFVRGQDKIDLRN---LNINDWATLQLLISND 367
K.koreensis            SAGIVNLQNHSGSFNWDANVSGGTSVSAIAIDGGNANYTVSGDIGLTGGRAIDVQNVTGG 383
F.varium               --ATAMEATNSGDKVTNAGEITVNEHAVGMVAGAGATAVNNAKTDKTAGITVNADAGEAN 226
C.atlanticus           GGGLWNQLGSTMTVGNTPTPNFSDNIASGDAPTTGGGAIFNNGGDLIVLAGNNIVNNLAD 480
O.terrae               NASNQALFDDTHTGGDHWANVARLTIVASPSNPNGSLFGGIRAGDAIFSAENGVVGITAS 203
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              VIASAGNVSDFETFTLDTSAGNLDQDFDHLSGNTFTKLVADGDGTTTDNPTFTDAPSTIT 531
A.variabilis           GQDNALITTFSSGSQSQIKLLNINPNLLQASDFIFNTVNLN---------------QTID 283
M.aeruginosa           TTQALWFTASYNTVTNQWTVDMKKGSMDLNAFWWS------------------------N 50
N.punctiforme          GSSWFNAIGVNDYAALVGNPNGFAFSMNQPGAIANAGNLAVGVGQNLNLLGGTVVNTGQL 459
T.erythraeum           WNNSYIYFDCGNTGNSYDRIEKLAKAADFKGKWTHWVFTKNVTTGEMKIYLNGALWSSGK 436
Nostoc                 GQNNALITTFFNGDQSQIKLLNINPNLLQASDFIFNTVNLN---------------QTID 412
K.koreensis            SITATGAMTINSGSALNLINNSGNPGFEFADITVTNGSGTAINLVDNGAATYQFNGDVNL 443
F.varium               TAIGMLADGDKATATNNGKIDAQKGIGMLAKNGATIENSAG--------------ATIEA 272
C.atlanticus           GVSGSGGAILSNGGTVTITETTFTNNVSTRAGGAIEHTGGTLNLTGVNFDSNNVGVTSPL 540
O.terrae               NVQVQSVVRIGDIDAKGTATPALVFGEQSQ-----------------------FVSVDIA 240
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              DLELTADASGTVVVDRKTDTSA-DAITITAKGDTTTAVTVNEEETITFASDTAASVITTL 590
A.variabilis           GTNFADQLFGGLGNDTLRGFNG-NDVLFGEQGDDRFEGGRGDDTFYGGAGNDVFNLEQLQ 342
M.aeruginosa           GDSNADGNIVLSKADNSLNMNG-TGIVWDGYDKISDTGLTGTEHNGSSLLTAGNTYTYSY 109
N.punctiforme          SAPGGQISITSVPGQNWVRLSQPGNLLSLEIQPLASSSTQPNNWTIPIASIPELLTVGNT 519
T.erythraeum           DKSKQISGMTLVKLGSGYGFYH-GQVAHLRIYDRVLSAEEINECMKVDVTPPPMDINRGL 495
Nostoc                 GTNSPDQLFGGLGNDTLRGFFG-NDVLFGEQGDDRFEGGRGNDTFYGGAGNDVFNLEQLQ 471
K.koreensis            GTTNGAGMVANSGLISLADAASFNNLFNTNGGPALDLTNVNIGTMSLDSIASSNSTSEGL 503
F.varium               GNEDTSAGTGMLVTENGTATNN-GTINVNYTGSIGMSTTTVADSTTTLTNNGTIDVKAGT 331
C.atlanticus           NSNPGNGGALHVGGDATTNITG-GSVINNQAANEGGGLWNGSGIMTIIDVVIDGNSAHSA 599
O.terrae               GGDLVQTNAKAINNSGSYGFSL-SAIDNVDSSGAPVMASTITNTSVQFSDKSPFPAPKSY 299
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              NAADATKLSVTG-SADATISTISGNDNLATIDASASTGAVSIGDTANGSVVPMTITAGSG 649
A.variabilis           DNDVVIDFVRGQDKIDVRNLNINDWTTLQLLISNDGQDNALITTFFNGSQSRIKLLNINP 402
M.aeruginosa           SKDQGVEIEALLAGGVTTLGVRATSVNGTDGIKAVDGQYVFVPYDTTPPTVTVNIVDASL 169
N.punctiforme          GLTANPDGTVKLTGSNVTIPTTPGTTIVSGKVDVSSQTGGTVTALGKKVALVDANINASG 579
T.erythraeum           QVHLPLNEIVKNAKSKKEVVDISGAQLNGKVNGAKVVADSKFGHCLSFDGVDDYLELPTA 555
Nostoc                 DNDVVTDFVQGQDKIDIRNLNINDWATLQLLISNDGQDNALITTFFNGDQSQIKLLNINP 531
K.koreensis            NLTNITGSLELGNITVNDAAADAVLLSGGTLALSNVTGSVVVDNPGLSAVKIEGATLGNI 563
F.varium               GISVAGAGGAKVTFGAAGKIEVVDARGNIGININGTAKTSNTTITGGTIELKGAGTGINI 391
C.atlanticus           AAATSGGGGIYNEGGTVTTDATTQIINNIATVGPSGSGGGILNAGGTFTATGTTITNNTA 659
O.terrae               DLTTAIDTIVGSTADDVINGSHTATSLVVSALDKIDGAAGNDVLNISDSNGGTAQLSG-L 358
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              GYTGSGGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADDLDGGAGNDVISGGAGNDTITA 709
A.variabilis           NLLQASDFIFNTVN---LNQTIEGTNFADQLFGGLGNDTLRGFNGNDVLFGEQGDDRFEG 459
M.aeruginosa           NDGDNNSLVTFQFS-----ETVSGFTVGDVSVSGGTLSNFTQVDGNSYQATFTADDAVET 224
N.punctiforme          TNGGGTVLIGGDYQGKGTVPNADRTYVDSKSVIN-ADSHLNGNGGQVIVWGNDTTQYFGK 638
T.erythraeum           TIPQTGAITISFWANGDNSLPKDNSIIAAYDQSNNRVINIHLPWNNSYIYFDCGNTGNSY 615
Nostoc                 NLLQASDFIFNTVN---LNQTIDGTNSPDQLFGGLGNDTLRGFFGNDVLFGEQGDDRFEG 588
K.koreensis            ALGNVSVTNGSASHGVSISDTTNPITLGDYALQQGAQGLLISNTSGGVTLDSVSLGQTDS 623
F.varium               AGAGTTTVTGGTIELKGAGTGINAKGNVTLKDIAITLSDGATGTGIVYAGGDTEAEAEIS 451
C.atlanticus           NRAG-GGIEANNTDSGSVPGIVNLTNVTLDNNNAGGVAPAPGNGGGLHVSGSSAINITGG 718
O.terrae               TVKNVETFMYTSTGTLGSANAVDMTGWTGLTSANLTLQNIAANTATVTATKDTAVVLSSS 418
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              GSGNDNLDGGADADTFTLSTNYTTDDTIVGGAGEDSVSMTIAGGTTTYTAASISGVEVFT 769
A.variabilis           GRGDDTFYSGAGNDVFNLEQLQDNDVVIDFVRGQDKIDLRSLN----------------- 502
M.aeruginosa           TGSVSVAAASYSDVAGNQGGAGTDTVTIDTKN---------------------------- 256
N.punctiforme          ISARGGANAGNGGFVEVSGKNFLTFNGLVDASAPNGSFGTLLLDP--------------- 683
T.erythraeum           DRIEKLAKAADFKGKWTHWVFTKNVTTGEMKIYLNGALWSSGK----------------- 658
Nostoc                 GRGNDTFYGGAGNDVFNLEQLQDNDVVIDFVQGQDKIDIRSLN----------------- 631
K.koreensis            LTSGGVAISGNNSGSIDLGTGSISSASPFTVSGGTATIDYSGAIEQTAS----------- 672
F.varium               TRDIDIKAAGKGIEATMSKVAKSTLNITTGIISTVEGATGIKITG--------------- 496
C.atlanticus           TVSGNTASKEGGGLWNNQGVMTITGTTIDGNDAQGDLVADPLEIVGGGGIFAEDGAG--- 775
O.terrae               TTVTPGLVKVVGGSTVTVTNAAGSANFPATATEVDGVN---------------------- 456
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              LTGSTAANTYDFKNLTGITEVTIADGAAFNTTVQGLNSGVIVNILEPTDITTIDTANAAS 829
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          ------------------------------------------------------------
T.erythraeum           ------------------------------------------------------------
Nostoc                 ------------------------------------------------------------
K.koreensis            ------------------------------------------------------------
F.varium               ------------------------------------------------------------
C.atlanticus           ------------------------------------------------------------
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              VSVDVEVSKTGGSVAITDTAAVTITGKVATGAMQALALDAVDTTSLSLKTILTHDLNTGA 889
A.variabilis           ---------------------------ISDWSTLQTLISNDGQNNALITTFFNGSQSQIK 535
M.aeruginosa           -------------------------------------------------PTLAVDIVDAS 267
N.punctiforme          ------------------------------STLTIIDAAAGTGDFDATAGNIAFNDPDIG 713
T.erythraeum           --------------------------DKSKQISGMTLVKLGSGYGFYHGQVAHLRIYDRV 692
Nostoc                 ---------------------------ISDWATLQLLISNDGQNNALITTFFNGDQSQIK 664
K.koreensis            ----------------------------GAALSVANHGSGTITLDGATIGATSGNGLQFN 704
F.varium               --------------------------------TELEELSDGETPTTATVTTTLGKTKGTG 524
C.atlanticus           -------SVVIGAGTIISNNFASGTQGSGGGILMATGTTLSIDGSAGAVMITGNSASRAG 828
O.terrae               ----------------------------GTSSVSITQTTSGTQGAVLVKDYAIGGDKAKA 488
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              ITNTDKVTTFNLTTAAHTGTIDTTTFADATSLETITATGVGGDITTTTIGNGGAANDLST 949
A.variabilis           LNNINPNLLQASDFIFNTNTVNNNETIDGTNFADQLFGGLGNDTLRGFNGNDVLFGEQGD 595
M.aeruginosa           LNDGDNNSLVSFEFSEDVAGFDNSDVSVSGGTLSDFTQVDGNSYQATFTADDAVETTGSV 327
N.punctiforme          ANTVSWGAIAASGVNINLQAIGNITINDITG-ATPGVTTAGVATLNLGGGSFSLTSQNGS 772
T.erythraeum           LSAEEINECMKVDVTPPPMDINRGLQVHLPLNEIVKNAKSKKEVVDISGAQLNGKVNGAK 752
Nostoc                 LNNINPNLLQASDFIFNTNTVNLNQTIDGTNSPDQLFGGLGNDTLRGFFGNDVLFGEQGD 724
K.koreensis            NASGTYEISAVTTLNGGDAALDVTNSSGVFNFTDLSITNPSGSAINVSTASPTVTVLAGT 764
F.varium               VEVTGKANNIINVTLQDTQTPTSKTPISIAKGIDVTSNGEGGIINIDVKQSNLQVTGTGN 584
C.atlanticus           GGLEDWSLDTNTNTLTNVVFMNNTAGLDAGAFTADGGPGNGGAIHVTGPGNNTITDGSAS 888
O.terrae               GTISTVTVNGATDVTVNSNALATLTLTKVSGTVLLTDELAGHATSGSGTTLALNLNDSAV 548
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              INATATVGSVITFGAITADTTDSVTDNAMVITTSATGVDTGNGNSAVVFGAVNNQYGTIT 1009
A.variabilis           DRFEGGAGNDTFYGGAGNDIAVYSGTRSQYQVTSSGGVFT----------------VTDT 639
M.aeruginosa           SVAAASYSDVAGNDGGAGTDTVTIDTKNPTLAVD-------------------------I 362
N.punctiforme          VNFVDPTNTIQTTGGAINISGASLLLGNLNTTRNFARSGD--------------ITLSAT 818
T.erythraeum           VVADSKFGHCLSFDGADDYLELPTATIPQTGAITISFWAN----------------GDNS 796
Nostoc                 DRFEGGAGNDTFYGGTGNDIAVYLGTRSQYQVTSNGGVFT----------------VTDT 768
K.koreensis            ISHNSANSGVIVSDLTGGTVSLSPAMTLSSSSADAISLTN-------------NSGATIS 811
F.varium               LVNIGAVGGTTNVDINQNVELVGTGNLVNVGAITGTTNVN--------------INKDIK 630
C.atlanticus           GNLAANEGGGFWNGSGVMTIVNTVIDANTASGSDAAVAGAAGGGGIFNEGGTVDISGTAS 948
O.terrae               TNLNQTTATVKEVDFTTSGKKSSIGAWGTAGEVATITVAG---------------DQALD 593
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              LTNSGDTTDGTTTGDLTAVDVTVTSSGGDITIGDVSAS--------------EDFSLTVT 1055
A.variabilis           VANRDGVDTLREVEQIQFSDQTITIGNTSPTITLAVSP--------------ASVTEDGT 685
M.aeruginosa           VDASLNDGDNNSLVSFEFSEDVAGFDNSDVSVSGGTLS---------------DFTQVDG 407
N.punctiforme          TGNISTGNIIASGQGYSAGNVQVVSSNGGITLNQINTS-------------DTGNNGVNT 865
T.erythraeum           LPKDNSIIAAYDQSNNRVINIHLPWNNSYIYFDCGNTG--------------NSYDRIEK 842
Nostoc                 VANRDGVDTLREVEQIQFSDQTITIGNTSPTITLAVSP--------------ASVTEDGT 814
K.koreensis            LAGQLNITTTTGKGLIASGGGTLALGAATNSITTQTGVP-------------ISLNGINV 858
F.varium               VATEGTGNIVKAGAITGTLNINLNVGDKNTTSKSLNVAEG-------------------- 670
C.atlanticus           VTNNIADGAQSTGGGILNASGILTANGTTITGNQSNRAGGGIETNGSSSVTLTDVALDGN 1008
O.terrae               LGNISGLGKLATLTSTATAGVTATVDATKVTVTGGAGN---------------------- 631
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              GGTATVTDFDAETFSLIDASASTGKLTVTGTSSDGNITVLGGSAGDDITLATAIATDKVH 1115
A.variabilis           PNLIYTFTRTGSTTNALTVNYSVAG----------------------------------- 710
M.aeruginosa           NSYTAIFTAD-------------------------------------------------- 417
N.punctiforme          AGTVTLTAAGNILTDRINSFSSDAGSLGRGGDIVATTTAG-------------------- 905
T.erythraeum           LAKAADFKGKWTHWVFTKNVTTGEMKIYLNGALWSSGKDKSKQISGMTLVKLGSGYG--- 899
Nostoc                 PNLIYTFTRTGSTTNALTVNYSVAGTATLNTDYAQTGAASFTATTGTITFAVGASTAILT 874
K.koreensis            HNDGVNFSSVATTGTVASDAVS-------------------------------------- 880
F.varium               ------------------------------------------------------------
C.atlanticus           MTGVVTGPGAPGNGGGLHVSGAAPVTITGGTVSGNTASKEGGGLWNNQGVMTITGTTIDG 1068
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              SISGGGGADTILGSAGAETIIGGAGNDSLDGAAGNDSISGGAGNDTFSFAATGQLNNGDT 1175
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          ------------------------------------------------------------
T.erythraeum           ------------------------------------------------------------
Nostoc                 INPTADTTVESNETVALTLASGTGYTVGTTTAVTGTITNDDFPSITLAVSPASVTEDGTP 934
K.koreensis            ------------------------------------------------------------
F.varium               ------------------------------------------------------------
C.atlanticus           NDAQGDLVADPLEIVGGGGIFAEDGAGSVVIGAGTIISNNFASGTQGSGGGILMATGTTL 1128
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------TLDFDVDDFTITLGG-------DL 17
P.marinus              IDGGDGTDNLDVTTAVDLDSSSTASTSVTLTSIETVDLTLGADNLSFDASGWASLTGLDI 1235
A.variabilis           ------------------------TATLNTDYAQSGATSFTATTGTITFAAGASTAILTI 746
M.aeruginosa           ------------------------DAVETTGSVSVAAASYSDVAGNDGGAGTDTVTIDTK 453
N.punctiforme          ---------------------SITTGAINSSVFKTGGGDFTGTAGAVTLTATENITVNDA 944
T.erythraeum           ----------------------FYHGQVAHLRIYDRVLSTEEINECMKVDGTDATSTTDE 937
Nostoc                 NLIYTFTRTGSTTNALTINFGVAGTATLNTDYAQSGAASFTATTGTITFAAGASTAILTI 994
K.koreensis            ------------------------LTSVNNNSVSLGNVTIAGTSGTADGLDVSSSSASLL 916
F.varium               -------------------------NTALNLANVSDGTTFVKNTGNVTIEGNGTLVKGND 705
C.atlanticus           SIDGSAGAVMITGNSASRAGGGLEDWSLDTNTNTLTNVVFMNNTAGLDAGAFTADGGPGN 1188
O.terrae               -------------------------DVITLSGQVKKASDLGAGDDWVSATLNATDHKTMD 666
                                                                                   

GOS_1318050.36_of      SGSATVTNLGDA--------------TLTATITANS---VALGTDTTGNFVADLTAGEGI 60
P.marinus              AASADTYTPTVNNLRTGVTVTLGNTFVEEATLDTVAGADLTIDAEITDLVSVTITDAENV 1295
A.variabilis           NPTADTTVESNE--------------TVALTLATGTGYTVGTTTAVTGTITNDDTSPTGI 792
M.aeruginosa           NPTLAVDIVDAS--------------LNDGDNNSLVSFEFSEDVAGFDNSDVSVSGGTLS 499
N.punctiforme          INASAIAIGVDGTG---------NVTGGNVTLQTTNTAGSNISFTNINTQSVADDFVDGN 995
T.erythraeum           TDTISTNDVTDVTP------------TTDGTDTTSTTDGTDTTPTTGGTDTTPTTSETDT 985
Nostoc                 NPTADTTVESNE--------------TVALTLASGTGYTVGTTTAVTGTITNDDTLPTGI 1040
K.koreensis            LDSLTADSIAANAINLN--------GANGAITISTVNIDGVSGGAVVINNNSNPVTINGG 968
F.varium               TNTISLSNQGLINVS--------TVDTSNKEKSSHISSGEKVNVANYGTIDLSSSGITVE 757
C.atlanticus           GGAIHVTGPGNNTITDG---------SASGNLAANEGGGFWNGSGVMTIVNTVIDANTAS 1239
O.terrae               SGTLAGGDGTDT-------------LALTGQDAVAASTGTGLAAQISGFEKLEITGSNFG 713
                         :                                                         

GOS_1318050.36_of      DVSGGGS----------ENATITVSAEDATSSNKGIASFDS--------TDFTVSSGAVT 102
P.marinus              TINGEGAAADLVSLVLDATDTKTLTLTADDGTALDTGSITGTDEITTITATTTVASGTIT 1355
A.variabilis           TINLSGSQTIVEGSTSPQNVTYTVTLSQASSQIITVQYATANGTAT-AGSDYTSTTGTLT 851
M.aeruginosa           DFTQVDGNSYQATFTADDAVETTGSVSVAADSYSDVAGNNGGAGTDTVTIDTLNPTVGIT 559
N.punctiforme          TVQGGNVQVLTNGLVQGIGVGTTIATGGIFDSGAGVTTSIAGGTVTIRHDGGPGNVPFIV 1055
T.erythraeum           TPTTDGTDTTLTTDRTGTTSTTDITDATSTTDGTGTTSTTDGTDTT---PTTDGTDTTLT 1042
Nostoc                 TINLSGSQTIVEGNSSPQNVTYTVTLSQASSQIITVQYATANGTAT-AGSDYTSTTGTLT 1099
K.koreensis            TIAATTAVTGKILDVDQGSGNITLAATMSNSQNHVAEVTNRTGGTINVSGQFSDTGLGIS 1028
F.varium               EFIENAGETAKEKTALDQLTIDEVKTALKTLGVINTGAEGTFSSVGYIRFKNGELFTTAK 817
C.atlanticus           GSDAAVAGAAGGGGIFNEGGTVDISGTASVTNNIADGAQSTGGGILNAAGTLTANGTTIT 1299
O.terrae               TVDMHNLDDINQVILSGDFGGAYAIDKLASGASLTIKAGQTGNGTITVPVSAVQTESLAV 773
                                                                                   

GOS_1318050.36_of      VNAER------------------------------------------VQDIVGAMVGSNT 120
P.marinus              QGGGT------------------------------------------IADVDGLTTLNLS 1373
A.variabilis           FNPGE------------------------------------------TSKVINIPILNDS 869
M.aeruginosa           FDNSP------------------------------------------LTGQNFSTTITFQ 577
N.punctiforme          GDATSANG---------------------------------------TAGTIDVGGGTTI 1076
T.erythraeum           TGGND------------------------------------------TTSTNDVTDATST 1060
Nostoc                 FNPGE------------------------------------------TSKVINIPILNDS 1117
K.koreensis            ANNNTAG-------------------------------------TLVFSGTSKVLSTAGS 1051
F.varium               ALSGSQ----------------------------------------TVEGLSSALNQEGS 837
C.atlanticus           GNQSNRAGGGIETNGSGPVTLTDVTLDANQTGVVTGTGAPGNGGGIHVSGDSAVTITGGT 1359
O.terrae               TITSD------------------------------------------TADINAQTLATGN 791
                                                                                   

GOS_1318050.36_of      ESGITVTYED---SDGTLDFNVADPVITLSGDV------------AGSATMTNLGDVTIS 165
P.marinus              ATNASLTLGAMGTDATANNAELLSTITTSATGTGVVLTTGALYADSTVDSTTDLAMTINS 1433
A.variabilis           VNEANETFTLRLTSPTNATLGTTNTVTTTITDT------------LSASVTTTLPTNVEN 917
M.aeruginosa           FSEAVSGFAASDVTLTNGVLSNFTGSGSSYTAT-----------FIATNFQSPTGTVAVS 626
N.punctiforme          ASGNFPVLPTGGDAIGTPTGITITSVNTPPVLT------------ANSSLPNTQTNQPVT 1124
T.erythraeum           TDGTGTTSTTGGNDTASTNDVTVTTPTTDGTDTTLTTDG-----TVTTPTTDGTGTTSTT 1115
Nostoc                 VNEANETFTLRLTSPTNATLGTTNTVTTTITDT------------LSASVTTTLPTNVEN 1165
K.koreensis            NAIVLNSNNSGFVTRFTNGGLDIDTTTGSALSATSSGVLEVSGAGNSITTTSGTALTLVN 1111
F.varium               DRAFAIAKDGNFTLNSAGENEKLENVQFSLDGTMKISG-------TAATPVNIDNSNVER 890
C.atlanticus           SNGNTAANEGGALWNGSGVMTIVDTTIDANTASGNDAMSPGAAGGGGIYNEGGTVDISGT 1419
O.terrae               INTINLNVVDTNDTKNADDTVNFTQAKVVNITGN----------VLSLTSNTGAAGTVDA 841
                                                                                   

GOS_1318050.36_of      TTIQANSIALGTDT------TGNYVSAISA----------GEGIDVSGSGSETATVTISA 209
P.marinus              TTNVGAQTVLGAIDNTYGSITGTFVSHSSDDTDVGNLTAVDMTLTVSGGGDTDFQDLIAS 1493
A.variabilis           LTLTGTAAINGTGN------AGNNVLTGNS----------GNNILSGGAGNDTYAFVANA 961
M.aeruginosa           NDYFDTPGNQGAAN------SANIAMTVGG--------GPDPNDGPSGGGSVIGSGTIGA 672
N.punctiforme          LTFSSLAALVSDANNDITSIQVDVVNTGNLT--------VNGLPVIPGVTTLSSDDTLVY 1176
T.erythraeum           DGTVTTSTTGGTDTTPTTDGTDTTLTTDGT----------DTTLTTDRTGTTSTTDITDA 1165
Nostoc                 LTLTGTAAINGTGN------AGNNILTGNS----------GNNILSGGAGNDTYAFVANA 1209
K.koreensis            SAISANDVSFQSISQSGGTNAITLTNTGTN-----------GSLVVTGVATTVGSGGTIT 1160
F.varium               QVYITGNGKLEVENNAILNYSGNISADNIN--------TAEAAIAITDTGTLTLANGALN 942
C.atlanticus           ASVTNNIADGAQSTGGGILNASGILTANGTTITGNQSNRAGGGIETNGSSSVTLTDVALD 1479
O.terrae               SGMAGGKLILTTTAG-----DGSTIKGTTK---------GSNDITATGAGKITITTGNSA 887
                                                                                   

GOS_1318050.36_of      EDATDSNKGIAS--FDATDFTVSSGDVTVNA----------------------------- 238
P.marinus              GTVAVTASGSGNLTIDDANIDGTSASLSVDASAMSGTVSAVATNTSVATTLTGGSGADTL 1553
A.variabilis           ALGTDTITETATGGIDTIDFSGSTGAVRVNLGVTT-------------SQTVNSNLKLIL 1008
M.aeruginosa           DSITGSTGDDNLSGLDGNDTINGGDGNDTINGGTG------------------------- 707
N.punctiforme          TPPTDVNGSLNAFVISANDRVSSSAPVQVGINVSQIP------------PTIPTPTPTTI 1224
T.erythraeum           TSTTDGTGNTSTTDGTGTTSTTDGTDTTLTTGGND--------------TTSTTGGNDTT 1211
Nostoc                 ALGTDTITETATGGIDTIDFNGSTATVRVNLGVTT-------------SQTVNNNLKLIL 1256
K.koreensis            GLSQEAIRLTDTLAPSFNGLSMSNITREAILGVRVNG---------------ISVTNSSV 1205
F.varium               MVAPESKTALLSEPANRVGIELTGAGTTELDNYTVN-----------------ADIKGKL 985
C.atlanticus           GNMTGVVTGPGAPGNGGGLHVSGAAPVTITGGTVSG----NTASKEGGGLWNNQGVMTIT 1535
O.terrae               DTIAVAAGTTITAGDGANVITASGAGNSITTGKNGD--------------------SITV 927
                                                                                   

GOS_1318050.36_of      ---ERIQDIVG------------------------------------------------- 246
P.marinus              TGGSVADTITGGAGGDTITTNAGDDTVAGGGGNDTITGGAGDNVITGGAGNDSITAGAGF 1613
A.variabilis           SANNVIENATGGTGNDRLTGNALNNILAGGNGNDQLQGLAGNDTLWGGLGDDILTGGAGQ 1068
M.aeruginosa           -----ADIISGGTGNDNIIGLGGFDLIYGGSGNDTINGSNGIDTIVGGFGNDSLTGGGGD 762
N.punctiforme          PTTIPTPTFTPNPPPCSFQCTPGKPNVPDPNNPKIDNPVINTDPTPEDKFTDDFADHLGI 1284
T.erythraeum           STTGGNDTASTTGGNDTTPTTDGTDTTSTTGGTDTISTTDGTDTTLTTGGNDTTSTSDVT 1271
Nostoc                 SANNVIENATGGTGNDRLTGNALNNILAGGNGNDQLQGLAGNDTLWGGLGDDILTGGIGQ 1316
K.koreensis            TNAGSTDADADDDVFGFVREGLGDNGLTGTALFQNLTIADAHERAIDIVNEGSGSLDLDI 1265
F.varium               DGSNLQGTLSAKGKSRITGNVTDISKIEVTDNGMLTFGADSKIESGATSGTTATTIDLAN 1045
C.atlanticus           GTTIDGNDAQGDLVADPLEIVGGGGIFAEDGAGSVVIGAGTIISNNFASGTQGSGGGILM 1595
O.terrae               TATSGTTTISAGDGNDTITVAGQAVSNITLGGGADKVVLSSKAGSAGYFTTISGAGDADI 987
                                                                                   

GOS_1318050.36_of      ----------------------------------AMFSSNTESG---------------- 256
P.marinus              DNIDSGAGDDTIVFAANMSLSDTVAGGDGNDTITATLSSGSASQKVGTLSGVENFTLDFA 1673
A.variabilis           DKYLFQSSGVFSSSLGVDYISQFDVGQDQIVLSKATFNAVTNSAGQ-------------- 1114
M.aeruginosa           TFRFLSIYDQQDVITGFNDTIVFDITGASAFTSLGSGALGTTTYTG-------------- 808
N.punctiforme          PTPRIKTLDDAKEIARKIEEATGVKPAFIYISFVPVEIIPERNLGKAQKFT--------- 1335
T.erythraeum           DATSTTDSTDTTQKPMEYQIYQVKGNPTIKTGYQGSEWTAVIAGFN-------------- 1317
Nostoc                 DKYLFQSSGVFSSSLGVDYISQFDVGQDQIVLSKATFNAVTNSAGQ-------------- 1362
K.koreensis            INVSVNDNDDTQGEDAIRIQSEGTINTDVLVSGGTFNNLELDAVAYFAQG---------- 1315
F.varium               GNMGVEIGEKGKNVLYNTTVGDIAFNNLVANTPQDAKSGKIVLLTN-------------- 1091
C.atlanticus           ATGTTLSIDGSAGAVMITGNSASRAGGGLEDWSLDTNTNTLTNVVFMNNTAGLDAGAFTA 1655
O.terrae               LDLSAVDTGTATFNATAVTLGQGATFSDYVASATAGNVAGNAVVSWFN------------ 1035
                                                                                   

GOS_1318050.36_of      --------------------------------------------------------ISVT 260
P.marinus              QTAGSFDSTNATTAAYTVTNAADAKLIDMDNLATGSTIKITAAFDGLALDYADDATAAIT 1733
A.variabilis           --------------------------------------ALTDFAVVSDDEFVNASSARIV 1136
M.aeruginosa           ------------------------------------------------------AIAGGY 814
N.punctiforme          -------------------------------------------KQLNTLAEQDSDQLEIV 1352
T.erythraeum           -------------------------------------------CGAKKKHKATAFTIMPV 1334
Nostoc                 --------------------------------------ALTDFAVVSDDEFVNASSARIV 1384
K.koreensis            ------------------------------------------------TGTNNVTVTGIT 1327
F.varium               -----------------------------------------------------ALTENTE 1098
C.atlanticus           DGGPGNGGAIHVTGPGNNTITDGSASGNLAAAEGGAFWNGSGTMLVTGTSFDSNIASGAD 1715
O.terrae               ------------------------------------------------------FGGDTY 1041
                                                                                   

GOS_1318050.36_of      YEDSDGTIDLD---------VSDPTLSLQAMSQVQEQ----------------------- 288
P.marinus              FDDNGAAANFDNDTTFAVTDAQTVNITISGGETYTQTGATTLDATDTDYVTITGDASSTI 1793
A.variabilis           YSQGSGSLFYN---------RDGNVLGTGTVFEFARLGNPDITLSSSDFSLIA------- 1180
M.aeruginosa           LTYSGGVLSYD--------ADGSAGSTFSPLAIVTLTGSPTLSNTNVLFQNL-------- 858
N.punctiforme          VVTGKGNPIRKR------IPETTKAKVIQVAQEFRDQIVSPQNRRRTGYLRPSQQLYR-- 1404
T.erythraeum           LDDGEWKIKCD------IKDVDDRYWDVAVLFIRNNMVNMLNHFHR-------------- 1374
Nostoc                 YSQGSGSLFYN---------RDGNVLGTSTVFEFARLGNPDITLSSSDFSLIA------- 1428
K.koreensis            TTNGGGPDNFPN-------GGGIAVVGSNGSTTTFNINNNNLSEVFGEGIQIVG------ 1374
F.varium               FNLGKHKLEQG-------AVVANGDIYYNITQGNGGIWNAIFNKDGLIDKTG-------- 1143
C.atlanticus           ATNGGGALFNIG----GTLTVSGASITNNVVDGTSGSGGGILNVNGGILSVTDTDILGNS 1771
O.terrae               IVVDNGNGTYDP--------AGDVVVRLTGALDLSAAGTNAGIAAGVLTI---------- 1083
                          .                                                        

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              LTHTG---------------AISADNAVTFSLVSTDGAALDFNSTGLATADLLTTFSAST 1838
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          --------------------------------WIIAPLEADLQAREINNLVFLPDMGLRS 1432
T.erythraeum           ------------------------------------------------------------
Nostoc                 ------------------------------------------------------------
K.koreensis            ----------------------------------LAGANQTLTMNGTISGNQMSSMNGDG 1400
F.varium               ------------------------------------------------------------
C.atlanticus           ASRAGGGIEDNSTVDLGDGALVGSVTLFGVQLNNNIAGSAPGNGGGLHLTGGADSNITSS 1831
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              TDGVDAADITFESFVVGNAAAAAKLTSIDLDASHAGDITAGAFDAAGATITSITMDSATS 1898
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          TPMAALHDGKGFLVEKYSIGLMPSISLTNTLYKDIKKSQVLAMGVSQSTQGQEPLPAVPL 1492
T.erythraeum           ------------------------------------------------------------
Nostoc                 ------------------------------------------------------------
K.koreensis            IDLDFDGDVAGSSTINTTIDVNNNTIDFDDDGVGIDFRDTAGTGNFTIRNNTFSVIAGDD 1460
F.varium               ------------------------------------------------------------
C.atlanticus           VINGNTASNEGGGVWNGSGVMTISLTTIDGNTANGAAATNGGGGIFNNASGTINLNTSTV 1891
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              GSVIDIAGQITADSVTNITTSAVSGGSVDFSGALVISTVGTLKHTGAGNFTMDSTSTAIT 1958
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          ELSTLVSKLWQGKLLLDKQATLENLKTIRRQQPFGIIHMATHADFTTGALSNSYIQLWED 1552
T.erythraeum           ------------------------------------------------------------
Nostoc                 ------------------------------------------------------------
K.koreensis            GVTTDSDDGIFIFSDDDVSAGASTLNVAIQNNSFSGIDPLDKNIVVEDIRDAGRSACFNM 1520
F.varium               ------------------------------------------------------------
C.atlanticus           SNNVSTGAAAQGGGIHNKATTTLNVTASTISGNTSASNGGGIYNNGTASILNATIANNTA 1951
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              TLERLDLTGATGTNTVDISGNTNATTINLGTGTNTILLTGAADDVNLSSTAATDTLTYGA 2018
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          KLRLNQLRQLRFNEPEVEMLVLSACRTALGDEESELGFAGLAVLAGVKTSVASLWSVNDA 1612
T.erythraeum           ------------------------------------------------------------
Nostoc                 ------------------------------------------------------------
K.koreensis            TGNSLGVIELDLDATGAGGRVTQASVAAMATDNNGSTVTVIDQLPTFNSTQCSSVPLP-- 1578
F.varium               ------------------------------------------------------------
C.atlanticus           TANGGGVSGESSVTVKGSIIATN--TAATGTDVDGTFVSNDYNLIGDDSGNAFPESANDI 2009
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              STSVPSISNFAFGSGADIINIDLSEIEAALTIDLHDGNAADVNAAETFSIKEISADTTLA 2078
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          GTAALMTKFYQNLRTAPIKAEALRQAQVAMAKGQIYVKNGQLEG--LGVVGNLSLPTNSA 1670
T.erythraeum           ------------------------------------------------------------
Nostoc                 ------------------------------------------------------------
K.koreensis            ------------------------------------------------------------
F.varium               ------------------------------------------------------------
C.atlanticus           ENVNPAIGPLADNGGTTLTHMLLSTSAAANAGDPGDTSLDQIGNAVFGDARDIGALEDQD 2069
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      ------------------------------------------------------------
P.marinus              AGDDVLVLVGSTFATTDLVETAIETGDFELTYGTALADNDGTLLFYSDGTDMYLAVSKAG 2138
A.variabilis           ------------------------------------------------------------
M.aeruginosa           ------------------------------------------------------------
N.punctiforme          DEGEQLLTHPYYWSGFTMVGNPW------------------------------------- 1693
T.erythraeum           ------------------------------------------------------------
Nostoc                 ------------------------------------------------------------
K.koreensis            ------------------------------------------------------------
F.varium               ------------------------------------------------------------
C.atlanticus           ALG--VEDFGNSLSDTVLFPNPSTNGNVSLNIPTNITNTVTIRVIDMTGKQVHSQQATSG 2127
O.terrae               ------------------------------------------------------------
                                                                                   

GOS_1318050.36_of      -----------------------------------------
P.marinus              DTPTANLATGETTVSNLMKFEGVTSISAGDITAGDIVLVA- 2178
A.variabilis           -----------------------------------------
M.aeruginosa           -----------------------------------------
N.punctiforme          -----------------------------------------
T.erythraeum           -----------------------------------------
Nostoc                 -----------------------------------------
K.koreensis            -----------------------------------------
F.varium               -----------------------------------------
C.atlanticus           SVTLN--------LNRLAVGTYLVNISDGNTSSTLKLLMSR 2160
O.terrae               -----------------------------------------

Protein Domains

PROTOCOL


InterPro, default parameters at EBI.


RESULTS ANALYSIS


InterPro found no protein domains in the longest ORF: the search returned "no hits reported".

The BLAST search likewise reported "No putative conserved domains have been detected.", which is further evidence that this sequence contains no known conserved domains.

The second-longest ORF (number 2, in reading frame 1 on the reverse strand) is predicted to contain a transmembrane signal peptide. This prediction is not considered significant, however, because the E-value of its BLASTp vs ENV_NR search (2e-04) is worse than the one obtained for the longest ORF.
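
To recall why a larger E-value means weaker evidence, the standard Karlin-Altschul relation between bit score and E-value can be sketched as below. The function name and the example lengths are illustrative assumptions, not values from this annotation, and BLAST itself uses effective (edge-corrected) lengths, so real E-values will differ somewhat.

```python
def expected_hits(bit_score: float, query_len: int, db_len: int) -> float:
    """Approximate E-value: E = m * n * 2**(-S'), where S' is the bit score,
    m the query length and n the total database length."""
    return query_len * db_len * 2.0 ** (-bit_score)


# Hypothetical numbers, for illustration only.
weak = expected_hits(bit_score=40.0, query_len=100, db_len=10**9)
strong = expected_hits(bit_score=80.0, query_len=100, db_len=10**9)
print(weak > strong)  # a lower bit score gives a larger (worse) E-value
```

For a fixed search space, every additional bit of score halves the number of hits expected by chance.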




RAW RESULTS

Phylogeny

PROTOCOL



RESULTS ANALYSIS


It was not possible to build a phylogenetic tree. The score of BLASTx vs NR, the BLAST search that gave the best results, was not very high (79.7), the percentage of identical residues was very low, and the number of positive (similar) amino acids was small. As a consequence, the multiple alignment was of poor quality and could not produce a valid tree.
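
The identity figure mentioned above can be recomputed for any pair of rows taken from a multiple alignment. The following is a minimal sketch, assuming two equal-length gapped strings; the function name is hypothetical, and columns with a gap in either row are skipped, which is only one of several conventions (BLAST divides by the full alignment length). Counting positives would additionally need a substitution matrix such as BLOSUM62, which is not shown here.

```python
def percent_identity(row_a: str, row_b: str, gap: str = "-") -> float:
    """Percentage of identical residues over columns where neither row has a gap."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(row_a, row_b):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


# Toy rows, not taken from the alignment on this page.
print(percent_identity("ACD-EF", "ACE-EG"))
```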

For BLASTp vs ENV_NR, no multiple alignment or tree could be produced, because only an ingroup was available and no outgroups were present.

RAW RESULTS

Taxonomy report

PROTOCOL


a) BLASTx vs NR, default NCBI parameters, with "1000 max target sequences"

b) BLASTp vs ENV_NR, default NCBI parameters, with "1000 max target sequences"


RESULTS ANALYSIS


a) The sequence probably belongs to a cyanobacterium: in the Lineage Report, the first 12 entries, which are all cyanobacteria, have relatively high scores, and the BLASTx results are biologically significant, although the scores are not very high.

The ingroup consisted of 12 cyanobacterial entries, which include the best score (79). The outgroups came from the phyla Verrucomicrobia, green sulfur bacteria (GSB), Fusobacteria, CFB group bacteria and Proteobacteria (class Gammaproteobacteria).
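
The ingroup and outgroup scores can be tabulated directly from the Lineage Report text. Below is a rough sketch, assuming entry lines of the form "name, filler dots or dashes, score, N hits, [group]" as in the RAW RESULTS; the regular expression and function names are illustrative, and the sample lines are abbreviated copies of two report entries.

```python
import re
from collections import defaultdict

# One organism entry: leading ". " indentation, name, filler, score, hits, [group].
ENTRY = re.compile(
    r"^(?:\. )+(?P<name>.+?) [-.]{2,}\s+(?P<score>\d+)\s+(?P<hits>\d+) hits "
    r"\[(?P<group>[^\]]+)\]"
)


def parse_lineage_report(text):
    """Return (name, score, hits, group) tuples for every organism entry."""
    entries = []
    for line in text.splitlines():
        match = ENTRY.match(line.strip())
        if match:  # taxon header lines without a score are ignored
            entries.append((match["name"], int(match["score"]),
                            int(match["hits"]), match["group"]))
    return entries


def best_score_by_group(entries):
    """Highest score seen for each bracketed taxonomic group."""
    best = defaultdict(int)
    for _name, score, _hits, group in entries:
        best[group] = max(best[group], score)
    return dict(best)


sample = """\
. . . Cyanobacteria      [cyanobacteria]
. . . . Synechococcus sp. RS9917 ---------------   66 138 hits [cyanobacteria]   hypothetical protein
. . . Gemmata obscuriglobus UQM 2246 ...........   72  26 hits [planctomycetes]  FG-GAP repeat protein
"""
print(best_score_by_group(parse_lineage_report(sample)))
```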

Some of the ingroup and outgroup sequences were discarded because their FASTA sequences were too long compared with the GOS sequence; removing them gave a better multiple alignment.

The discarded sequences were: Microcoleus chthonoplastes PCC 7420, Lyngbya sp. PCC 8106, Synechococcus sp. RS9916, Synechocystis sp. PCC 6803, Synechococcus sp. WH 7805 and Chlorobium limicola DSM 245.
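
A length-based exclusion of this kind could be automated along the following lines. This is only a sketch: the 3x cutoff, the function name and the toy sequences are assumptions made for illustration, since the annotation does not state which threshold was applied. The query length of 288 residues is that of the translated GOS ORF.

```python
def split_by_length(query_len, homologs, max_ratio=3.0):
    """Separate homologs into kept and discarded sets by length relative to the query.

    homologs maps a sequence name to its amino-acid string; max_ratio is a
    hypothetical cutoff, not a value taken from the annotation.
    """
    kept, discarded = {}, {}
    for name, seq in homologs.items():
        target = discarded if len(seq) > max_ratio * query_len else kept
        target[name] = seq
    return kept, discarded


# Toy data: placeholder sequences of 300 and 2500 residues.
homologs = {"short_hit": "A" * 300, "long_hit": "A" * 2500}
kept, discarded = split_by_length(288, homologs)
print(sorted(kept), sorted(discarded))
```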


b) Since BLASTp vs ENV_NR returned only one result, and its classification is very broad (marine metagenome), it is not possible to define ingroups and outgroups.

With the help of SCOP, it is safe to assume that the organism containing the ORF under study belongs to the marine metagenome family.

RAW RESULTS

Lineage Report

root
. cellular organisms
. . Bacteria           [bacteria]
. . . Cyanobacteria      [cyanobacteria]
. . . . Oscillatoriales    [cyanobacteria]
. . . . . Microcoleus chthonoplastes PCC 7420 ----------------------------------   79  28 hits [cyanobacteria]          haemagglutination activity domain protein [Microcoleus chth
. . . . . Trichodesmium erythraeum IMS101 ......................................   56   4 hits [cyanobacteria]          hypothetical protein Tery_2255 [Trichodesmium erythraeum IM
. . . . . Lyngbya sp. PCC 8106 .................................................   56  14 hits [cyanobacteria]          hypothetical protein L8106_15695 [Lyngbya sp. PCC 8106] >gi
. . . . Synechococcus sp. RS9917 -----------------------------------------------   66 138 hits [cyanobacteria]          hypothetical protein RS9917_01402 [Synechococcus sp. RS9917
. . . . Prochlorococcus marinus str. NATL1A ....................................   63  14 hits [cyanobacteria]          hypothetical protein NATL1_21051 [Prochlorococcus marinus s
. . . . Synechococcus sp. RS9916 ...............................................   61  24 hits [cyanobacteria]          cell wall surface anchor family protein [Synechococcus sp. 
. . . . Anabaena variabilis ATCC 29413 .........................................   60  20 hits [cyanobacteria]          VCBS [Anabaena variabilis ATCC 29413] >gi|75704083|gb|ABA23
. . . . Microcystis aeruginosa NIES-843 ........................................   58   2 hits [cyanobacteria]          hypothetical protein MAE_00840 [Microcystis aeruginosa NIES
. . . . Synechocystis sp. PCC 6803 .............................................   58  26 hits [cyanobacteria]          hypothetical protein slr0364 [Synechocystis sp. PCC 6803] >
. . . . Nostoc punctiforme PCC 73102 ...........................................   56   4 hits [cyanobacteria]          filamentous haemagglutinin outer membrane protein [Nostoc p
. . . . Synechococcus sp. WH 7805 ..............................................   56  24 hits [cyanobacteria]          Large exoprotein involved in heme utilization or adhesion [
. . . . Nostoc sp. PCC 7120 ....................................................   54   2 hits [cyanobacteria]          hypothetical protein all3346 [Nostoc sp. PCC 7120] >gi|1713
. . . Candidatus Pelagibacter ubique HTCC1002 ----------------------------------   74  60 hits [a-proteobacteria]       hypothetical protein PU1002_01715 [Candidatus Pelagibacter 
. . . Gemmata obscuriglobus UQM 2246 ...........................................   72  26 hits [planctomycetes]         FG-GAP repeat protein [Gemmata obscuriglobus UQM 2246]
. . . Magnetospirillum magnetotacticum MS-1 ....................................   68   9 hits [a-proteobacteria]       COG5295: Autotransporter adhesin [Magnetospirillum magnetot
. . . Staphylococcus aureus subsp. aureus WBG10049 .............................   66  18 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus WBG1
. . . Silicibacter sp. TrichCH4B ...............................................   66  22 hits [a-proteobacteria]       outer membrane autotransporter barrel domain protein [Silic
. . . Rickettsia bellii RML369-C ...............................................   66   4 hits [a-proteobacteria]       cell surface antigen Sca3 [Rickettsia bellii RML369-C] >gi|
. . . Candidatus Pelagibacter ubique HTCC1062 ..................................   66  52 hits [a-proteobacteria]       hypothetical protein SAR11_0932 [Candidatus Pelagibacter ub
. . . Acidiphilium cryptum JF-5 ................................................   65  18 hits [a-proteobacteria]       filamentous haemagglutinin outer membrane protein [Acidiphi
. . . Stenotrophomonas sp. SKA14 ...............................................   65  32 hits [g-proteobacteria]       outer membrane autotransporter barrel domain protein [Steno
. . . Burkholderia cenocepacia J2315 ...........................................   65  84 hits [b-proteobacteria]       putative haemagglutinin-related autotransporter protein [Bu
. . . Rickettsia australis .....................................................   65   5 hits [a-proteobacteria]       outer membrane protein A [Rickettsia australis]
. . . Shigella dysenteriae Sd197 ...............................................   65   4 hits [enterobacteria]         hypothetical protein SDY_0423 [Shigella dysenteriae Sd197] 
. . . Oceanicola granulosus HTCC2516 ...........................................   65  40 hits [a-proteobacteria]       type I secretion target repeat protein [Oceanicola granulos
. . . Rhodobacterales bacterium HTCC2654 .......................................   65  20 hits [a-proteobacteria]       putative RTX toxin [Rhodobacterales bacterium HTCC2654] >gi
. . . Beutenbergia cavernae DSM 12333 ..........................................   64  38 hits [high GC Gram+]          Ig domain protein group 1 domain protein [Beutenbergia cave
. . . beta proteobacterium KB13 ................................................   64  24 hits [b-proteobacteria]       hemagglutination activity domain protein [beta proteobacter
. . . Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 .................   64   4 hits [firmicutes]             hypothetical protein LEUM_1286 [Leuconostoc mesenteroides s
. . . Escherichia coli O55:H7 str. CB9615 ......................................   64  36 hits [enterobacteria]         hypothetical protein G2583_0601 [Escherichia coli O55:H7 st
. . . Escherichia coli O157:H7 str. FRIK2000 ...................................   64  12 hits [enterobacteria]         putative RTX family exoprotein [Escherichia coli O157:H7 st
. . . Escherichia coli O157:H7 str. FRIK966 ....................................   64  12 hits [enterobacteria]         putative RTX family exoprotein [Escherichia coli O157:H7 st
. . . Escherichia coli O157:H7 str. EC4401 .....................................   64  24 hits [enterobacteria]         BNR/Asp-box repeat domain protein [Escherichia coli O157:H7
. . . Escherichia coli O157:H7 str. EC4024 .....................................   64  12 hits [enterobacteria]         BNR/Asp-box repeat domain protein [Escherichia coli O157:H7
. . . Escherichia coli O157:H7 str. TW14359 ....................................   64  24 hits [enterobacteria]         BNR/Asp-box repeat domain protein [Escherichia coli O157:H7
. . . Escherichia coli O157:H7 str. EC508 ......................................   64  26 hits [enterobacteria]         large repetitive protein [Escherichia coli O157:H7 str. EC5
. . . Escherichia coli O157:H7 str. EC4501 .....................................   64  26 hits [enterobacteria]         large repetitive protein [Escherichia coli O157:H7 str. EC4
. . . Escherichia coli O157:H7 str. TW14588 ....................................   64  26 hits [enterobacteria]         large repetitive protein [Escherichia coli O157:H7 str. EC4
. . . Escherichia coli O157:H7 str. Sakai ......................................   64  26 hits [enterobacteria]         hypothetical protein ECs0542 [Escherichia coli O157:H7 str.
. . . Dokdonia donghaensis MED134 ..............................................   64  16 hits [CFB group bacteria]     hypothetical protein MED134_12246 [Dokdonia donghaensis MED
. . . Escherichia coli B354 ....................................................   63  40 hits [enterobacteria]         conserved hypothetical protein [Escherichia coli B354] >gi|
. . . Verrucomicrobiae bacterium DG1235 ........................................   63  48 hits [verrucomicrobia]        Putative Ig domain family [Verrucomicrobiae bacterium DG123
. . . delta proteobacterium MLMS-1 .............................................   63   4 hits [d-proteobacteria]       Flagellin-like:transferase hexapeptide repeat [delta proteo
. . . Staphylococcus aureus subsp. aureus WW2703/97 ............................   63   4 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus WW27
. . . Staphylococcus aureus subsp. aureus 65-1322 ..............................   63  10 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus 65-1
. . . Staphylococcus aureus subsp. aureus 68-397 ...............................   63  10 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus 65-1
. . . Staphylococcus aureus subsp. aureus E1410 ................................   63  10 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus 65-1
. . . Staphylococcus aureus subsp. aureus M876 .................................   63  10 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus 65-1
. . . Staphylococcus aureus subsp. aureus C101 .................................   63  10 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus 65-1
. . . Staphylococcus aureus subsp. aureus M809 .................................   63  10 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus 65-1
. . . Staphylococcus aureus subsp. aureus 55/2053 ..............................   63  10 hits [firmicutes]             serine-rich repeat-containing protein [Staphylococcus aureu
. . . Staphylococcus aureus subsp. aureus str. JKD6008 .........................   63   5 hits [firmicutes]             serine-rich repeat-containing protein [Staphylococcus aureu
. . . Staphylococcus aureus subsp. aureus TW20 .................................   63   5 hits [firmicutes]             serine-rich repeat-containing protein [Staphylococcus aureu
. . . Psychroflexus torquis ATCC 700755 ........................................   63   6 hits [CFB group bacteria]     Polymorphic membrane protein [Psychroflexus torquis ATCC 70
. . . Staphylococcus aureus subsp. aureus MRSA252 ..............................   63   6 hits [firmicutes]             serine-rich repeat-containing protein [Staphylococcus aureu
. . . Staphylococcus aureus subsp. aureus MN8 ..................................   63   4 hits [firmicutes]             serine-rich repeat-containing protein [Staphylococcus aureu
. . . Staphylococcus aureus subsp. aureus Btn1260 ..............................   63   4 hits [firmicutes]             serine-rich repeat-containing protein [Staphylococcus aureu
. . . Kangiella koreensis DSM 16069 ............................................   62  10 hits [g-proteobacteria]       hypothetical protein Kkor_2606 [Kangiella koreensis DSM 160
. . . Moritella sp. PE36 .......................................................   62  36 hits [g-proteobacteria]       fibronectin type III domain protein [Moritella sp. PE36] >g
. . . Desulfococcus oleovorans Hxd3 ............................................   62  50 hits [d-proteobacteria]       YadA domain-containing protein [Desulfococcus oleovorans Hx
. . . Shewanella woodyi ATCC 51908 .............................................   62 100 hits [g-proteobacteria]       outer membrane adhesin like proteiin [Shewanella woodyi ATC
. . . Chlorobium ferrooxidans DSM 13031 ........................................   62  58 hits [green sulfur bacteria]  Polymorphic membrane protein, Chlamydia:Haemagluttinin:Fila
. . . Pectobacterium atrosepticum SCRI1043 .....................................   62  44 hits [enterobacteria]         putative hemagglutinin/hemolysin-related protein [Pectobact
. . . Acidaminococcus fermentans DSM 20731 .....................................   62  12 hits [firmicutes]             Hemagluttinin domain protein [Acidaminococcus fermentans DS
. . . Burkholderia multivorans CGD2M ...........................................   62  60 hits [b-proteobacteria]       outer membrane autotransporter barrel domain protein [Burkh
. . . Burkholderia multivorans CGD2 ............................................   62  60 hits [b-proteobacteria]       outer membrane autotransporter barrel domain protein [Burkh
. . . Burkholderia multivorans ATCC 17616 ......................................   62  56 hits [b-proteobacteria]       outer membrane autotransporter [Burkholderia multivorans AT
. . . Sulfurimonas denitrificans DSM 1251 ......................................   62   6 hits [e-proteobacteria]       hypothetical protein Suden_1952 [Sulfurimonas denitrificans
. . . Chlorobium chlorochromatii CaD3 ..........................................   62 162 hits [green sulfur bacteria]  VCBS [Chlorobium chlorochromatii CaD3] >gi|78170913|gb|ABB2
. . . Oceanicola batsensis HTCC2597 ............................................   62   6 hits [a-proteobacteria]       hypothetical protein OB2597_13888 [Oceanicola batsensis HTC
. . . Escherichia coli FVEC1412 ................................................   62  36 hits [enterobacteria]         conserved hypothetical protein [Escherichia coli FVEC1412] 
. . . Curvibacter putative symbiont of Hydra magnipapillata ....................   62  15 hits [b-proteobacteria]       hypothetical protein [Curvibacter putative symbiont of Hydr
. . . Burkholderia multivorans CGD1 ............................................   62  92 hits [b-proteobacteria]       outer membrane autotransporter barrel domain protein [Burkh
. . . Escherichia coli UMN026 ..................................................   62  38 hits [enterobacteria]         adhesin for cattle intestine colonization [Escherichia coli
. . . Roseobacter sp. GAI101 ...................................................   62  16 hits [a-proteobacteria]       outer membrane autotransporter barrel [Roseobacter sp. GAI1
. . . Rhodospirillum centenum SW ...............................................   62   8 hits [a-proteobacteria]       S-layer protein [Rhodospirillum centenum SW] >gi|209957622|
. . . Burkholderia cenocepacia MC0-3 ...........................................   62  68 hits [b-proteobacteria]       outer membrane autotransporter [Burkholderia cenocepacia MC
. . . Azorhizobium caulinodans ORS 571 .........................................   62  68 hits [a-proteobacteria]       hypothetical protein AZC_1915 [Azorhizobium caulinodans ORS
. . . Burkholderia ambifaria MC40-6 ............................................   62  44 hits [b-proteobacteria]       hypothetical protein BamMC406_6074 [Burkholderia ambifaria 
. . . Burkholderia cenocepacia HI2424 ..........................................   62  76 hits [b-proteobacteria]       outer membrane autotransporter [Burkholderia cenocepacia HI
. . . Burkholderia cenocepacia AU 1054 .........................................   62  42 hits [b-proteobacteria]       Outer membrane autotransporter barrel [Burkholderia cenocep
. . . Staphylococcus aureus subsp. aureus MSSA476 ..............................   62   9 hits [firmicutes]             putative cell wall-anchored protein [Staphylococcus aureus 
. . . Staphylococcus aureus subsp. aureus MW2 ..................................   62   9 hits [firmicutes]             hypothetical protein MW2575 [Staphylococcus aureus subsp. a
. . . Magnetococcus sp. MC-1 ...................................................   62  28 hits [proteobacteria]         filamentous haemagglutinin outer membrane protein [Magnetoc
. . . Geobacter uraniireducens Rf4 .............................................   61   4 hits [d-proteobacteria]       Cna B domain-containing protein [Geobacter uraniireducens R
. . . Saccharophagus degradans 2-40 ............................................   61  16 hits [g-proteobacteria]       hypothetical protein Sde_0798 [Saccharophagus degradans 2-4
. . . Escherichia coli O157:H7 EDL933 ..........................................   61  26 hits [enterobacteria]         RTX family exoprotein [Escherichia coli O157:H7 EDL933] >gi
. . . Staphylococcus aureus subsp. aureus USA300_TCH959 ........................   61   6 hits [firmicutes]             cell wall-anchored protein [Staphylococcus aureus subsp. au
. . . Gemmatimonas aurantiaca T-27 .............................................   61   8 hits [bacteria]               hypothetical protein GAU_2363 [Gemmatimonas aurantiaca T-27
. . . Verrucomicrobium spinosum DSM 4136 .......................................   61   4 hits [verrucomicrobia]        Outer membrane autotransporter barrel [Verrucomicrobium spi
. . . Haemophilus influenzae ...................................................   61   4 hits [g-proteobacteria]       HmwA [Haemophilus influenzae]
. . . Chlorobium luteolum DSM 273 ..............................................   61  22 hits [green sulfur bacteria]  VCBS [Chlorobium luteolum DSM 273] >gi|78166169|gb|ABB23267
. . . Psychrobacter arcticus 273-4 .............................................   61  72 hits [g-proteobacteria]       hypothetical protein Psyc_1601 [Psychrobacter arcticus 273-
. . . Desulfuromonas acetoxidans DSM 684 .......................................   61  40 hits [d-proteobacteria]       flagellin-like [Desulfuromonas acetoxidans DSM 684] >gi|951
. . . Staphylococcus aureus subsp. aureus ST398 ................................   60   5 hits [firmicutes]             serine-rich adhesin for platelets (Staphylococcus aureussur
. . . Ralstonia solanacearum IPO1609 ...........................................   60  24 hits [b-proteobacteria]       hemagglutinin-related protein [Ralstonia solanacearum IPO16
. . . Chthoniobacter flavus Ellin428 ...........................................   60  94 hits [verrucomicrobia]        Parallel beta-helix repeat protein [Chthoniobacter flavus E
. . . Roseovarius sp. TM1035 ...................................................   60  18 hits [a-proteobacteria]       Large exoprotein [Roseovarius sp. TM1035] >gi|149143568|gb|
. . . Shewanella amazonensis SB2B ..............................................   60  16 hits [g-proteobacteria]       putative outer membrane adhesin like protein [Shewanella am
. . . Staphylococcus aureus subsp. aureus H19 ..................................   60  10 hits [firmicutes]             predicted protein [Staphylococcus aureus subsp. aureus H19]
. . . Staphylococcus aureus A9635 ..............................................   60  10 hits [firmicutes]             cell wall-anchored protein [Staphylococcus aureus A9635] >g
. . . Desulfatibacillum alkenivorans AK-01 .....................................   60  88 hits [d-proteobacteria]       filamentous hemagglutinin family outer membrane protein [De
. . . Staphylococcus aureus ....................................................   60  12 hits [firmicutes]             RecName: Full=Serine-rich adhesin for platelets; AltName: F
. . . Azospirillum sp. B510 ....................................................   60  30 hits [a-proteobacteria]       hypothetical protein AZL_f01290 [Azospirillum sp. B510] >gi
. . . Pirellula staleyi DSM 6068 ...............................................   60  18 hits [planctomycetes]         peptidase domain protein [Pirellula staleyi DSM 6068] >gi|2
. . . Staphylococcus aureus A8115 ..............................................   60   6 hits [firmicutes]             hypothetical protein SAJG_01441 [Staphylococcus aureus A811
. . . Staphylococcus aureus A8117 ..............................................   60   6 hits [firmicutes]             hypothetical protein SAJG_01441 [Staphylococcus aureus A811
. . . Octadecabacter antarcticus 307 ...........................................   60   8 hits [a-proteobacteria]       type I secretion target GGXGXDXXX repeat protein domain pro
. . . Caulobacter sp. K31 ......................................................   60   2 hits [a-proteobacteria]       hemolysin-type calcium-binding region [Caulobacter sp. K31]
. . . Burkholderia xenovorans LB400 ............................................   60  42 hits [b-proteobacteria]       adhesin HecA [Burkholderia xenovorans LB400] >gi|91692584|g
. . . Staphylococcus aureus subsp. aureus Mu50 .................................   60  12 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus subsp. aureus N315 .................................   60  12 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus subsp. aureus JH9 ..................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus subsp. aureus JH1 ..................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus subsp. aureus Mu3 ..................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus subsp. aureus str. CF-Marseille ....................   60   4 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus subsp. aureus Mu50-omega ...........................   60   4 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A9781 ..............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A9763 ..............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A9719 ..............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A9299 ..............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A6300 ..............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A6224 ..............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A5937 ..............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus subsp. aureus ED98 .................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus A10102 .............................................   60   8 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Staphylococcus aureus 04-02981 ...........................................   60   4 hits [firmicutes]             serine-threoinine rich antigen [Staphylococcus aureus subsp
. . . Streptococcus gallolyticus UCN34 .........................................   59   8 hits [firmicutes]             conserved hypothetical secreted protein [Streptococcus gall
. . . Brucella abortus bv. 3 str. Tulya ........................................   59  12 hits [a-proteobacteria]       outer membrane autotransporter barrel domain-containing pro
. . . Pedobacter heparinus DSM 2366 ............................................   59  14 hits [CFB group bacteria]     Fibronectin type III domain protein [Pedobacter heparinus D
. . . Ralstonia solanacearum MolK2 .............................................   59  20 hits [b-proteobacteria]       hemagglutinin-related (transposon inactivated) protein [Ral
. . . Cellvibrio japonicus Ueda107 .............................................   59   8 hits [g-proteobacteria]       Putative Ig domain family [Cellvibrio japonicus Ueda107] >g
. . . Burkholderia phymatum STM815 .............................................   59  46 hits [b-proteobacteria]       YadA domain-containing protein [Burkholderia phymatum STM81
. . . Agrobacterium vitis S4 ...................................................   59   8 hits [a-proteobacteria]       Ca 2+ binding protein [Agrobacterium vitis S4] >gi|22173682
. . . Labrenzia aggregata IAM 12614 ............................................   59  12 hits [a-proteobacteria]       fat protein-possibly involved in cell-cell attachment [Stap
. . . Burkholderia ambifaria AMMD ..............................................   59  62 hits [b-proteobacteria]       adhesin [Burkholderia ambifaria AMMD] >gi|115285477|gb|ABI9
. . . Staphylococcus haemolyticus JCSC1435 .....................................   59  60 hits [firmicutes]             hypothetical protein SH0326 [Staphylococcus haemolyticus JC
. . . Desulfotalea psychrophila LSv54 ..........................................   59   6 hits [d-proteobacteria]       hypothetical protein DP2105 [Desulfotalea psychrophila LSv5
. . . Chloroflexus aurantiacus J-10-fl .........................................   59   6 hits [GNS bacteria]           polymorphic outer membrane protein [Chloroflexus aurantiacu
. . . Chloroflexus sp. Y-400-fl ................................................   59   6 hits [GNS bacteria]           polymorphic outer membrane protein [Chloroflexus aurantiacu
. . . Enterococcus faecium TX1330 ..............................................   58   4 hits [firmicutes]             surface protein from Gram-positive cocci [Enterococcus faec
. . . Stenotrophomonas maltophilia K279a .......................................   58  32 hits [g-proteobacteria]       putative glycine-rich autotransporter protein [Stenotrophom
. . . Candidatus Kuenenia stuttgartiensis ......................................   58   7 hits [planctomycetes]         unknown protein [Candidatus Kuenenia stuttgartiensis]
. . . Haemophilus influenzae R2846 .............................................   58   2 hits [g-proteobacteria]       COG3210: Large exoproteins involved in heme utilization or 
. . . Pantoea ananatis LMG 20103 ...............................................   58  16 hits [enterobacteria]         YeeJ [Pantoea ananatis LMG 20103] >gi|291153190|gb|ADD77774
. . . Staphylococcus aureus subsp. aureus 132 ..................................   58   3 hits [firmicutes]             cell wall anchor domain-containing protein [Staphylococcus 
. . . Planctomyces limnophilus DSM 3776 ........................................   58  42 hits [planctomycetes]         predicted polymerase with PALM domain, HD hydrolase domain 
. . . Escherichia fergusonii ATCC 35469 ........................................   58  34 hits [enterobacteria]         adhesin for cattle intestine colonization [Escherichia ferg
. . . Chlorobium limicola DSM 245 ..............................................   58   2 hits [green sulfur bacteria]  Hemolysin-type calcium-binding region [Chlorobium limicola 
. . . Elusimicrobium minutum Pei191 ............................................   58   2 hits [bacteria]               outer membrane autotransporter [Elusimicrobium minutum Pei1
. . . Aeromonas hydrophila subsp. hydrophila ATCC 7966 .........................   58  20 hits [g-proteobacteria]       structural toxin protein RtxA [Aeromonas hydrophila subsp. 
. . . Chlorobium phaeobacteroides DSM 266 ......................................   58  10 hits [green sulfur bacteria]  hemolysin-type calcium-binding region [Chlorobium phaeobact
. . . Burkholderia sp. CCGE1003 ................................................   58  24 hits [b-proteobacteria]       hypothetical protein BC1003DRAFT_4277 [Burkholderia sp. CCG
. . . Staphylococcus aureus A9765 ..............................................   58   2 hits [firmicutes]             LPXTG-domain-containing protein cell wall surface anchor fa
. . . Desulfovibrio magneticus RS-1 ............................................   58  12 hits [d-proteobacteria]       hypothetical protein DMR_29330 [Desulfovibrio magneticus RS
. . . Hoeflea phototrophica DFL-43 .............................................   58  14 hits [a-proteobacteria]       iron-regulated protein FrpC [Hoeflea phototrophica DFL-43] 
. . . Ralstonia pickettii 12D ..................................................   58   8 hits [b-proteobacteria]       filamentous hemagglutinin family outer membrane protein [Ra
. . . Staphylococcus aureus subsp. aureus str. Newman ..........................   58   6 hits [firmicutes]             hypothetical protein NWMN_2553 [Staphylococcus aureus subsp
. . . Staphylococcus aureus A5948 ..............................................   58   6 hits [firmicutes]             hypothetical protein NWMN_2553 [Staphylococcus aureus subsp
. . . Magnetospirillum gryphiswaldense MSR-1 ...................................   58  15 hits [a-proteobacteria]       conserved hypothetical protein [Magnetospirillum gryphiswal
. . . Nitrobacter hamburgensis X14 .............................................   58  10 hits [a-proteobacteria]       Outer membrane autotransporter barrel [Nitrobacter hamburge
. . . Staphylococcus aureus subsp. aureus NCTC 8325 ............................   58   9 hits [firmicutes]             hypothetical protein SAOUHSC_02990 [Staphylococcus aureus s
. . . Staphylococcus aureus subsp. aureus USA300_FPR3757 .......................   58   6 hits [firmicutes]             cell wall anchor domain-containing protein [Staphylococcus 
. . . Staphylococcus aureus subsp. aureus USA300_TCH1516 .......................   58   6 hits [firmicutes]             cell wall anchor domain-containing protein [Staphylococcus 
. . . Staphylococcus aureus subsp. aureus USA300 ...............................   58   3 hits [firmicutes]             cell wall anchor domain-containing protein [Staphylococcus 
. . . Staphylococcus aureus subsp. aureus COL ..................................   58  15 hits [firmicutes]             LPXTG cell wall surface anchor family protein [Staphylococc
. . . Photobacterium profundum SS9 .............................................   58  14 hits [g-proteobacteria]       hypotetical protein [Photobacterium profundum SS9] >gi|4691
. . . Chelativorans sp. BNC1 ...................................................   58   4 hits [a-proteobacteria]       outer membrane autotransporter [Mesorhizobium sp. BNC1] >gi
. . . Brucella suis bv. 5 str. 513 .............................................   57   8 hits [a-proteobacteria]       outer membrane transporter [Brucella suis bv. 5 str. 513] >
. . . Slackia heliotrinireducens DSM 20476 .....................................   57   2 hits [high GC Gram+]          hypothetical protein Shel_23850 [Slackia heliotrinireducens
. . . Octadecabacter antarcticus 238 ...........................................   57  10 hits [a-proteobacteria]       outer membrane autotransporter barrel domain, putative [Oct
. . . Methylobacterium sp. 4-46 ................................................   57   8 hits [a-proteobacteria]       structural toxin protein RtxA [Methylobacterium sp. 4-46] >
. . . Burkholderia cenocepacia PC184 ...........................................   57  16 hits [b-proteobacteria]       hypothetical protein BCPG_05009 [Burkholderia cenocepacia P
. . . Psychromonas ingrahamii 37 ...............................................   57   6 hits [g-proteobacteria]       cadherin domain-containing protein [Psychromonas ingrahamii
. . . Hahella chejuensis KCTC 2396 .............................................   57  18 hits [g-proteobacteria]       outer membrane protein domain-containing protein [Hahella c
. . . Chlorobium phaeovibrioides DSM 265 .......................................   57  14 hits [green sulfur bacteria]  putative outer membrane adhesin like protein [Prosthecochlo
. . . Escherichia coli SE15 ....................................................   57   2 hits [enterobacteria]         conserved hypothetical protein [Escherichia coli SE15]
. . . Streptococcus sp. M143 ...................................................   57   2 hits [firmicutes]             LPXTG cell wall surface anchor family protein [Streptococcu
. . . Mesorhizobium opportunistum WSM2075 ......................................   57   8 hits [a-proteobacteria]       conserved hypothetical protein [Mesorhizobium opportunistum
. . . Enterococcus faecium 1,231,501 ...........................................   57   6 hits [firmicutes]             cell wall surface adhesion protein [Enterococcus faecium 1,
. . . Brucella microti CCM 4915 ................................................   57   4 hits [a-proteobacteria]       outer membrane autotransporter [Brucella microti CCM 4915] 
. . . Burkholderia glumae BGR1 .................................................   57  12 hits [b-proteobacteria]       hypothetical protein bglu_2g12590 [Burkholderia glumae BGR1
. . . Escherichia coli 83972 ...................................................   57   2 hits [enterobacteria]         hemolysin family calcium-binding protein [Escherichia coli 
. . . Burkholderia graminis C4D1M ..............................................   57  14 hits [b-proteobacteria]       YadA  domain protein [Burkholderia graminis C4D1M] >gi|1701
. . . Phaeobacter gallaeciensis BS107 ..........................................   57  10 hits [a-proteobacteria]       hypothetical protein RGBS107_08210 [Phaeobacter gallaeciens
. . . Kordia algicida OT-1 .....................................................   57  18 hits [CFB group bacteria]     probable aggregation factor core protein MAFp3, isoform C [
. . . Marinomonas primoryensis .................................................   57   3 hits [g-proteobacteria]       antifreeze protein [Marinomonas primoryensis]
. . . Burkholderia thailandensis E264 ..........................................   57   8 hits [b-proteobacteria]       serine protease [Burkholderia thailandensis E264] >gi|83651
. . . Escherichia coli CFT073 ..................................................   57   2 hits [enterobacteria]         RTX family exoprotein A gene [Escherichia coli CFT073] >gi|
. . . Actinomyces odontolyticus F0309 ..........................................   56   4 hits [high GC Gram+]          putative lipoprotein [Actinomyces odontolyticus F0309] >gi|
. . . Brucella melitensis bv. 3 str. Ether .....................................   56   4 hits [a-proteobacteria]       predicted protein [Brucella melitensis bv. 3 str. Ether] >g
. . . Desulfonatronospira thiodismutans ASO3-1 .................................   56  12 hits [d-proteobacteria]       hypothetical protein DthioDRAFT_3186 [Desulfonatronospira t
. . . Pseudomonas putida W619 ..................................................   56  22 hits [g-proteobacteria]       hypothetical protein PputW619_4380 [Pseudomonas putida W619
. . . Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537 ..   56   6 hits [enterobacteria]         VCBS repeat-containing protein [Salmonella enterica subsp. 
. . . Salmonella enterica subsp. enterica serovar Agona str. SL483 .............   56  18 hits [enterobacteria]         ShdA [Salmonella enterica subsp. enterica serovar Agona str
. . . Burkholderia ubonensis Bu ................................................   56   1 hit  [b-proteobacteria]       Haemagluttinin domain protein [Burkholderia ubonensis Bu]
. . . Phaeobacter gallaeciensis 2.10 ...........................................   56   4 hits [a-proteobacteria]       hypothetical protein RG210_05237 [Phaeobacter gallaeciensis
. . . Roseobacter sp. AzwK-3b ..................................................   56  24 hits [a-proteobacteria]       surface adhesion protein, putative [Roseobacter sp. AzwK-3b
. . . Burkholderia cepacia .....................................................   56  11 hits [b-proteobacteria]       cable pili-associated 22 kDa adhesin protein [Burkholderia 
. . . Caulobacter vibrioides ...................................................   56   6 hits [a-proteobacteria]       RecName: Full=S-layer protein; AltName: Full=Paracrystallin
. . . Caulobacter crescentus CB15 ..............................................   56   9 hits [a-proteobacteria]       RecName: Full=S-layer protein; AltName: Full=Paracrystallin
. . . Caulobacter crescentus NA1000 ............................................   56   6 hits [a-proteobacteria]       S-layer protein RsaA [Caulobacter crescentus CB15] >gi|2212
. . . Staphylococcus epidermidis ...............................................   56   6 hits [firmicutes]             Flagellar hook-length control protein fliK [Staphylococcus 
. . . Yersinia frederiksenii ATCC 33641 ........................................   56  16 hits [enterobacteria]         Leucyl aminopeptidase [Yersinia frederiksenii ATCC 33641] >
. . . Gluconacetobacter diazotrophicus PAl 5 ...................................   56  18 hits [a-proteobacteria]       outer membrane autotransporter barrel domain protein [Gluco
. . . Escherichia coli HS ......................................................   56   4 hits [enterobacteria]         autotransporter (AT) family porin [Escherichia coli HS] >gi
. . . Planctomyces maris DSM 8797 ..............................................   56  10 hits [planctomycetes]         VCBS [Planctomyces maris DSM 8797] >gi|148848069|gb|EDL6240
. . . Haemophilus influenzae PittEE ............................................   56   4 hits [g-proteobacteria]       HMW2A, high molecular weight adhesin 2 [Haemophilus influen
. . . Burkholderia dolosa AUO158 ...............................................   56  10 hits [b-proteobacteria]       Large exoprotein involved in heme utilization or adhesion [
. . . Maricaulis maris MCS10 ...................................................   56  16 hits [a-proteobacteria]       outer membrane autotransporter [Maricaulis maris MCS10] >gi
. . . Chromohalobacter salexigens DSM 3043 .....................................   56  14 hits [g-proteobacteria]       putative hemagglutinin/hemolysin-related protein [Chromohal
. . . Psychrobacter sp. PRwf-1 .................................................   56   4 hits [g-proteobacteria]       hypothetical protein PsycPRwf_1054 [Psychrobacter sp. PRwf-
. . . Mycoplasma hyorhinis .....................................................   56   2 hits [mycoplasmas]            82-kDa surface lipoprotein precursor [Mycoplasma hyorhinis]
. . . Mesorhizobium loti MAFF303099 ............................................   56  22 hits [a-proteobacteria]       serine proteinase [Mesorhizobium loti MAFF303099] >gi|14022
. . . Ralstonia solanacearum GMI1000 ...........................................   56  18 hits [b-proteobacteria]       putative hemagglutinin-related protein [Ralstonia solanacea
. . . Marinomonas sp. MED121 ...................................................   56   8 hits [g-proteobacteria]       Autotransporter adhesin [Marinomonas sp. MED121] >gi|861620
. . . Vibrio sp. MED222 ........................................................   56  14 hits [g-proteobacteria]       hypothetical protein MED222_04835 [Vibrio sp. MED222] >gi|8
. . . Vibrio splendidus 12B01 ..................................................   56  16 hits [g-proteobacteria]       hypothetical protein V12B01_12555 [Vibrio splendidus 12B01]
. . . Escherichia coli 53638 ...................................................   56   6 hits [enterobacteria]         EntS/YbdA MFS transporter [Escherichia coli 53638] >gi|1884
. . . Enterococcus faecalis PC1.1 ..............................................   56   4 hits [firmicutes]             LPXTG-motif cell wall anchor domain protein [Enterococcus f
. . . Chlorobaculum parvum NCIB 8327 ...........................................   56   4 hits [green sulfur bacteria]  Haemagluttinin domain protein [Chlorobaculum parvum NCIB 83
. . . Vibrio cholerae MZO-2 ....................................................   56  20 hits [g-proteobacteria]       von Willebrand factor, type A [Vibrio cholerae MZO-2] >gi|1
. . . Sagittula stellata E-37 ..................................................   56  36 hits [a-proteobacteria]       outer membrane autotransporter barrel [Sagittula stellata E
. . . Burkholderia pseudomallei 668 ............................................   56   6 hits [b-proteobacteria]       putative outer membrane protein [Burkholderia pseudomallei 
. . . Staphylococcus xylosus ...................................................   56  12 hits [firmicutes]             biofilm-associated protein [Staphylococcus xylosus]
. . . Lactococcus lactis subsp. cremoris SK11 ..................................   56   2 hits [firmicutes]             hypothetical protein LACR_1259 [Lactococcus lactis subsp. c
. . . Burkholderia vietnamiensis G4 ............................................   56   4 hits [b-proteobacteria]       filamentous haemagglutinin outer membrane protein [Burkhold
. . . Burkholderia sp. CCGE1002 ................................................   55   8 hits [b-proteobacteria]       Uncharacterized protein with a C-terminal OMP (outer membra
. . . Brucella abortus bv. 2 str. 86/8/59 ......................................   55   9 hits [a-proteobacteria]       outer membrane autotransporter [Brucella abortus bv. 2 str.
. . . Brucella abortus NCTC 8038 ...............................................   55   6 hits [a-proteobacteria]       outer membrane transporter [Brucella abortus NCTC 8038] >gi
. . . Enterococcus faecium TC 6 ................................................   55   6 hits [firmicutes]             gram-positive cocci surface protein [Enterococcus faecium T
. . . Brucella abortus bv. 4 str. 292 ..........................................   55   9 hits [a-proteobacteria]       outermembrane transporter [Brucella abortus bv. 4 str. 292]
. . . Brucella abortus str. 2308 A .............................................   55   8 hits [a-proteobacteria]       autotransporter-associated beta strand repeat-containing pr
. . . Brucella ceti str. Cudo ..................................................   55   4 hits [a-proteobacteria]       outer membrane autotransporter barrel domain protein [Bruce
. . . Vibrio splendidus LGP32 ..................................................   55  12 hits [g-proteobacteria]       hypothetical protein VS_II0855 [Vibrio splendidus LGP32] >g
. . . Vibrio fischeri MJ11 .....................................................   55   4 hits [g-proteobacteria]       iron-regulated protein FrpC [Vibrio fischeri MJ11] >gi|1973
. . . Stenotrophomonas maltophilia R551-3 ......................................   55  30 hits [g-proteobacteria]       outer membrane autotransporter barrel domain protein [Steno
. . . Brucella abortus S19 .....................................................   55   8 hits [a-proteobacteria]       outermembrane transporter [Brucella abortus S19] >gi|189020
. . . Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066 .........   55   6 hits [enterobacteria]         VCBS repeat-containing protein [Salmonella enterica subsp. 
. . . Escherichia coli DH1 .....................................................   55   2 hits [enterobacteria]         conserved hypothetical protein [Escherichia coli DH1]
. . . Burkholderia sp. 383 .....................................................   55  26 hits [b-proteobacteria]       outer membrane autotransporter barrel [Burkholderia sp. 383
. . . Brucella abortus bv. 1 str. 9-941 ........................................   55   8 hits [a-proteobacteria]       outermembrane transporter [Brucella abortus bv. 1 str. 9-94
. . . Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150 ...   55   4 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601 ...   55   8 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Pseudomonas putida KT2440 ................................................   55  22 hits [g-proteobacteria]       surface adhesion protein, putative [Pseudomonas putida KT24
. . . Legionella pneumophila str. Paris ........................................   55   6 hits [g-proteobacteria]       hypothetical protein lpp0779 [Legionella pneumophila str. P
. . . Escherichia coli K-12 ....................................................   55   2 hits [enterobacteria]         RecName: Full=Putative uncharacterized protein ydbA
. . . Escherichia coli .........................................................   55   3 hits [enterobacteria]         ABC-type transport protein ydbA.2 - Escherichia coli (strai
. . . Loktanella vestfoldensis SKA53 ...........................................   55   8 hits [a-proteobacteria]       hypothetical protein SKA53_00375 [Loktanella vestfoldensis 
. . . Ralstonia solanacearum UW551 .............................................   55  14 hits [b-proteobacteria]       Hypothetical Protein RRSL_04357 [Ralstonia solanacearum UW5
. . . Caulobacter segnis ATCC 21756 ............................................   55  10 hits [a-proteobacteria]       Hemolysin-type calcium-binding region [Caulobacter segnis A
. . . Fusobacterium varium ATCC 27725 ..........................................   55   4 hits [fusobacteria]           LOW QUALITY PROTEIN: conserved hypothetical protein [Fusoba
. . . Pectobacterium carotovorum subsp. carotovorum PC1 ........................   55  10 hits [enterobacteria]         von Willebrand factor type A [Pectobacterium carotovorum su
. . . Dickeya zeae Ech1591 .....................................................   55  10 hits [enterobacteria]         Ig family protein [Dickeya zeae Ech1591] >gi|247536661|gb|A
. . . Haemophilus influenzae 6P18H1 ............................................   55   2 hits [g-proteobacteria]       HMW1A, high molecular weight adhesin 1 [Haemophilus influen
. . . Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7 ........   55   4 hits [enterobacteria]         hypothetical protein SPAB_03407 [Salmonella enterica subsp.
. . . Vibrio cholerae RC385 ....................................................   55   2 hits [g-proteobacteria]       RTX protein [Vibrio cholerae RC385] >gi|150420894|gb|EDN131
. . . Roseobacter sp. MED193 ...................................................   55   4 hits [a-proteobacteria]       hypothetical protein MED193_12118 [Roseobacter sp. MED193] 
. . . Croceibacter atlanticus HTCC2559 .........................................   55  14 hits [CFB group bacteria]     probable extracellular nuclease [Croceibacter atlanticus HT
. . . Roseovarius nubinhibens ISM ..............................................   55   2 hits [a-proteobacteria]       putative RTX family exoprotein [Roseovarius nubinhibens ISM
. . . Enterococcus faecium DO ..................................................   55   6 hits [firmicutes]             Surface protein from Gram-positive cocci, anchor region [En
. . . Enterobacter cancerogenus ATCC 35316 .....................................   55  16 hits [enterobacteria]         exoprotein, RTX family [Enterobacter cancerogenus ATCC 3531
. . . Stackebrandtia nassauensis DSM 44728 .....................................   55   6 hits [high GC Gram+]          hypothetical protein Snas_4445 [Stackebrandtia nassauensis 
. . . Pectobacterium carotovorum subsp. carotovorum WPP14 ......................   55   1 hit  [enterobacteria]         putative hemagglutinin/hemolysin-related protein [Pectobact
. . . Salmonella enterica subsp. enterica serovar Typhi str. E00-7866 ..........   55   6 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Opitutus terrae PB90-1 ...................................................   55   4 hits [verrucomicrobia]        hypothetical protein Oter_3963 [Opitutus terrae PB90-1] >gi
. . . Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433 ...   55   6 hits [enterobacteria]         VCBS repeat-containing protein [Salmonella enterica subsp. 
. . . Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480 ....   55   6 hits [enterobacteria]         VCBS repeat-containing protein [Salmonella enterica subsp. 
. . . Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 .   55   6 hits [enterobacteria]         VCBS repeat-containing protein [Salmonella enterica subsp. 
. . . Roseobacter litoralis Och 149 ............................................   55   8 hits [a-proteobacteria]       VCBS repeat domain protein [Roseobacter litoralis Och 149] 
. . . Marinomonas sp. MWYL1 ....................................................   55  20 hits [g-proteobacteria]       filamentous haemagglutinin outer membrane protein [Marinomo
. . . Bermanella marisrubri ....................................................   55   4 hits [g-proteobacteria]       Flagellar capping protein [Oceanobacter sp. RED65] >gi|9442
. . . Flavobacteria bacterium BBFL7 ............................................   55   8 hits [CFB group bacteria]     conserved hypothetical protein [Flavobacteria bacterium BBF
. . . Burkholderia pseudomallei 1710b ..........................................   55  12 hits [b-proteobacteria]       Hep_Hag family protein [Burkholderia pseudomallei 1710b] >g
. . . Burkholderia pseudomallei 1710a ..........................................   55  12 hits [b-proteobacteria]       Hep_Hag family protein [Burkholderia pseudomallei 1710b] >g
. . . Pseudomonas putida F1 ....................................................   55  20 hits [g-proteobacteria]       glycoprotein [Pseudomonas putida F1] >gi|148510133|gb|ABQ76
. . . Salmonella enterica subsp. enterica serovar Typhimurium str. D23580 ......   54   3 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Fusobacterium sp. 3_1_36A2 ...............................................   54   8 hits [fusobacteria]           AT family autotransporter [Fusobacterium sp. 3_1_36A2] >gi|
. . . Brucella sp. 83/13 .......................................................   54   2 hits [a-proteobacteria]       outer membrane autotransporter [Brucella sp. 83/13]
. . . Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 ....   54   4 hits [enterobacteria]         VCBS repeat-containing protein [Salmonella enterica subsp. 
. . . Streptococcus salivarius SK126 ...........................................   54   8 hits [firmicutes]             gram positive anchor domain protein [Streptococcus salivari
. . . Salmonella enterica subsp. enterica serovar Typhi str. AG3 ...............   54   3 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 ..........   54   2 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Salmonella enterica subsp. enterica serovar Enteritidis str. P125109 .....   54   4 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Phenylobacterium zucineum HLK1 ...........................................   54   8 hits [a-proteobacteria]       Hemolysin-type calcium-binding region [Phenylobacterium zuc
. . . Geobacillus sp. Y412MC10 .................................................   54   4 hits [firmicutes]             S-layer domain protein [Geobacillus sp. Y412MC10] >gi|26128
. . . Methylobacterium populi BJ001 ............................................   54  32 hits [a-proteobacteria]       GLUG domain protein [Methylobacterium populi BJ001] >gi|179
. . . Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701 ...   54   6 hits [enterobacteria]         VCBS repeat-containing protein [Salmonella enterica subsp. 
. . . Pseudomonas putida GB-1 ..................................................   54  58 hits [g-proteobacteria]       hypothetical protein PputGB1_0186 [Pseudomonas putida GB-1]
. . . Methylobacterium extorquens PA1 ..........................................   54  48 hits [a-proteobacteria]       hypothetical protein Mext_2409 [Methylobacterium extorquens
. . . Salmonella enterica subsp. enterica serovar Typhimurium ..................   54   3 hits [enterobacteria]         biofilm associated protein A [Salmonella enterica subsp. en
. . . Burkholderia pseudomallei 305 ............................................   54  18 hits [b-proteobacteria]       protein YbcL [Burkholderia pseudomallei 305] >gi|134247588|
. . . Parvibaculum lavamentivorans DS-1 ........................................   54   4 hits [a-proteobacteria]       outer membrane autotransporter [Parvibaculum lavamentivoran
. . . Rhodobacterales bacterium HTCC2255 .......................................   54  26 hits [a-proteobacteria]       calcium binding hemolysin protein, putative [alpha proteoba
. . . Glaciecola sp. HTCC2999 ..................................................   54  26 hits [g-proteobacteria]       calcium binding hemolysin protein, putative [alpha proteoba
. . . Bradyrhizobium japonicum USDA 110 ........................................   54   6 hits [a-proteobacteria]       hypothetical protein blr4714 [Bradyrhizobium japonicum USDA
. . . Salmonella enterica subsp. enterica serovar Typhi str. CT18 ..............   54   4 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Salmonella enterica subsp. enterica serovar Typhi str. E98-3139 ..........   54   6 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Salmonella enterica subsp. enterica serovar Typhi ........................   54  12 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Enterococcus faecalis HIP11704 ...........................................   54   4 hits [firmicutes]             cell wall surface anchor family protein [Enterococcus faeca
. . . Sebaldella termitidis ATCC 33386 .........................................   54   2 hits [fusobacteria]           outer membrane autotransporter barrel domain protein [Sebal
. . . Fusobacterium sp. 7_1 ....................................................   54   4 hits [fusobacteria]           outer membrane protein [Fusobacterium sp. 7_1] >gi|22943288
. . . Labrenzia alexandrii DFL-11 ..............................................   54   4 hits [a-proteobacteria]       type I secretion target GGXGXDXXX repeat protein domain pro
. . . Agrobacterium radiobacter K84 ............................................   54  34 hits [a-proteobacteria]       outer membrane pathogenesis protein [Agrobacterium radiobac
. . . Escherichia coli SE11 ....................................................   54   6 hits [enterobacteria]         putative adhesin [Escherichia coli SE11] >gi|209912715|dbj|
. . . Escherichia coli E24377A .................................................   54   6 hits [enterobacteria]         putative invasin [Escherichia coli E24377A] >gi|157077138|g
. . . Actinomyces odontolyticus ATCC 17982 .....................................   54   4 hits [high GC Gram+]          hypothetical protein ACTODO_00671 [Actinomyces odontolyticu
. . . Haemophilus influenzae 3655 ..............................................   54   4 hits [g-proteobacteria]       HMW2A, high molecular weight adhesin 2 [Haemophilus influen
. . . gamma proteobacterium HTCC2207 ...........................................   54  12 hits [g-proteobacteria]       OmpA-like transmembrane domain protein [marine gamma proteo
. . . Burkholderia sp. CCGE1001 ................................................   53  14 hits [b-proteobacteria]       filamentous hemagglutinin family outer membrane protein [Bu
. . . Acinetobacter calcoaceticus RUH2202 ......................................   53   4 hits [g-proteobacteria]       LOW QUALITY PROTEIN: cell-surface adhesin [Acinetobacter ca
. . . Catenulispora acidiphila DSM 44928 .......................................   53   4 hits [high GC Gram+]          Ricin B lectin [Catenulispora acidiphila DSM 44928] >gi|256
. . . Neisseria sicca ATCC 29256 ...............................................   53   6 hits [b-proteobacteria]       Hep_Hag family protein [Neisseria sicca ATCC 29256] >gi|255
. . . Nitrosomonas sp. AL212 ...................................................   53   8 hits [b-proteobacteria]       hypothetical protein NAL212DRAFT_0735 [Nitrosomonas sp. AL2
. . . Paenibacillus sp. JDR-2 ..................................................   53  10 hits [firmicutes]             cell wall/surface repeat protein [Paenibacillus sp. JDR-2] 
. . . Methylobacterium extorquens AM1 ..........................................   53  56 hits [a-proteobacteria]       hypothetical protein MexAM1_META1p2412 [Methylobacterium ex
. . . Alteromonas macleodii ATCC 27126 .........................................   53   5 hits [g-proteobacteria]       hypothetical protein AmacA2_10885 [Alteromonas macleodii AT
. . . Persephonella marina EX-H1 ...............................................   53  10 hits [aquificales]            hemagglutination activity domain protein [Persephonella mar
. . . Salmonella enterica subsp. enterica serovar Typhi str. J185 ..............   53   2 hits [enterobacteria]         putative surface-exposed virulence protein [Salmonella ente
. . . Salmonella enterica subsp. enterica serovar Typhi str. E98-0664 ..........   53   4 hits [enterobacteria]         large repetitive protein [Salmonella enterica subsp. enteri
. . . Salmonella enterica subsp. enterica serovar Typhi str. E02-1180 ..........   53   2 hits [enterobacteria]         putative surface-exposed virulence protein [Salmonella ente
. . . Escherichia coli F11 .....................................................   53   4 hits [enterobacteria]         EntS/YbdA MFS transporter [Escherichia coli F11] >gi|190908
. . . Methylacidiphilum infernorum V4 ..........................................   53   4 hits [verrucomicrobia]        Large exoprotein involved in heme utilization or adhesion [
. . . Burkholderia phytofirmans PsJN ...........................................   53  20 hits [b-proteobacteria]       YadA domain protein [Burkholderia phytofirmans PsJN] >gi|18
. . . Shewanella sediminis HAW-EB3 .............................................   53   8 hits [g-proteobacteria]       fibronectin type III domain-containing protein [Shewanella 
. . . unidentified eubacterium SCB49 ...........................................   53   4 hits [CFB group bacteria]     probable extracellular nuclease [unidentified eubacterium S
. . . Fusobacterium nucleatum subsp. polymorphum ATCC 10953 ....................   53   2 hits [fusobacteria]           AT family autotransporter [Fusobacterium nucleatum subsp. p
. . . Pseudomonas aeruginosa UCBPP-PA14 ........................................   53   8 hits [g-proteobacteria]       hypothetical protein PA14_40260 [Pseudomonas aeruginosa UCB
. . . Geobacter sulfurreducens PCA .............................................   53  28 hits [d-proteobacteria]       cadherin domain/calx-beta domain-containing protein [Geobac
. . . Fusobacterium nucleatum subsp. nucleatum ATCC 25586 ......................   53   2 hits [fusobacteria]           hypothetical protein FN1526 [Fusobacterium nucleatum subsp.
. . Polysphondylium pallidum PN500 ---------------------------------------------   75  64 hits [cellular slime molds]   hypothetical protein PPL_09793 [Polysphondylium pallidum PN
. . Drosophila willistoni ......................................................   73  42 hits [flies]                  GK12566 [Drosophila willistoni] >gi|194163794|gb|EDW78695.1
. . Yarrowia lipolytica CLIB122 ................................................   67  40 hits [ascomycetes]            YALI0C06391p [Yarrowia lipolytica] >gi|199425235|emb|CAG818
. . Yarrowia lipolytica ........................................................   67  40 hits [ascomycetes]            YALI0C06391p [Yarrowia lipolytica] >gi|199425235|emb|CAG818
. . Drosophila grimshawi .......................................................   65   8 hits [flies]                  GH18720 [Drosophila grimshawi] >gi|193893784|gb|EDV92650.1|
. . Mus musculus (mouse) .......................................................   65  21 hits [rodents]                PREDICTED: similar to mucin 3 [Mus musculus]
. . Phytophthora infestans T30-4 ...............................................   64   5 hits [oomycetes]              mucin-like protein [Phytophthora infestans T30-4]
. . Pan troglodytes ............................................................   63   2 hits [primates]               PREDICTED: similar to KMQK697 [Pan troglodytes]
. . Homo sapiens (man) .........................................................   63 176 hits [primates]               unnamed protein product [Homo sapiens]
. . Drosophila erecta ..........................................................   63   6 hits [flies]                  GG12068 [Drosophila erecta] >gi|190656094|gb|EDV53326.1| GG
. . Nectria haematococca mpVI 77-13-4 ..........................................   62   3 hits [ascomycetes]            hypothetical protein NECHADRAFT_84492 [Nectria haematococca
. . Drosophila sechellia .......................................................   62   4 hits [flies]                  GM16292 [Drosophila sechellia] >gi|194127087|gb|EDW49130.1|
. . Drosophila melanogaster ....................................................   62  64 hits [flies]                  papilin, isoform G [Drosophila melanogaster] >gi|272477223|
. . Caenorhabditis elegans (nematode) ..........................................   62  16 hits [nematodes]              hypothetical protein H02F09.3 [Caenorhabditis elegans] >gi|
. . Leishmania mexicana ........................................................   62   2 hits [kinetoplastids]         secreted acid phosphatase 2 (SAP2) [Leishmania mexicana]
. . Naegleria gruberi strain NEG-M .............................................   61   7 hits [eukaryotes]             predicted protein [Naegleria gruberi] >gi|284082885|gb|EFC3
. . Naegleria gruberi ..........................................................   61   7 hits [eukaryotes]             predicted protein [Naegleria gruberi] >gi|284082885|gb|EFC3
. . Ciona intestinalis .........................................................   61   2 hits [tunicates]              PREDICTED: similar to zymogen granule membrane glycoprotein
. . Halorhabdus utahensis DSM 12940 ............................................   60  16 hits [euryarchaeotes]         hypothetical protein Huta_2263 [Halorhabdus utahensis DSM 1
. . Gibberella zeae PH-1 .......................................................   60   5 hits [ascomycetes]            hypothetical protein FG03188.1 [Gibberella zeae PH-1]
. . Kluyveromyces lactis NRRL Y-1140 ...........................................   60   6 hits [ascomycetes]            unnamed protein product [Kluyveromyces lactis] >gi|49641308
. . Kluyveromyces lactis .......................................................   60   6 hits [ascomycetes]            unnamed protein product [Kluyveromyces lactis] >gi|49641308
. . Candida dubliniensis CD36 ..................................................   60   4 hits [ascomycetes]            hypothetical GPI-anchored protein, putative [Candida dublin
. . Perkinsus marinus ATCC 50983 ...............................................   59   4 hits [eukaryotes]             dentin sialophosphoprotein precursor, putative [Perkinsus m
. . Drosophila simulans ........................................................   59   4 hits [flies]                  GD18032 [Drosophila simulans] >gi|194201158|gb|EDX14734.1| 
. . Drosophila virilis .........................................................   59  20 hits [flies]                  GJ14166 [Drosophila virilis] >gi|194142263|gb|EDW58671.1| G
. . Dictyostelium discoideum AX4 ...............................................   58   8 hits [cellular slime molds]   hypothetical protein DDB_G0295727 [Dictyostelium discoideum
. . Trichoplax adhaerens .......................................................   58   6 hits [placozoans]             hypothetical protein TRIADDRAFT_62951 [Trichoplax adhaerens
. . Branchiostoma floridae .....................................................   58   4 hits [lancelets]              hypothetical protein BRAFLDRAFT_89036 [Branchiostoma florid
. . Drosophila ananassae .......................................................   57  16 hits [flies]                  GF24532 [Drosophila ananassae] >gi|190623842|gb|EDV39366.1|
. . Saccharomyces cerevisiae YJM789 ............................................   57   4 hits [ascomycetes]            pathogen-related protein [Saccharomyces cerevisiae YJM789]
. . Halogeometricum borinquense DSM 11551 ......................................   57   4 hits [euryarchaeotes]         hypothetical protein HborDRAFT_2098 [Halogeometricum borinq
. . Sclerotinia sclerotiorum 1980 UF-70 ........................................   57   2 hits [ascomycetes]            hypothetical protein SS1G_04213 [Sclerotinia sclerotiorum 1
. . Lachancea thermotolerans CBS 6340 ..........................................   56   2 hits [ascomycetes]            KLTH0D03894p [Lachancea thermotolerans] >gi|238934272|emb|C
. . Lachancea thermotolerans ...................................................   56   2 hits [ascomycetes]            KLTH0D03894p [Lachancea thermotolerans] >gi|238934272|emb|C
. . Micromonas pusilla CCMP1545 ................................................   56   3 hits [green algae]            predicted protein [Micromonas pusilla CCMP1545]
. . Neurospora crassa OR74A ....................................................   56   4 hits [ascomycetes]            hypothetical protein NCU04373 [Neurospora crassa OR74A] >gi
. . Neurospora crassa ..........................................................   56   2 hits [ascomycetes]            hypothetical protein NCU04373 [Neurospora crassa OR74A] >gi
. . Drosophila yakuba ..........................................................   56  12 hits [flies]                  GE10512 [Drosophila yakuba] >gi|194184802|gb|EDW98413.1| GE
. . uncultured haloarchaeon ....................................................   56   4 hits [euryarchaeotes]         probable cell surface adhesin [uncultured haloarchaeon]
. . Methanoculleus marisnigri JR1 ..............................................   56   6 hits [euryarchaeotes]         Ig domain-containing protein [Methanoculleus marisnigri JR1
. . Candida tropicalis MYA-3404 ................................................   55   8 hits [ascomycetes]            predicted protein [Candida tropicalis MYA-3404] >gi|2401348
. . Saccharomyces cerevisiae (yeast) ...........................................   55   4 hits [ascomycetes]            AOF1001 [Saccharomyces cerevisiae]
. . Drosophila pseudoobscura pseudoobscura .....................................   55  10 hits [flies]                  GA22193 [Drosophila pseudoobscura pseudoobscura] >gi|198145
. . Drosophila persimilis ......................................................   55   2 hits [flies]                  GL22810 [Drosophila persimilis] >gi|194107351|gb|EDW29394.1
. . Monosiga brevicollis MX1 ...................................................   55   2 hits [choanoflagellates]      hypothetical protein [Monosiga brevicollis MX1] >gi|1637750
. . Theileria annulata strain Ankara ...........................................   55   2 hits [apicomplexans]          hypothetical protein [Theileria annulata strain Ankara] >gi
. . Theileria annulata .........................................................   55   2 hits [apicomplexans]          hypothetical protein [Theileria annulata strain Ankara] >gi
. . Candida albicans SC5314 ....................................................   55   8 hits [ascomycetes]            hypothetical protein CaO19.5401 [Candida albicans SC5314] >
. . Hydra magnipapillata .......................................................   55   5 hits [hydrozoans]             PREDICTED: hypothetical protein, partial [Hydra magnipapill
. . Magnaporthe oryzae 70-15 ...................................................   55  10 hits [ascomycetes]            hypothetical protein MGG_00209 [Magnaporthe grisea 70-15] >
. . Ostreococcus lucimarinus CCE9901 ...........................................   55  14 hits [green algae]            predicted protein [Ostreococcus lucimarinus CCE9901] >gi|14
. . Oryza sativa Japonica Group (Japanese rice) ................................   55   5 hits [monocots]               hypothetical protein OsJ_20530 [Oryza sativa Japonica Group]
. . Trichomonas vaginalis G3 ...................................................   55  20 hits [trichomonads]           flocculin [Trichomonas vaginalis G3] >gi|121889921|gb|EAX95
. . Debaryomyces hansenii CBS767 ...............................................   55   9 hits [ascomycetes]            hypothetical protein DEHA0E25971g [Debaryomyces hansenii CB
. . Leishmania donovani ........................................................   55   1 hit  [kinetoplastids]         histidine secretory acid phosphatase [Leishmania donovani]
. . Oryza sativa Indica Group (Indian rice) ....................................   55   1 hit  [monocots]               lustrin A-like [Oryza sativa Japonica Group] >gi|51091192|d
. . Haloferax volcanii DS2 .....................................................   55   2 hits [euryarchaeotes]         cell surface glycoprotein [Haloferax volcanii DS2] >gi|1175
. . Haloferax volcanii .........................................................   55   2 hits [euryarchaeotes]         cell surface glycoprotein [Haloferax volcanii DS2] >gi|1175
. . Magnaporthe oryzae (rice blast fungus) .....................................   53   6 hits [ascomycetes]            chitin binding protein 4 [Magnaporthe oryzae]
. . Pediculus humanus corporis (human body lice) ...............................   53   4 hits [lice]                   papilin, putative [Pediculus humanus corporis] >gi|21251711
. . Ostreococcus tauri .........................................................   53   3 hits [green algae]            Haemagluttinin motif:Hep_Hag (ISS) [Ostreococcus tauri]
. Invertebrate iridescent virus 6 ----------------------------------------------   72  36 hits [viruses]                443R [Invertebrate iridescent virus 6] >gi|34223713|sp|P183


------------------------------------------------------------------------------------------------------------------

Lineage Report

marine metagenome [metagenomes]
. marine metagenome -  512 1021 hits [metagenomes]  hypothetical protein GOS_5169218 [marine metagenome]

BLAST

PROTOCOL


a) BLASTp versus SWISSPROT, NCBI default parameters except "Number of descriptions" set to 1000

b) BLASTp versus NR, NCBI default parameters except "Number of descriptions" set to 1000

c) BLASTp versus ENV_NR, NCBI default parameters except "Number of descriptions" set to 1000

d) BLASTx versus SWISSPROT, NCBI default parameters except "Number of descriptions" set to 1000

e) BLASTx versus NR, NCBI default parameters except "Number of descriptions" set to 1000
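
The five searches listed above could be written down as a small search plan. This is only a sketch: the searches on this page were run through the NCBI web interface, the lowercase database identifiers ("swissprot", "nr", "env_nr") are assumptions about how the databases are named for programmatic access, and `Search`, `SEARCHES`, `query_for` and `run_search` are hypothetical names introduced for illustration. The optional `run_search` helper assumes Biopython is installed and uses its `NCBIWWW.qblast` function; it is not called here.

```python
from typing import NamedTuple


class Search(NamedTuple):
    label: str      # letter used in the protocol list
    program: str    # BLAST program
    database: str   # assumed NCBI database identifier


# The five searches from the protocol, in the same order.
SEARCHES = [
    Search("a", "blastp", "swissprot"),
    Search("b", "blastp", "nr"),
    Search("c", "blastp", "env_nr"),
    Search("d", "blastx", "swissprot"),
    Search("e", "blastx", "nr"),
]

# The only non-default parameter named in the protocol.
DESCRIPTIONS = 1000


def query_for(search: Search, protein: str, dna: str) -> str:
    """BLASTx takes the nucleotide read; BLASTp takes the translation."""
    return dna if search.program == "blastx" else protein


def run_search(search: Search, protein: str, dna: str):
    """Submit one search to NCBI (needs Biopython and network access)."""
    from Bio.Blast import NCBIWWW  # imported lazily; optional dependency

    return NCBIWWW.qblast(
        search.program,
        search.database,
        query_for(search, protein, dna),
        descriptions=DESCRIPTIONS,
    )
```

Keeping the plan as data makes it easy to check that BLASTp searches receive the translated ORF and BLASTx searches receive the genomic read.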



RESULTS ANALYSIS


The sequence did not show significant results in the BLASTp vs SWISSPROT or BLASTp vs NR searches, where 20 hits and 1 hit were found respectively; the best E-values were 0.59 (score 35.0) and 3.1 (score 36.6).
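
The hit count and the best score and E-value quoted above can be read off the BLAST description table. A minimal sketch, assuming rows in the layout shown under RAW RESULTS (identifier, description, bit score, E-value); `Hit` and `parse_description_table` are hypothetical names introduced for illustration, and the sample holds only the first three SWISSPROT rows.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    accession: str
    description: str
    score: float
    evalue: float


# identifier, free-text description, bit score, E-value (last two tokens)
_ROW = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_description_table(text: str) -> List[Hit]:
    """Return one Hit per table row; header and blank lines are skipped."""
    hits = []
    for line in text.splitlines():
        match = _ROW.match(line.strip())
        if not match:
            continue
        accession, description, score, evalue = match.groups()
        if evalue.startswith("e"):  # BLAST may print "e-100" for "1e-100"
            evalue = "1" + evalue
        try:
            hits.append(Hit(accession, description, float(score), float(evalue)))
        except ValueError:
            continue  # last token was not a number, so not a hit row
    return hits


sample = """\
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

sp|Q196W9.1|VF396_IIV3  RecName: Full=Uncharacterized protein ...  35.0    0.59
sp|C3N5E1.1|TRM1_SULIA  RecName: Full=N(2),N(2)-dimethylguanos...  34.3    1.0
sp|Q553U5.1|CECR1_DICDI  RecName: Full=Adenosine deaminase CEC...  33.9    1.1
"""

hits = parse_description_table(sample)
best = min(hits, key=lambda h: h.evalue)  # lowest E-value is the best hit
```

On the full SWISSPROT table the same function would be expected to return the 20 rows listed there.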

The best results came from BLASTp vs ENV_NR: 580 hits were found, the best E-value (4e-144) was far below the 1e-4 significance threshold, and the best score was high (512).

It is safe to assume that this sequence has known homologues: the first alignment returned by BLAST shows 100% identity, 100% positives and no gaps. Because the BLASTp vs ENV_NR lineage report showed only one organism, the BLASTx results were analysed in order to build an MSA and attempt a valid tree. BLASTx vs NR displayed a better E-value (4e-13) and a score of 79.7, which was not great but was the best alternative found.
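
The significance reasoning in this analysis can be sketched as a simple comparison, assuming the 1e-4 E-value cutoff mentioned above and the usual convention that a lower E-value means a more significant hit. `E_VALUE_CUTOFF`, `is_significant` and `best_evalues` are hypothetical names; the values are the best E-values quoted in this section.

```python
# Assumed significance cutoff: hits with E-value below this are kept.
E_VALUE_CUTOFF = 1e-4


def is_significant(evalue: float, cutoff: float = E_VALUE_CUTOFF) -> bool:
    """A hit is significant when its E-value is below the cutoff."""
    return evalue < cutoff


# Best E-value per search, as quoted in the results analysis.
best_evalues = {
    "BLASTp vs SWISSPROT": 0.59,
    "BLASTp vs NR": 3.1,
    "BLASTp vs ENV_NR": 4e-144,
    "BLASTx vs NR": 4e-13,
}

significant = {name: is_significant(e) for name, e in best_evalues.items()}

for name, ok in significant.items():
    print(f"{name}: {'significant' if ok else 'not significant'}")
```

Under these assumptions only the ENV_NR and BLASTx vs NR searches pass the cutoff, which matches the choice of BLASTx vs NR as the source of sequences for the MSA.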

RAW RESULTS

a) BLASTp versus SWISSPROT
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

sp|Q196W9.1|VF396_IIV3  RecName: Full=Uncharacterized protein ...  35.0    0.59 
sp|C3N5E1.1|TRM1_SULIA  RecName: Full=N(2),N(2)-dimethylguanos...  34.3    1.0  
sp|Q553U5.1|CECR1_DICDI  RecName: Full=Adenosine deaminase CEC...  33.9    1.1  
sp|Q9H582.2|ZN644_HUMAN  RecName: Full=Zinc finger protein 644...  33.9    1.3  
sp|C4L8X0.1|PNP_TOLAT  RecName: Full=Polyribonucleotide nucleo...  33.5    1.6  
sp|C5BFC1.1|PNP_EDWI9  RecName: Full=Polyribonucleotide nucleo...  32.7    2.5  
sp|O82427.2|SMT2_ORYSJ  RecName: Full=24-methylenesterol C-met...  32.0    4.1  
sp|Q971V9.1|TRM1_SULTO  RecName: Full=N(2),N(2)-dimethylguanos...  31.2    7.0  
sp|A1S467.1|PNP_SHEAM  RecName: Full=Polyribonucleotide nucleo...  31.2    7.3  
sp|Q32BG9.2|PNP_SHIDS  RecName: Full=Polyribonucleotide nucleo...  31.2    7.5  
sp|B1IQV7.1|PNP_ECOLC  RecName: Full=Polyribonucleotide nucleo...  31.2    7.6  
sp|B7M072.2|PNP_ECO8A  RecName: Full=Polyribonucleotide nucleo...  31.2    7.7  
sp|A1AG69.2|PNP_ECOK1  RecName: Full=Polyribonucleotide nucleo...  31.2    7.7  
sp|A7ZS61.1|PNP_ECO24  RecName: Full=Polyribonucleotide nucleo...  31.2    7.7  
sp|B7UJ59.1|PNP_ECO27  RecName: Full=Polyribonucleotide nucleo...  31.2    7.8  
sp|P05055.3|PNP_ECOLI  RecName: Full=Polyribonucleotide nucleo...  31.2    7.9  
sp|Q0T0B7.1|PNP_SHIF8  RecName: Full=Polyribonucleotide nucleo...  31.2    7.9  
sp|Q31W43.2|PNP_SHIBS  RecName: Full=Polyribonucleotide nucleo...  31.2    8.2  
sp|Q7MYZ0.1|PNP_PHOLL  RecName: Full=Polyribonucleotide nucleo...  30.8    9.5  
sp|B2FPB2.1|SECA_STRMK  RecName: Full=Protein translocase subu...  30.8    9.7  

ALIGNMENTS
>sp|Q196W9.1|VF396_IIV3 RecName: Full=Uncharacterized protein 091L
Length=1096

 Score = 35.0 bits (79),  Expect = 0.59, Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 51/95 (53%), Gaps = 11/95 (11%)

Query  143  VITLSGDVAGSATMTNLGDVTIST---TIQANSIALGTDTTGNYVSAISAGEGIDVSGSG  199
            ++ L+GD++G+AT+  + +  +S    T   +    GTDTTG  V+ ++ G G+ +SG+ 
Sbjct  529  IVQLAGDLSGTATVPKIANAVVSNQKLTPGTSGTLKGTDTTGA-VADVTLGSGLTISGT-  586

Query  200  SETATVTISAEDATDSNKGIASFDATDFTVSSGDV  234
                  T+S + A+    G + F    F  +SGD+
Sbjct  587  ------TLSVDAASVPKAGSSQFGTVQFNATSGDL  615


>sp|C3N5E1.1|TRM1_SULIA RecName: Full=N(2),N(2)-dimethylguanosine tRNA methyltransferase; 
AltName: Full=tRNA(guanine-26,N(2)-N(2)) methyltransferase; 
AltName: Full=tRNA 2,2-dimethylguanosine-26 methyltransferase; 
AltName: Full=tRNA(m(2,2)G26)dimethyltransferase
Length=378

 Score = 34.3 bits (77),  Expect = 1.0, Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 2/69 (2%)

Query  191  EGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIVGAMFS  250
            E ID+   GS    + +S+ +AT  N GIA+F ATD +   G    +  R  D +    S
Sbjct  125  EYIDIDPFGSPVPFI-LSSINATIRN-GIAAFTATDLSPLEGSSRTSCRRKYDAINYKLS  182

Query  251  SNTESGISV  259
            S+ E G+ +
Sbjct  183  SSKELGLRI  191


 Score = 31.6 bits (70),  Expect = 6.0, Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 2/69 (2%)

Query  58   EGIDVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVG  117
            E ID+   GS    I +S+ +AT  N GIA+F +TD +   G+   +  R  D +   + 
Sbjct  125  EYIDIDPFGSPVPFI-LSSINATIRN-GIAAFTATDLSPLEGSSRTSCRRKYDAINYKLS  182

Query  118  SNTESGITV  126
            S+ E G+ +
Sbjct  183  SSKELGLRI  191


>sp|Q553U5.1|CECR1_DICDI RecName: Full=Adenosine deaminase CECR1 homolog; Flags: Precursor
Length=543

 Score = 33.9 bits (76),  Expect = 1.1, Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 10/98 (10%)

Query  50   FVADLTAGEGIDVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQ  109
            +V+D+ A  G+D+   G     +T+S +D    N G  S+D  + T S G   +N ++++
Sbjct  447  YVSDMRAHPGLDLLNRG---LPVTISPDDPAIFNYGGLSYDFFELTYSWG---LNLQQLK  500

Query  110  DI-VGAMVGSNT--ESGITVTYEDSDGTLDFNVADPVI  144
             + + ++  SNT  +S   + Y   +    FN  D +I
Sbjct  501  QLAINSINHSNTFNQSEYNLLYNAWEVKW-FNFIDYII  537


 Score = 33.1 bits (74),  Expect = 1.8, Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 7/72 (9%)

Query  183  YVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAERIQ  242
            YVS + A  G+D+   G     VTIS +D    N G  S+D  + T S G   +N ++++
Sbjct  447  YVSDMRAHPGLDLLNRG---LPVTISPDDPAIFNYGGLSYDFFELTYSWG---LNLQQLK  500

Query  243  DI-VGAMFSSNT  253
             + + ++  SNT
Sbjct  501  QLAINSINHSNT  512


>sp|Q9H582.2|ZN644_HUMAN RecName: Full=Zinc finger protein 644; AltName: Full=Zinc finger 
motif enhancer-binding protein 2; Short=Zep-2
Length=1327

 Score = 33.9 bits (76),  Expect = 1.3, Method: Composition-based stats.
 Identities = 36/150 (24%), Positives = 66/150 (44%), Gaps = 7/150 (4%)

Query  25   NLGDATLTATITANSVALGTDTTGNFVADLTAG--EGIDVSGGGSENATITVSAEDATSS  82
            N+ D  +   IT     L  D   NF++D  +G  +  D      +N T+T+  E +   
Sbjct  25   NMDDLKINTDITGAKEELLDDN--NFISDKESGVHKPKDCQTSFQKNNTLTLPEELSKDK  82

Query  83   NKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMV-GSNTESGITVTYEDSDGTLDFNVAD  141
            ++   S   +   + +GA TV++E      GA V G  + S +T T   + G++      
Sbjct  83   SENALSGGQSSLFIHAGAPTVSSENFILPKGAAVNGPVSHSSLTKTSNMNKGSVSLTTGQ  142

Query  142  PVITLSGDVAGSATMTNLGDVTISTTIQAN  171
            PV   + +    +T+    D+ +ST  +A+
Sbjct  143  PVDQPTTE--SCSTLKVAADLQLSTPQKAS  170


>sp|C4L8X0.1|PNP_TOLAT RecName: Full=Polyribonucleotide nucleotidyltransferase; AltName: 
Full=Polynucleotide phosphorylase; Short=PNPase
Length=720

 Score = 33.5 bits (75),  Expect = 1.6, Method: Compositional matrix adjust.
 Identities = 26/106 (24%), Positives = 49/106 (46%), Gaps = 13/106 (12%)

Query  193  IDVSGSGSETATVTIS-AEDAT-------DSNKGIASFDATDFTVSSGDVTVNAERIQDI  244
            I + G   E   + +  A DA        D   G+A  D +DF      + +N E+I+D+
Sbjct  508  IKIEGITKEIMQIALKQARDARLHILTVMDKAIGVARDDISDFAPRIHTIKINPEKIKDV  567

Query  245  VGA----MFSSNTESGISVTYEDSDGTIDLDVSDPTLSLQAMSQVQ  286
            +G     + +   E+G ++  ED DGT+ +       + +A+ ++Q
Sbjct  568  IGKGGSVIRALTEETGTTIELED-DGTVKIAAVSGEAAQEAIRRIQ  612


>sp|C5BFC1.1|PNP_EDWI9 RecName: Full=Polyribonucleotide nucleotidyltransferase; AltName: 
Full=Polynucleotide phosphorylase; Short=PNPase
Length=707

 Score = 32.7 bits (73),  Expect = 2.5, Method: Compositional matrix adjust.
 Identities = 20/82 (24%), Positives = 44/82 (53%), Gaps = 9/82 (10%)

Query  210  EDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIVGA----MFSSNTESGISVTYEDSD  265
            E A ++ +G    D ++F      + +N E+I+D++G     + +   E+G ++  ED D
Sbjct  538  EQAINAPRG----DISEFAPRIHTIKINPEKIKDVIGKGGSVIRALTEETGTTIEIED-D  592

Query  266  GTIDLDVSDPTLSLQAMSQVQE  287
            GT+ +  +D   +  A+ +++E
Sbjct  593  GTVKIAATDGDKAKHAIRRIEE  614


>sp|O82427.2|SMT2_ORYSJ RecName: Full=24-methylenesterol C-methyltransferase 2; Short=24-sterol 
C-methyltransferase 2; Short=Sterol-C-methyltransferase 
2
Length=363

 Score = 32.0 bits (71),  Expect = 4.1, Method: Compositional matrix adjust.
 Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 4/43 (9%)

Query  220  ASFDATDFTVSSGDVTVNAERIQDIVGAMFSSNTESGISVTYE  262
            ASFD       S + T +A R+QD+ G +F      G+ V+YE
Sbjct  195  ASFDGA----YSIEATCHAPRLQDVYGEVFRVLKPGGLYVSYE  233


>sp|Q971V9.1|TRM1_SULTO RecName: Full=N(2),N(2)-dimethylguanosine tRNA methyltransferase; 
AltName: Full=tRNA(guanine-26,N(2)-N(2)) methyltransferase; 
AltName: Full=tRNA 2,2-dimethylguanosine-26 methyltransferase; 
AltName: Full=tRNA(m(2,2)G26)dimethyltransferase
Length=374

 Score = 31.2 bits (69),  Expect = 7.0, Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 2/66 (3%)

Query  193  IDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIVGAMFSSN  252
            +D+   GS  A   +SA +AT  NKG  +F ATD +        +A R  D++    S +
Sbjct  126  VDIDPFGS-PAPFILSAINAT-INKGYVAFTATDLSALECSSKFSARRKYDLICERLSFS  183

Query  253  TESGIS  258
             E GI 
Sbjct  184  KELGIR  189


>sp|A1S467.1|PNP_SHEAM RecName: Full=Polyribonucleotide nucleotidyltransferase; AltName: 
Full=Polynucleotide phosphorylase; Short=PNPase
Length=699

 Score = 31.2 bits (69),  Expect = 7.3, Method: Compositional matrix adjust.
 Identities = 22/81 (27%), Positives = 45/81 (55%), Gaps = 5/81 (6%)

Query  211  DATDSNKGIASFDATDFTVSSGDVTVNAERIQDIV---GAMFSSNT-ESGISVTYEDSDG  266
            +  D   G A  D +DF      + +N E+I+D++   GA+  + T E+G ++  ED DG
Sbjct  534  NVMDQAIGSARPDISDFAPRITTIKINPEKIRDVIGKGGAVIRALTEETGTTIELED-DG  592

Query  267  TIDLDVSDPTLSLQAMSQVQE  287
            T+ +  ++   + +A+ +++E
Sbjct  593  TVKIASNNGDATREAIRRIEE  613


>sp|Q32BG9.2|PNP_SHIDS RecName: Full=Polyribonucleotide nucleotidyltransferase; AltName: 
Full=Polynucleotide phosphorylase; Short=PNPase
Length=711

 Score = 31.2 bits (69),  Expect = 7.5, Method: Compositional matrix adjust.
 Identities = 19/82 (23%), Positives = 44/82 (53%), Gaps = 9/82 (10%)

Query  210  EDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIVGA----MFSSNTESGISVTYEDSD  265
            E A ++ +G    D ++F      + +N ++I+D++G     + +   E+G ++  ED D
Sbjct  538  EQAINAPRG----DISEFAPRIHTIKINPDKIKDVIGKGGSVIRALTEETGTTIEIED-D  592

Query  266  GTIDLDVSDPTLSLQAMSQVQE  287
            GT+ +  +D   +  A+ +++E
Sbjct  593  GTVKIAATDGEKAKHAIRRIEE  614

---------------------------------------------------------------------------------------------------------------

b) BLASTp versus NR

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|ZP_00958302.1|  putative RTX family exoprotein [Roseovariu...  36.6    3.1  

ALIGNMENTS
>ref|ZP_00958302.1| putative RTX family exoprotein [Roseovarius nubinhibens ISM]
 gb|EAP76764.1| putative RTX family exoprotein [Roseovarius nubinhibens ISM]
Length=1065

 Score = 36.6 bits (83),  Expect = 3.1, Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 10/85 (11%)

Query  131  SDG--TLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSAIS  188
            SDG  T++  V +P  T S D+AG        ++TI TT    S+  GT+ +G   +A  
Sbjct  251  SDGMHTVNVTVVEPDGTTS-DLAGP-------EITIDTTPPETSVTQGTEASGEIFNAEE  302

Query  189  AGEGIDVSGSGSETATVTISAEDAT  213
             G+GI+++GSG   AT++++ E  T
Sbjct  303  FGQGIELAGSGEPGATISVTVEGVT  327



  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Apr 5, 2010  5:43 PM
  Number of letters in database: -605,171,825
  Number of sequences in database:  10,820,686

Lambda     K      H
   0.308    0.125    0.327 
Gapped
Lambda     K      H
   0.267   0.0410    0.140 
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 10820686
Number of Hits to DB: 186012882
Number of extensions: 7620659
Number of successful extensions: 33882
Number of sequences better than 100: 1336
Number of HSP's better than 100 without gapping: 0
Number of HSP's gapped: 30644
Number of HSP's successfully gapped: 3349
Length of query: 289
Length of database: 3689795467
Length adjustment: 136
Effective length of query: 153
Effective length of database: 2218182171
Effective search space: 339381872163
Effective search space used: 339381872163
T: 11
A: 40
X1: 16 (7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (20.8 bits)
S2: 71 (32.0 bits)
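
The statistics block above contains the numbers needed to relate a raw alignment score to the bit score and E-value shown for the single NR hit. The following is a minimal sketch, assuming the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, and E = effective search space * 2^(-bit score)). The function names `bit_score` and `expect_value` are illustrative only and are not part of any BLAST tool. Because this search used compositional matrix adjustment, the E-value computed this way is only expected to approximate the reported 3.1, not reproduce it exactly.

```python
import math

# Gapped Karlin-Altschul parameters and search space, copied from the
# statistics block of the BLASTp versus NR report above.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 339381872163


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value for a bit score over the effective search space."""
    return search_space * 2.0 ** (-bits)


# Raw score 83 is the Roseovarius nubinhibens hit; raw score 71 is the S2 cutoff.
hit_bits = bit_score(83)
threshold_bits = bit_score(71)
hit_e = expect_value(hit_bits)

print(f"hit: {hit_bits:.1f} bits, E ~ {hit_e:.1f}")
print(f"S2 cutoff: {threshold_bits:.1f} bits")
```

With these parameters the raw scores 83 and 71 round to 36.6 and 32.0 bits, matching the values printed in the report, and the estimated E-value is on the order of 3. An E-value this close to 1 or above means an alignment of this quality is expected by chance in a database of this size, so the hit alone is weak evidence of homology.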
------------------------------------------------------------------------------------------------------

c) BLASTp vs ENV_NR

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ECQ56051.1|  hypothetical protein GOS_5169218 [marine metag...   512    4e-144
gb|EBI55730.1|  hypothetical protein GOS_9072598 [marine metag...   508    9e-143
gb|ECU60947.1|  hypothetical protein GOS_3684502 [marine metag...   470    2e-131
gb|EBK02554.1|  hypothetical protein GOS_8800807 [marine metag...   421    8e-117
gb|ECO48045.1|  hypothetical protein GOS_5264726 [marine metag...   410    1e-113
gb|ECA77004.1|  hypothetical protein GOS_3535274 [marine metag...   279    5e-74 
gb|EBN00096.1|  hypothetical protein GOS_8317746 [marine metag...   275    7e-73 
gb|ECF90785.1|  hypothetical protein GOS_4361104 [marine metag...   268    2e-70 
gb|EBN00095.1|  hypothetical protein GOS_8317745 [marine metag...   231    2e-59 
gb|EBF10661.1|  hypothetical protein GOS_9651759 [marine metag...   213    5e-54 
gb|ECD62170.1|  hypothetical protein GOS_6230582 [marine metag...   210    3e-53 
gb|EBZ19920.1|  hypothetical protein GOS_6319502 [marine metag...   199    8e-50 
gb|ECY49850.1|  hypothetical protein GOS_2349906 [marine metag...   182    1e-44 
gb|EBU13896.1|  hypothetical protein GOS_7160436 [marine metag...   181    2e-44 
gb|ECF14109.1|  hypothetical protein GOS_3913411 [marine metag...   174    3e-42 
gb|ECC68425.1|  hypothetical protein GOS_6432322 [marine metag...   150    4e-35 
gb|ECO48044.1|  hypothetical protein GOS_5264725 [marine metag...   146    6e-34 
gb|EBW38477.1|  hypothetical protein GOS_6755274 [marine metag...   142    1e-32 
gb|ECK79673.1|  hypothetical protein GOS_5943479 [marine metag...   140    4e-32 
gb|EBF94718.1|  hypothetical protein GOS_9514675 [marine metag...   139    9e-32 
gb|EBO60502.1|  hypothetical protein GOS_8052260 [marine metag...   137    3e-31 
gb|ECB21161.1|  hypothetical protein GOS_5256826 [marine metag...   134    3e-30 
gb|EBV76117.1|  hypothetical protein GOS_6854773 [marine metag...   129    6e-29 
gb|EBR10882.1|  hypothetical protein GOS_7651555 [marine metag...   127    4e-28 
gb|ECE51122.1|  hypothetical protein GOS_6404106 [marine metag...   124    3e-27 
gb|EBC30112.1|  hypothetical protein GOS_92341 [marine metagen...   120    3e-26 
gb|EDI64836.1|  hypothetical protein GOS_397186 [marine metage...   119    1e-25 
gb|EBO12333.1|  hypothetical protein GOS_8132913 [marine metag...   116    6e-25 
gb|ECS85011.1|  hypothetical protein GOS_3083047 [marine metag...   115    9e-25 
gb|ECP00209.1|  hypothetical protein GOS_3244776 [marine metag...   114    4e-24 
gb|EBE87298.1|  hypothetical protein GOS_9690626 [marine metag...   114    4e-24 
gb|ECN47508.1|  hypothetical protein GOS_5783137 [marine metag...   113    6e-24 
gb|ECI60160.1|  hypothetical protein GOS_4158775 [marine metag...   111    2e-23 
gb|EBM46864.1|  hypothetical protein GOS_8405108 [marine metag...   110    3e-23 
gb|ECK39805.1|  hypothetical protein GOS_4018050 [marine metag...   110    3e-23 
gb|ECJ58439.1|  hypothetical protein GOS_3764684 [marine metag...   110    4e-23 
gb|ECY52169.1|  hypothetical protein GOS_2345855 [marine metag...   105    2e-21 
gb|EBV21080.1|  hypothetical protein GOS_6939859 [marine metag...   103    5e-21 
gb|ECI60422.1|  hypothetical protein GOS_4147248 [marine metag...   102    9e-21 
gb|EDE92849.1|  hypothetical protein GOS_1045758 [marine metag...  96.3    8e-19 
gb|EBI94255.1|  hypothetical protein GOS_9007689 [marine metag...  95.9    1e-18 
gb|ECR44511.1|  hypothetical protein GOS_5147886 [marine metag...  95.5    1e-18 
gb|EBE45128.1|  hypothetical protein GOS_9761584 [marine metag...  94.4    3e-18 
gb|ECA31822.1|  hypothetical protein GOS_5321136 [marine metag...  93.2    6e-18 
gb|ECX47690.1|  hypothetical protein GOS_2531194 [marine metag...  93.2    7e-18 
gb|EBK17612.1|  hypothetical protein GOS_8775896 [marine metag...  93.2    7e-18 
gb|EBS15778.1|  hypothetical protein GOS_7481998 [marine metag...  93.2    7e-18 
gb|ECR09597.1|  hypothetical protein GOS_3063459 [marine metag...  89.7    7e-17 
gb|EDI26276.1|  hypothetical protein GOS_461932 [marine metage...  89.4    9e-17 
gb|EDE30477.1|  hypothetical protein GOS_1153860 [marine metag...  87.8    3e-16 
gb|ECE17310.1|  hypothetical protein GOS_4009675 [marine metag...  87.0    5e-16 
gb|EBA93356.1|  hypothetical protein GOS_316922 [marine metage...  86.7    6e-16 
gb|ECE44479.1|  hypothetical protein GOS_6436683 [marine metag...  85.9    9e-16 
gb|ECA74234.1|  hypothetical protein GOS_3644143 [marine metag...  85.5    1e-15 
gb|ECJ58438.1|  hypothetical protein GOS_3764683 [marine metag...  84.3    3e-15 
gb|EDE17411.1|  hypothetical protein GOS_1176743 [marine metag...  84.3    3e-15 
gb|EBP28278.1|  hypothetical protein GOS_7936162 [marine metag...  84.3    3e-15 
gb|ECC80836.1|  hypothetical protein GOS_5942011 [marine metag...  82.0    1e-14 
gb|ECY25851.1|  hypothetical protein GOS_2390320 [marine metag...  81.3    2e-14 
gb|EBP73787.1|  hypothetical protein GOS_7862641 [marine metag...  81.3    3e-14 
gb|ECQ56052.1|  hypothetical protein GOS_5169219 [marine metag...  79.3    9e-14 
gb|ECX82963.1|  hypothetical protein GOS_2468239 [marine metag...  79.0    1e-13 
gb|EBY19293.1|  hypothetical protein GOS_3951438 [marine metag...  79.0    1e-13 
gb|EBL72200.1|  hypothetical protein GOS_8525075 [marine metag...  79.0    1e-13 
gb|EBG94045.1|  hypothetical protein GOS_9347983 [marine metag...  78.6    1e-13 
gb|EBM13093.1|  hypothetical protein GOS_8458125 [marine metag...  78.6    2e-13 
gb|EDI26772.1|  hypothetical protein GOS_461213 [marine metage...  78.6    2e-13 
gb|EBX30343.1|  hypothetical protein GOS_6609041 [marine metag...  76.6    6e-13 
gb|ECZ32386.1|  hypothetical protein GOS_2203601 [marine metag...  76.6    6e-13 
gb|EBQ39398.1|  hypothetical protein GOS_7758136 [marine metag...  76.3    9e-13 
gb|EBF20249.1|  hypothetical protein GOS_9636315 [marine metag...  75.9    1e-12 
gb|ECS72932.1|  hypothetical protein GOS_3551614 [marine metag...  75.9    1e-12 
gb|EBX27608.1|  hypothetical protein GOS_6613241 [marine metag...  75.9    1e-12 
gb|ECI32550.1|  hypothetical protein GOS_5243037 [marine metag...  75.5    1e-12 
gb|ECC98583.1|  hypothetical protein GOS_5232199 [marine metag...  75.5    2e-12 
gb|ECD16282.1|  hypothetical protein GOS_4524411 [marine metag...  75.1    2e-12 
gb|ECA04922.1|  hypothetical protein GOS_6423350 [marine metag...  74.7    3e-12 
gb|ECO22409.1|  hypothetical protein GOS_6331339 [marine metag...  74.3    3e-12 
gb|EDH31491.1|  hypothetical protein GOS_627876 [marine metage...  74.3    3e-12 
gb|ECE27434.1|  hypothetical protein GOS_3616572 [marine metag...  74.3    3e-12 
gb|EBQ80931.1|  hypothetical protein GOS_7694594 [marine metag...  73.2    7e-12 
gb|EDE57624.1|  hypothetical protein GOS_1106703 [marine metag...  72.8    9e-12 
gb|ECX93403.1|  hypothetical protein GOS_2448931 [marine metag...  72.8    1e-11 
gb|EDI66600.1|  hypothetical protein GOS_394367 [marine metage...  72.4    1e-11 
gb|ECY19826.1|  hypothetical protein GOS_2400256 [marine metag...  72.0    2e-11 
gb|EBX42754.1|  hypothetical protein GOS_6588904 [marine metag...  71.6    2e-11 
gb|ECD45146.1|  hypothetical protein GOS_3394013 [marine metag...  71.6    2e-11 
gb|EBX32528.1|  hypothetical protein GOS_6605480 [marine metag...  71.2    2e-11 
gb|ECL10081.1|  hypothetical protein GOS_4715056 [marine metag...  70.5    4e-11 
gb|ECI32547.1|  hypothetical protein GOS_5243034 [marine metag...  70.5    4e-11 
gb|ECY20890.1|  hypothetical protein GOS_2398556 [marine metag...  69.7    8e-11 
gb|EBW14363.1|  hypothetical protein GOS_6793587 [marine metag...  68.2    2e-10 
gb|EDG59738.1|  hypothetical protein GOS_754451 [marine metage...  67.8    3e-10 
gb|EDC98173.1|  hypothetical protein GOS_1380445 [marine metag...  67.4    3e-10 
gb|EBN25393.1|  hypothetical protein GOS_8276775 [marine metag...  66.2    9e-10 
gb|EBD44687.1|  hypothetical protein GOS_9928433 [marine metag...  65.9    1e-09 
gb|EBK26187.1|  hypothetical protein GOS_8761644 [marine metag...  65.9    1e-09 
gb|EBM38701.1|  hypothetical protein GOS_8418231 [marine metag...  65.5    1e-09 
gb|ECS68693.1|  hypothetical protein GOS_3728776 [marine metag...  65.5    2e-09 
gb|ECI84929.1|  hypothetical protein GOS_3181684 [marine metag...  64.3    3e-09 
gb|ECR72433.1|  hypothetical protein GOS_4044285 [marine metag...  64.3    3e-09 
gb|ECS72933.1|  hypothetical protein GOS_3551615 [marine metag...  64.3    3e-09 
gb|ECS69245.1|  hypothetical protein GOS_3705600 [marine metag...  64.3    3e-09 
gb|EBM24693.1|  hypothetical protein GOS_8439880 [marine metag...  64.3    4e-09 
gb|EDI79625.1|  hypothetical protein GOS_373445 [marine metage...  63.9    4e-09 
gb|EBN01152.1|  hypothetical protein GOS_8316022 [marine metag...  63.9    4e-09 
gb|EBK78456.1|  hypothetical protein GOS_8675540 [marine metag...  63.9    4e-09 
gb|ECI60423.1|  hypothetical protein GOS_4147249 [marine metag...  63.5    5e-09 
gb|ECK51963.1|  hypothetical protein GOS_3539912 [marine metag...  63.5    5e-09 
gb|EBP12701.1|  hypothetical protein GOS_7962674 [marine metag...  63.2    7e-09 
gb|EDB42701.1|  hypothetical protein GOS_1826056 [marine metag...  63.2    7e-09 
gb|ECT94926.1|  hypothetical protein GOS_3942041 [marine metag...  63.2    7e-09 
gb|EDI99669.1|  hypothetical protein GOS_1772591 [marine metag...  63.2    7e-09 
gb|EDH13988.1|  hypothetical protein GOS_659311 [marine metage...  62.8    8e-09 
gb|EDI60058.1|  hypothetical protein GOS_405208 [marine metage...  62.4    1e-08 
gb|EBE26229.1|  hypothetical protein GOS_9793258 [marine metag...  62.4    1e-08 
gb|EDD34354.1|  hypothetical protein GOS_1319586 [marine metag...  62.4    1e-08 
gb|EBR17651.1|  hypothetical protein GOS_7641531 [marine metag...  62.0    2e-08 
gb|EBO56089.1|  hypothetical protein GOS_8059569 [marine metag...  62.0    2e-08 
gb|EBM12783.1|  hypothetical protein GOS_8458625 [marine metag...  62.0    2e-08 
gb|EBY17519.1|  hypothetical protein GOS_4156167 [marine metag...  62.0    2e-08 
gb|ECK00320.1|  hypothetical protein GOS_5578377 [marine metag...  61.6    2e-08 
gb|EBL84113.1|  hypothetical protein GOS_8505547 [marine metag...  61.6    2e-08 
gb|EDG30285.1|  hypothetical protein GOS_805236 [marine metage...  61.6    2e-08 
gb|EBA58722.1|  hypothetical protein GOS_5014 [marine metagenome]  61.6    2e-08 
gb|EBN04827.1|  hypothetical protein GOS_8310454 [marine metag...  61.6    2e-08 
gb|EBY97548.1|  hypothetical protein GOS_3699701 [marine metag...  61.2    3e-08 
gb|ECV20887.1|  hypothetical protein GOS_2943458 [marine metag...  60.8    4e-08 
gb|ECS60108.1|  hypothetical protein GOS_4070134 [marine metag...  60.8    4e-08 
gb|EBW49552.1|  hypothetical protein GOS_6737888 [marine metag...  60.5    4e-08 
gb|EDC21123.1|  hypothetical protein GOS_1516476 [marine metag...  60.5    4e-08 
gb|EBE92881.1|  hypothetical protein GOS_9681259 [marine metag...  60.5    4e-08 
gb|ECA89932.1|  hypothetical protein GOS_3038027 [marine metag...  60.5    4e-08 
gb|EBK39364.1|  hypothetical protein GOS_8740186 [marine metag...  60.5    5e-08 
gb|ECC87825.1|  hypothetical protein GOS_5666071 [marine metag...  60.5    5e-08 
gb|EDC35091.1|  hypothetical protein GOS_1491442 [marine metag...  60.5    5e-08 
gb|EDB48990.1|  hypothetical protein GOS_1815279 [marine metag...  60.5    5e-08 
gb|EBL24108.1|  hypothetical protein GOS_8602498 [marine metag...  60.1    6e-08 
gb|EBB08137.1|  hypothetical protein GOS_292273 [marine metage...  60.1    6e-08 
gb|EBU77606.1|  hypothetical protein GOS_7007802 [marine metag...  60.1    6e-08 
gb|EBD63892.1|  hypothetical protein GOS_9896556 [marine metag...  60.1    6e-08 
gb|EBQ89141.1|  hypothetical protein GOS_7682073 [marine metag...  60.1    7e-08 
gb|EBO15230.1|  hypothetical protein GOS_8127955 [marine metag...  60.1    7e-08 
gb|ECA74233.1|  hypothetical protein GOS_3644142 [marine metag...  60.1    7e-08 
gb|EDD05746.1|  hypothetical protein GOS_1367052 [marine metag...  60.1    7e-08 
gb|EDB06220.1|  hypothetical protein GOS_1888570 [marine metag...  59.7    8e-08 
gb|ECU06694.1|  hypothetical protein GOS_3471926 [marine metag...  59.7    9e-08 
gb|EBC85426.1|  hypothetical protein GOS_3776 [marine metagenome]  59.7    9e-08 
gb|EBY81985.1|  hypothetical protein GOS_4308020 [marine metag...  59.3    1e-07 
gb|ECI30484.1|  hypothetical protein GOS_5326106 [marine metag...  59.3    1e-07 
gb|EBW59587.1|  hypothetical protein GOS_6721954 [marine metag...  59.3    1e-07 
gb|ECD25370.1|  hypothetical protein GOS_4164708 [marine metag...  59.3    1e-07 
gb|ECQ06364.1|  hypothetical protein GOS_3624809 [marine metag...  59.3    1e-07 
gb|EBD43734.1|  hypothetical protein GOS_9929999 [marine metag...  59.3    1e-07 
gb|EDG00557.1|  hypothetical protein GOS_856708 [marine metage...  58.9    1e-07 
gb|EBC76198.1|  hypothetical protein GOS_18184 [marine metagen...  58.9    1e-07 
gb|EBL21789.1|  hypothetical protein GOS_8605652 [marine metag...  58.9    1e-07 
gb|ECS71907.1|  hypothetical protein GOS_3595630 [marine metag...  58.9    1e-07 
gb|EBT40305.1|  hypothetical protein GOS_7279530 [marine metag...  58.9    1e-07 
gb|EBW22403.1|  hypothetical protein GOS_6780724 [marine metag...  58.9    1e-07 
gb|EBN73485.1|  hypothetical protein GOS_8197190 [marine metag...  58.9    1e-07 
gb|EBQ56247.1|  hypothetical protein GOS_7732377 [marine metag...  58.9    1e-07 
gb|EBV84348.1|  hypothetical protein GOS_6842015 [marine metag...  58.9    2e-07 
gb|EBE11790.1|  hypothetical protein GOS_9817792 [marine metag...  58.9    2e-07 
gb|EBX59023.1|  hypothetical protein GOS_6563619 [marine metag...  58.5    2e-07 
gb|ECE35179.1|  hypothetical protein GOS_3305944 [marine metag...  58.5    2e-07 
gb|EBZ08460.1|  hypothetical protein GOS_3280270 [marine metag...  58.5    2e-07 
gb|EDD23828.1|  hypothetical protein GOS_1335326 [marine metag...  58.5    2e-07 
gb|EBO96333.1|  hypothetical protein GOS_7990663 [marine metag...  58.5    2e-07 
gb|EDD43369.1|  hypothetical protein GOS_1305430 [marine metag...  58.5    2e-07 
gb|EDB14178.1|  hypothetical protein GOS_1875163 [marine metag...  58.5    2e-07 
gb|ECY27122.1|  hypothetical protein GOS_2388061 [marine metag...  58.5    2e-07 
gb|ECR10369.1|  hypothetical protein GOS_3033979 [marine metag...  58.5    2e-07 
gb|EBU90169.1|  hypothetical protein GOS_6987740 [marine metag...  58.5    2e-07 
gb|EDH22221.1|  hypothetical protein GOS_644371 [marine metage...  58.2    2e-07 
gb|ECT97509.1|  hypothetical protein GOS_3839902 [marine metag...  58.2    2e-07 
gb|EBW09435.1|  hypothetical protein GOS_6801620 [marine metag...  58.2    2e-07 
gb|EBX36840.1|  hypothetical protein GOS_6598403 [marine metag...  58.2    2e-07 
gb|EBQ93937.1|  hypothetical protein GOS_7674941 [marine metag...  58.2    2e-07 
gb|EDC69039.1|  hypothetical protein GOS_1431568 [marine metag...  58.2    2e-07 
gb|EDB37696.1|  hypothetical protein GOS_1834557 [marine metag...  58.2    2e-07 
gb|EBU31161.1|  hypothetical protein GOS_7133760 [marine metag...  58.2    2e-07 
gb|EBV31202.1|  hypothetical protein GOS_6925460 [marine metag...  58.2    3e-07 
gb|EDH37426.1|  hypothetical protein GOS_617362 [marine metage...  57.8    3e-07 
gb|ECO76430.1|  hypothetical protein GOS_4154776 [marine metag...  57.8    3e-07 
gb|EBQ33108.1|  hypothetical protein GOS_7767416 [marine metag...  57.8    3e-07 
gb|ECS03563.1|  hypothetical protein GOS_6312620 [marine metag...  57.8    3e-07 
gb|EBM92760.1|  hypothetical protein GOS_8329888 [marine metag...  57.8    3e-07 
gb|ECS80710.1|  hypothetical protein GOS_3253434 [marine metag...  57.8    3e-07 
gb|ECI02696.1|  hypothetical protein GOS_6437327 [marine metag...  57.8    3e-07 
gb|ECB45473.1|  hypothetical protein GOS_4294631 [marine metag...  57.8    3e-07 
gb|EBV40755.1|  hypothetical protein GOS_6910927 [marine metag...  57.8    3e-07 
gb|EBD40113.1|  hypothetical protein GOS_9935675 [marine metag...  57.8    3e-07 
gb|ECM41762.1|  hypothetical protein GOS_3020101 [marine metag...  57.8    3e-07 
gb|ECI56427.1|  hypothetical protein GOS_4296957 [marine metag...  57.8    3e-07 
gb|EDG07735.1|  hypothetical protein GOS_844438 [marine metage...  57.8    3e-07 
gb|ECK57304.1|  hypothetical protein GOS_3336067 [marine metag...  57.8    3e-07 
gb|ECB85267.1|  hypothetical protein GOS_6233971 [marine metag...  57.8    3e-07 
gb|EDI03242.1|  hypothetical protein GOS_500344 [marine metage...  57.8    3e-07 
gb|EDD26970.1|  hypothetical protein GOS_1330026 [marine metag...  57.4    4e-07 
gb|ECT55150.1|  hypothetical protein GOS_5519507 [marine metag...  57.4    4e-07 
gb|EBQ03226.1|  hypothetical protein GOS_7814100 [marine metag...  57.4    4e-07 
gb|EBQ71299.1|  hypothetical protein GOS_7709359 [marine metag...  57.4    4e-07 
gb|ECB88803.1|  hypothetical protein GOS_6093529 [marine metag...  57.4    4e-07 
gb|ECA87796.1|  hypothetical protein GOS_3120972 [marine metag...  57.4    4e-07 
gb|ECI72967.1|  hypothetical protein GOS_3651076 [marine metag...  57.4    4e-07 
gb|EDI40521.1|  hypothetical protein GOS_438579 [marine metage...  57.4    4e-07 
gb|EBQ76081.1|  hypothetical protein GOS_7701908 [marine metag...  57.4    4e-07 
gb|ECF70940.1|  hypothetical protein GOS_5153866 [marine metag...  57.4    4e-07 
gb|EBK40928.1|  hypothetical protein GOS_8737658 [marine metag...  57.4    4e-07 
gb|ECS78260.1|  hypothetical protein GOS_3341351 [marine metag...  57.4    4e-07 
gb|EBN86456.1|  hypothetical protein GOS_8175663 [marine metag...  57.4    4e-07 
gb|ECA89959.1|  hypothetical protein GOS_3037437 [marine metag...  57.0    5e-07 
gb|EBW35397.1|  hypothetical protein GOS_6759939 [marine metag...  57.0    5e-07 
gb|EDC11791.1|  hypothetical protein GOS_1533052 [marine metag...  57.0    6e-07 
gb|ECX44925.1|  hypothetical protein GOS_2536245 [marine metag...  57.0    6e-07 
gb|ECS89331.1|  hypothetical protein GOS_8935956 [marine metag...  57.0    6e-07 
gb|EBL93783.1|  hypothetical protein GOS_8489705 [marine metag...  56.6    6e-07 
gb|EBT92859.1|  hypothetical protein GOS_7192971 [marine metag...  56.6    6e-07 
gb|ECU49574.1|  hypothetical protein GOS_4134731 [marine metag...  56.6    6e-07 
gb|ECQ26478.1|  hypothetical protein GOS_6340275 [marine metag...  56.6    6e-07 
gb|EDI21038.1|  hypothetical protein GOS_470074 [marine metage...  56.6    6e-07 
gb|ECR08355.1|  hypothetical protein GOS_3107045 [marine metag...  56.6    6e-07 
gb|ECJ92246.1|  hypothetical protein GOS_5921123 [marine metag...  56.6    6e-07 
gb|ECE83753.1|  hypothetical protein GOS_5094545 [marine metag...  56.6    7e-07 
gb|ECE67000.1|  hypothetical protein GOS_5769361 [marine metag...  56.6    7e-07 
gb|EBI60594.1|  hypothetical protein GOS_9064360 [marine metag...  56.6    7e-07 
gb|ECB22464.1|  hypothetical protein GOS_5207900 [marine metag...  56.6    7e-07 
gb|ECC25510.1|  hypothetical protein GOS_4609863 [marine metag...  56.6    7e-07 
gb|EDD35626.1|  hypothetical protein GOS_1317812 [marine metag...  56.6    8e-07 
gb|ECU49573.1|  hypothetical protein GOS_4134730 [marine metag...  56.2    8e-07 
gb|EDI65054.1|  hypothetical protein GOS_396837 [marine metage...  56.2    8e-07 
gb|EBI19527.1|  hypothetical protein GOS_9133580 [marine metag...  56.2    8e-07 
gb|ECU60948.1|  hypothetical protein GOS_3684503 [marine metag...  56.2    8e-07 
gb|ECO20540.1|  hypothetical protein GOS_6398916 [marine metag...  56.2    9e-07 
gb|EBI61767.1|  hypothetical protein GOS_9062422 [marine metag...  56.2    9e-07 
gb|ECB85102.1|  hypothetical protein GOS_6240227 [marine metag...  56.2    9e-07 
gb|ECV13613.1|  hypothetical protein GOS_2956723 [marine metag...  56.2    1e-06 
gb|ECP35066.1|  hypothetical protein GOS_6447140 [marine metag...  56.2    1e-06 
gb|ECP14791.1|  hypothetical protein GOS_6151891 [marine metag...  56.2    1e-06 
gb|EBL64432.1|  hypothetical protein GOS_8537933 [marine metag...  55.8    1e-06 
gb|EBZ75867.1|  hypothetical protein GOS_4069712 [marine metag...  55.8    1e-06 
gb|EBC80910.1|  hypothetical protein GOS_11000 [marine metagen...  55.8    1e-06 
gb|EDB39462.1|  hypothetical protein GOS_1831595 [marine metag...  55.8    1e-06 
gb|ECM87620.1|  hypothetical protein GOS_4660312 [marine metag...  55.8    1e-06 
gb|EDG04812.1|  hypothetical protein GOS_849479 [marine metage...  55.8    1e-06 
gb|ECI86682.1|  hypothetical protein GOS_3111499 [marine metag...  55.8    1e-06 
gb|ECT68630.1|  hypothetical protein GOS_4973544 [marine metag...  55.8    1e-06 
gb|EBL39148.1|  hypothetical protein GOS_8579400 [marine metag...  55.8    1e-06 
gb|EBW41992.1|  hypothetical protein GOS_6749768 [marine metag...  55.8    1e-06 
gb|ECG11935.1|  hypothetical protein GOS_3557640 [marine metag...  55.8    1e-06 
gb|EDI78636.1|  hypothetical protein GOS_374929 [marine metage...  55.5    1e-06 
gb|ECI68283.1|  hypothetical protein GOS_3845741 [marine metag...  55.5    1e-06 
gb|EBY04914.1|  hypothetical protein GOS_5627071 [marine metag...  55.5    1e-06 
gb|EBL71022.1|  hypothetical protein GOS_8527052 [marine metag...  55.5    1e-06 
gb|EBR02648.1|  hypothetical protein GOS_7662229 [marine metag...  55.5    1e-06 
gb|EBC06316.1|  hypothetical protein GOS_130323 [marine metage...  55.5    1e-06 
gb|ECT68629.1|  hypothetical protein GOS_4973543 [marine metag...  55.5    2e-06 
gb|EBK41710.1|  hypothetical protein GOS_8736388 [marine metag...  55.5    2e-06 
gb|ECP87510.1|  hypothetical protein GOS_4348176 [marine metag...  55.5    2e-06 
gb|EDC41851.1|  hypothetical protein GOS_1479484 [marine metag...  55.5    2e-06 
gb|EBZ20232.1|  hypothetical protein GOS_6308873 [marine metag...  55.5    2e-06 
gb|EBZ18144.1|  hypothetical protein GOS_6385931 [marine metag...  55.5    2e-06 
gb|EBI51408.1|  hypothetical protein GOS_9079850 [marine metag...  55.5    2e-06 
gb|EBQ74621.1|  hypothetical protein GOS_7704174 [marine metag...  55.1    2e-06 
gb|EBL94727.1|  hypothetical protein GOS_8488186 [marine metag...  55.1    2e-06 
gb|EDI33638.1|  hypothetical protein GOS_450249 [marine metage...  55.1    2e-06 
gb|ECI30219.1|  hypothetical protein GOS_5337669 [marine metag...  55.1    2e-06 
gb|EBP15319.1|  hypothetical protein GOS_7958066 [marine metag...  55.1    2e-06 
gb|ECQ98213.1|  hypothetical protein GOS_3504630 [marine metag...  55.1    2e-06 
gb|ECI11229.1|  hypothetical protein GOS_6101269 [marine metag...  55.1    2e-06 
gb|EBX83153.1|  hypothetical protein GOS_6526044 [marine metag...  55.1    2e-06 
gb|ECG24329.1|  hypothetical protein GOS_3061751 [marine metag...  55.1    2e-06 
gb|ECR81301.1|  hypothetical protein GOS_3698010 [marine metag...  55.1    2e-06 
gb|ECE47880.1|  hypothetical protein GOS_6315758 [marine metag...  55.1    2e-06 
gb|EBQ90942.1|  hypothetical protein GOS_7679332 [marine metag...  55.1    2e-06 
gb|ECC68424.1|  hypothetical protein GOS_6432321 [marine metag...  55.1    2e-06 
gb|ECY33672.1|  hypothetical protein GOS_2377113 [marine metag...  55.1    2e-06 
gb|ECO79554.1|  hypothetical protein GOS_4038412 [marine metag...  54.7    2e-06 
gb|ECN46407.1|  hypothetical protein GOS_5829638 [marine metag...  54.7    2e-06 
gb|EBK22029.1|  hypothetical protein GOS_8768469 [marine metag...  54.7    2e-06 
gb|ECC47402.1|  hypothetical protein GOS_3768112 [marine metag...  54.7    3e-06 
gb|ECU36351.1|  hypothetical protein GOS_4655189 [marine metag...  54.7    3e-06 
gb|EBC44481.1|  hypothetical protein GOS_68804 [marine metagen...  54.7    3e-06 
gb|EBN01151.1|  hypothetical protein GOS_8316021 [marine metag...  54.7    3e-06 
gb|EDD51592.1|  hypothetical protein GOS_1291261 [marine metag...  54.7    3e-06 
gb|ECO05179.1|  hypothetical protein GOS_3512988 [marine metag...  54.7    3e-06 
gb|EBK59179.1|  hypothetical protein GOS_8707213 [marine metag...  54.7    3e-06 
gb|ECB24775.1|  hypothetical protein GOS_5120641 [marine metag...  54.3    3e-06 
gb|EBP27899.1|  hypothetical protein GOS_7936845 [marine metag...  54.3    3e-06 
gb|EBQ26101.1|  hypothetical protein GOS_7777786 [marine metag...  54.3    3e-06 
gb|ECB48470.1|  hypothetical protein GOS_4184752 [marine metag...  54.3    3e-06 
gb|EBO44070.1|  hypothetical protein GOS_8079896 [marine metag...  54.3    4e-06 
gb|EBC87605.1|  hypothetical protein GOS_433 [marine metagenome]   54.3    4e-06 
gb|EDD17173.1|  hypothetical protein GOS_1346914 [marine metag...  54.3    4e-06 
gb|ECA89439.1|  hypothetical protein GOS_3058571 [marine metag...  54.3    4e-06 
gb|EBK09305.1|  hypothetical protein GOS_8789627 [marine metag...  53.9    4e-06 
gb|EBV33417.1|  hypothetical protein GOS_6922163 [marine metag...  53.9    4e-06 
gb|ECI85964.1|  hypothetical protein GOS_3138214 [marine metag...  53.9    4e-06 
gb|EBQ74782.1|  hypothetical protein GOS_7703935 [marine metag...  53.9    4e-06 
gb|ECC47690.1|  hypothetical protein GOS_3757145 [marine metag...  53.9    4e-06 
gb|ECA02325.1|  hypothetical protein GOS_3040395 [marine metag...  53.9    4e-06 
gb|EDF55496.1|  hypothetical protein GOS_934580 [marine metage...  53.9    5e-06 
gb|ECR70060.1|  hypothetical protein GOS_4141648 [marine metag...  53.9    5e-06 
gb|EBH66344.1|  hypothetical protein GOS_9224002 [marine metag...  53.9    5e-06 
gb|EBC08727.1|  hypothetical protein GOS_126278 [marine metage...  53.9    5e-06 
gb|ECD60913.1|  hypothetical protein GOS_6279160 [marine metag...  53.9    5e-06 
gb|EBM92242.1|  hypothetical protein GOS_8330694 [marine metag...  53.9    5e-06 
gb|EBQ48190.1|  hypothetical protein GOS_7744783 [marine metag...  53.9    5e-06 
gb|ECJ20780.1|  hypothetical protein GOS_5225250 [marine metag...  53.9    5e-06 
gb|ECE84098.1|  hypothetical protein GOS_5081094 [marine metag...  53.5    5e-06 
gb|ECA94481.1|  hypothetical protein GOS_6343880 [marine metag...  53.5    5e-06 
gb|EBP50407.1|  hypothetical protein GOS_7900854 [marine metag...  53.5    6e-06 
gb|ECZ53473.1|  hypothetical protein GOS_2166663 [marine metag...  53.5    6e-06 
gb|ECE80352.1|  hypothetical protein GOS_5226024 [marine metag...  53.5    6e-06 
gb|ECA33738.1|  hypothetical protein GOS_5249477 [marine metag...  53.5    6e-06 
gb|EBN15092.1|  hypothetical protein GOS_8293630 [marine metag...  53.5    6e-06 
gb|EDI72880.1|  hypothetical protein GOS_384160 [marine metage...  53.1    7e-06 
gb|EBP74410.1|  hypothetical protein GOS_7861611 [marine metag...  53.1    7e-06 
gb|EBQ36299.1|  hypothetical protein GOS_7762613 [marine metag...  53.1    7e-06 
gb|EDA06906.1|  hypothetical protein GOS_2069416 [marine metag...  53.1    7e-06 
gb|EDC61963.1|  hypothetical protein GOS_1444293 [marine metag...  53.1    7e-06 
gb|EBQ94796.1|  hypothetical protein GOS_7673605 [marine metag...  53.1    7e-06 
gb|ECK44903.1|  hypothetical protein GOS_3815570 [marine metag...  53.1    7e-06 
gb|EBB73959.1|  hypothetical protein GOS_184142 [marine metage...  53.1    8e-06 
gb|ECY28431.1|  hypothetical protein GOS_2385877 [marine metag...  53.1    8e-06 
gb|EBL25832.1|  hypothetical protein GOS_8600010 [marine metag...  53.1    8e-06 
gb|ECI90856.1|  hypothetical protein GOS_6433861 [marine metag...  53.1    8e-06 
gb|EDD54900.1|  hypothetical protein GOS_1285578 [marine metag...  53.1    8e-06 
gb|ECP18089.1|  hypothetical protein GOS_6018377 [marine metag...  53.1    9e-06 
gb|EBC46022.1|  hypothetical protein GOS_66364 [marine metagen...  52.8    9e-06 
gb|ECH67958.1|  hypothetical protein GOS_4298974 [marine metag...  52.8    1e-05 
gb|ECY48280.1|  hypothetical protein GOS_2352635 [marine metag...  52.8    1e-05 
gb|EDH30858.1|  hypothetical protein GOS_629032 [marine metage...  52.4    1e-05 
gb|EBW93010.1|  hypothetical protein GOS_6668605 [marine metag...  52.4    1e-05 
gb|EBO01163.1|  hypothetical protein GOS_8151607 [marine metag...  52.4    1e-05 
gb|ECS78261.1|  hypothetical protein GOS_3341352 [marine metag...  52.4    1e-05 
gb|ECN25780.1|  hypothetical protein GOS_3163402 [marine metag...  52.4    1e-05 
gb|ECM35565.1|  hypothetical protein GOS_3253923 [marine metag...  52.4    1e-05 
gb|ECK49542.1|  hypothetical protein GOS_3636216 [marine metag...  52.0    1e-05 
gb|ECF18320.1|  hypothetical protein GOS_3746048 [marine metag...  52.0    1e-05 
gb|ECP91256.1|  hypothetical protein GOS_4208682 [marine metag...  52.0    2e-05 
gb|ECH99851.1|  hypothetical protein GOS_3063469 [marine metag...  52.0    2e-05 
gb|ECS09634.1|  hypothetical protein GOS_6064936 [marine metag...  52.0    2e-05 
gb|ECG36931.1|  hypothetical protein GOS_6054134 [marine metag...  52.0    2e-05 
gb|EBP45029.1|  hypothetical protein GOS_7909450 [marine metag...  52.0    2e-05 
gb|EBC79664.1|  hypothetical protein GOS_12906 [marine metagen...  52.0    2e-05 
gb|EBT10042.1|  hypothetical protein GOS_7328960 [marine metag...  52.0    2e-05 
gb|ECZ15369.1|  hypothetical protein GOS_2232709 [marine metag...  52.0    2e-05 
gb|ECE72111.1|  hypothetical protein GOS_5560832 [marine metag...  51.6    2e-05 
gb|EBA53816.1|  hypothetical protein GOS_9137 [marine metagenome]  51.6    2e-05 
gb|EBX34373.1|  hypothetical protein GOS_6602411 [marine metag...  51.6    2e-05 
gb|EBH87328.1|  hypothetical protein GOS_9187801 [marine metag...  51.6    2e-05 
gb|ECI84253.1|  hypothetical protein GOS_3209968 [marine metag...  51.2    3e-05 
gb|ECZ76250.1|  hypothetical protein GOS_2125362 [marine metag...  51.2    3e-05 
gb|EDG30803.1|  hypothetical protein GOS_804347 [marine metage...  51.2    3e-05 
gb|ECO75233.1|  hypothetical protein GOS_4198078 [marine metag...  51.2    3e-05 
gb|ECH04715.1|  hypothetical protein GOS_3340665 [marine metag...  51.2    3e-05 
gb|ECL51746.1|  hypothetical protein GOS_3089448 [marine metag...  50.8    3e-05 
gb|EBJ86821.1|  hypothetical protein GOS_8826606 [marine metag...  50.8    3e-05 
gb|EBZ61235.1|  hypothetical protein GOS_4631525 [marine metag...  50.8    4e-05 
gb|EBF42807.1|  hypothetical protein GOS_9599336 [marine metag...  50.8    4e-05 
gb|ECN16545.1|  hypothetical protein GOS_3521211 [marine metag...  50.8    4e-05 
gb|EBZ79846.1|  hypothetical protein GOS_3912423 [marine metag...  50.8    4e-05 
gb|EBZ19921.1|  hypothetical protein GOS_6319503 [marine metag...  50.4    4e-05 
gb|EDH35802.1|  hypothetical protein GOS_620299 [marine metage...  50.4    5e-05 
gb|EDI01451.1|  hypothetical protein GOS_503249 [marine metage...  50.4    5e-05 
gb|ECL73988.1|  hypothetical protein GOS_5682273 [marine metag...  50.4    5e-05 
gb|EBH64761.1|  hypothetical protein GOS_9226740 [marine metag...  50.4    5e-05 
gb|EDC52234.1|  hypothetical protein GOS_1461542 [marine metag...  50.4    5e-05 
gb|ECR77231.1|  hypothetical protein GOS_3856721 [marine metag...  50.1    6e-05 
gb|EBC74226.1|  hypothetical protein GOS_21289 [marine metagen...  50.1    6e-05 
gb|EBJ46116.1|  hypothetical protein GOS_8894567 [marine metag...  50.1    6e-05 
gb|EBC58430.1|  hypothetical protein GOS_46384 [marine metagen...  50.1    6e-05 
gb|EDH23211.1|  hypothetical protein GOS_642576 [marine metage...  50.1    6e-05 
gb|ECA91929.1|  hypothetical protein GOS_6444055 [marine metag...  50.1    7e-05 
gb|EDI78340.1|  hypothetical protein GOS_375353 [marine metage...  50.1    7e-05 
gb|ECI30966.1|  hypothetical protein GOS_5306059 [marine metag...  49.7    7e-05 
gb|EBQ74415.1|  hypothetical protein GOS_7704484 [marine metag...  49.7    7e-05 
gb|ECR02010.1|  hypothetical protein GOS_3358575 [marine metag...  49.7    8e-05 
gb|EBF28872.1|  hypothetical protein GOS_9622275 [marine metag...  49.7    8e-05 
gb|EBQ62136.1|  hypothetical protein GOS_7723425 [marine metag...  49.7    8e-05 
gb|EDI72702.1|  hypothetical protein GOS_384434 [marine metage...  49.7    8e-05 
gb|EDH25110.1|  hypothetical protein GOS_639066 [marine metage...  49.7    8e-05 
gb|EBQ44697.1|  hypothetical protein GOS_7750330 [marine metag...  49.7    8e-05 
gb|EDD47724.1|  hypothetical protein GOS_1297929 [marine metag...  49.7    8e-05 
gb|ECO47615.1|  hypothetical protein GOS_5280885 [marine metag...  49.7    8e-05 
gb|ECL44735.1|  hypothetical protein GOS_3358441 [marine metag...  49.7    9e-05 
gb|EDF20943.1|  hypothetical protein GOS_995663 [marine metage...  49.7    9e-05 
gb|EBM20178.1|  hypothetical protein GOS_8447110 [marine metag...  49.3    1e-04 
gb|ECH87173.1|  hypothetical protein GOS_3543989 [marine metag...  49.3    1e-04 
gb|EBB07607.1|  hypothetical protein GOS_293120 [marine metage...  49.3    1e-04 
gb|ECU17695.1|  hypothetical protein GOS_3041221 [marine metag...  49.3    1e-04 
gb|EBF11040.1|  hypothetical protein GOS_9651174 [marine metag...  49.3    1e-04 
gb|ECF30372.1|  hypothetical protein GOS_3267757 [marine metag...  49.3    1e-04 
gb|EBP08194.1|  hypothetical protein GOS_7970293 [marine metag...  49.3    1e-04 
gb|ECA61898.1|  hypothetical protein GOS_4135397 [marine metag...  49.3    1e-04 
gb|EBQ79615.1|  hypothetical protein GOS_7696542 [marine metag...  49.3    1e-04 
gb|ECY21907.1|  hypothetical protein GOS_2396905 [marine metag...  48.9    1e-04 
gb|ECB22056.1|  hypothetical protein GOS_5222253 [marine metag...  48.9    1e-04 
gb|ECC52676.1|  hypothetical protein GOS_3563073 [marine metag...  48.9    1e-04 
gb|EBF94177.1|  hypothetical protein GOS_9515562 [marine metag...  48.9    1e-04 
gb|ECY33168.1|  hypothetical protein GOS_2377841 [marine metag...  48.9    1e-04 
gb|EBC59564.1|  hypothetical protein GOS_44631 [marine metagen...  48.9    1e-04 
gb|EBK19866.1|  hypothetical protein GOS_8772099 [marine metag...  48.9    1e-04 
gb|ECI63942.1|  hypothetical protein GOS_4011769 [marine metag...  48.9    1e-04 
gb|EBW42853.1|  hypothetical protein GOS_6748473 [marine metag...  48.9    1e-04 
gb|ECX82124.1|  hypothetical protein GOS_2469718 [marine metag...  48.9    1e-04 
gb|EBQ17752.1|  hypothetical protein GOS_7791197 [marine metag...  48.9    1e-04 
gb|EBN05405.1|  hypothetical protein GOS_8309482 [marine metag...  48.9    1e-04 
gb|EBI80070.1|  hypothetical protein GOS_9031785 [marine metag...  48.5    2e-04 
gb|ECT94815.1|  hypothetical protein GOS_3947816 [marine metag...  48.5    2e-04 
gb|ECS84450.1|  hypothetical protein GOS_3104728 [marine metag...  48.5    2e-04 
gb|EBY63173.1|  hypothetical protein GOS_5063305 [marine metag...  48.5    2e-04 
gb|EBJ12401.1|  hypothetical protein GOS_8976793 [marine metag...  48.5    2e-04 
gb|ECU45738.1|  hypothetical protein GOS_4290975 [marine metag...  48.5    2e-04 
gb|EBZ12771.1|  hypothetical protein GOS_3116420 [marine metag...  48.1    2e-04 
gb|ECD18231.1|  hypothetical protein GOS_4440848 [marine metag...  48.1    2e-04 
gb|ECU09663.1|  hypothetical protein GOS_3359000 [marine metag...  48.1    2e-04 
gb|EBC74228.1|  hypothetical protein GOS_21291 [marine metagen...  48.1    2e-04 
gb|ECE53015.1|  hypothetical protein GOS_6332951 [marine metag...  48.1    2e-04 
gb|ECJ50691.1|  hypothetical protein GOS_4060821 [marine metag...  48.1    2e-04 
gb|ECV20323.1|  hypothetical protein GOS_2944482 [marine metag...  48.1    3e-04 
gb|ECO95086.1|  hypothetical protein GOS_3435589 [marine metag...  48.1    3e-04 
gb|ECA33713.1|  hypothetical protein GOS_5250164 [marine metag...  48.1    3e-04 
gb|ECQ54270.1|  hypothetical protein GOS_5238140 [marine metag...  48.1    3e-04 
gb|ECR31364.1|  hypothetical protein GOS_5683390 [marine metag...  47.8    3e-04 
gb|EDC55226.1|  hypothetical protein GOS_1456217 [marine metag...  47.8    3e-04 
gb|EBJ34566.1|  hypothetical protein GOS_8913756 [marine metag...  47.8    3e-04 
gb|EBQ06591.1|  hypothetical protein GOS_7808590 [marine metag...  47.8    3e-04 
gb|ECC28311.1|  hypothetical protein GOS_4500562 [marine metag...  47.8    4e-04 
gb|ECF24308.1|  hypothetical protein GOS_3507433 [marine metag...  47.4    4e-04 
gb|EBA53833.1|  hypothetical protein GOS_5087 [marine metagenome]  47.4    4e-04 
gb|ECS85756.1|  hypothetical protein GOS_3053708 [marine metag...  47.4    4e-04 
gb|EBO17778.1|  hypothetical protein GOS_8123634 [marine metag...  47.4    5e-04 
gb|EBB80936.1|  hypothetical protein GOS_172280 [marine metage...  47.0    5e-04 
gb|ECI02698.1|  hypothetical protein GOS_6437329 [marine metag...  47.0    6e-04 
gb|ECZ03595.1|  hypothetical protein GOS_2252898 [marine metag...  47.0    6e-04 
gb|ECL21424.1|  hypothetical protein GOS_4266440 [marine metag...  47.0    6e-04 
gb|EBI57557.1|  hypothetical protein GOS_9069482 [marine metag...  47.0    6e-04 
gb|ECY24190.1|  hypothetical protein GOS_2393207 [marine metag...  47.0    6e-04 
gb|ECO27353.1|  hypothetical protein GOS_6132510 [marine metag...  46.6    6e-04 
gb|ECH98077.1|  hypothetical protein GOS_3133142 [marine metag...  46.6    7e-04 
gb|ECP90126.1|  hypothetical protein GOS_4252935 [marine metag...  46.6    8e-04 
gb|ECI52499.1|  hypothetical protein GOS_4450177 [marine metag...  46.2    8e-04 
gb|ECM05537.1|  hypothetical protein GOS_4408269 [marine metag...  46.2    0.001 
gb|ECK22481.1|  hypothetical protein GOS_4687397 [marine metag...  46.2    0.001 
gb|EBQ15130.1|  hypothetical protein GOS_7795123 [marine metag...  46.2    0.001 
gb|EDF07137.1|  hypothetical protein GOS_1020065 [marine metag...  46.2    0.001 
gb|EDI30271.1|  hypothetical protein GOS_455866 [marine metage...  45.8    0.001 
gb|ECS81854.1|  hypothetical protein GOS_3209217 [marine metag...  45.8    0.001 
gb|EBV94293.1|  hypothetical protein GOS_6826168 [marine metag...  45.8    0.001 
gb|EDI71104.1|  hypothetical protein GOS_386984 [marine metage...  45.4    0.001 
gb|EBV57911.1|  hypothetical protein GOS_6883424 [marine metag...  45.4    0.001 
gb|ECY32069.1|  hypothetical protein GOS_2379663 [marine metag...  45.4    0.001 
gb|EDI79867.1|  hypothetical protein GOS_373041 [marine metage...  45.4    0.001 
gb|EBX72419.1|  hypothetical protein GOS_6542802 [marine metag...  45.4    0.002 
gb|EBQ33946.1|  hypothetical protein GOS_7766170 [marine metag...  45.4    0.002 
gb|EBQ06590.1|  hypothetical protein GOS_7808589 [marine metag...  45.4    0.002 
gb|EBX54727.1|  hypothetical protein GOS_6570373 [marine metag...  45.4    0.002 
gb|ECF35399.1|  hypothetical protein GOS_3097236 [marine metag...  45.4    0.002 
gb|EDI76719.1|  hypothetical protein GOS_378034 [marine metage...  45.4    0.002 
gb|EDF59598.1|  hypothetical protein GOS_927354 [marine metage...  45.4    0.002 
gb|EBT98691.1|  hypothetical protein GOS_7184160 [marine metag...  45.4    0.002 
gb|EBK54297.1|  hypothetical protein GOS_8715293 [marine metag...  45.4    0.002 
gb|EDD12939.1|  hypothetical protein GOS_1354376 [marine metag...  45.1    0.002 
gb|EDI25974.1|  hypothetical protein GOS_462420 [marine metage...  44.7    0.002 
gb|ECK58375.1|  hypothetical protein GOS_3299056 [marine metag...  44.7    0.002 
gb|ECV57283.1|  hypothetical protein GOS_2873585 [marine metag...  44.7    0.003 
gb|ECY41003.1|  hypothetical protein GOS_2365211 [marine metag...  44.7    0.003 
gb|ECO55260.1|  hypothetical protein GOS_4974999 [marine metag...  44.7    0.003 
gb|EDI20221.1|  hypothetical protein GOS_471411 [marine metage...  44.3    0.003 
gb|ECF60342.1|  hypothetical protein GOS_5585385 [marine metag...  44.3    0.003 
gb|EBQ92526.1|  hypothetical protein GOS_7676960 [marine metag...  44.3    0.003 
gb|ECE25507.1|  hypothetical protein GOS_3693719 [marine metag...  44.3    0.004 
gb|ECG75707.1|  hypothetical protein GOS_4469859 [marine metag...  44.3    0.004 
gb|ECA59077.1|  hypothetical protein GOS_4246460 [marine metag...  44.3    0.004 
gb|ECE17424.1|  hypothetical protein GOS_4005227 [marine metag...  44.3    0.004 
gb|EBA55801.1|  hypothetical protein GOS_8565 [marine metagenome]  43.9    0.004 
gb|ECE35819.1|  hypothetical protein GOS_3281331 [marine metag...  43.9    0.005 
gb|EDI25975.1|  hypothetical protein GOS_462421 [marine metage...  43.9    0.005 
gb|ECJ41737.1|  hypothetical protein GOS_4401947 [marine metag...  43.9    0.005 
gb|ECB51676.1|  hypothetical protein GOS_4060990 [marine metag...  43.5    0.005 
gb|ECL32950.1|  hypothetical protein GOS_3833517 [marine metag...  43.5    0.006 
gb|ECK62229.1|  hypothetical protein GOS_3155220 [marine metag...  43.5    0.006 
gb|ECK58745.1|  hypothetical protein GOS_3285625 [marine metag...  43.1    0.007 
gb|EBQ83830.1|  hypothetical protein GOS_7690135 [marine metag...  43.1    0.007 
gb|EBV03734.1|  hypothetical protein GOS_6966398 [marine metag...  43.1    0.008 
gb|ECH73021.1|  hypothetical protein GOS_4097798 [marine metag...  43.1    0.008 
gb|EBB79425.1|  hypothetical protein GOS_174924 [marine metage...  43.1    0.009 
gb|ECL62110.1|  hypothetical protein GOS_6172642 [marine metag...  42.7    0.009 
gb|ECC58127.1|  hypothetical protein GOS_3350303 [marine metag...  42.7    0.010 
gb|EBQ54209.1|  hypothetical protein GOS_7735437 [marine metag...  42.7    0.011 
gb|ECT81377.1|  hypothetical protein GOS_4457971 [marine metag...  42.4    0.012 
gb|ECC66413.1|  hypothetical protein GOS_3031177 [marine metag...  42.4    0.012 
gb|ECL97168.1|  hypothetical protein GOS_4747194 [marine metag...  42.4    0.013 
gb|ECL83957.1|  hypothetical protein GOS_5277294 [marine metag...  42.4    0.015 
gb|ECE36820.1|  hypothetical protein GOS_3245178 [marine metag...  42.0    0.016 
gb|EBR97701.1|  hypothetical protein GOS_7510795 [marine metag...  42.0    0.017 
gb|EBV71922.1|  hypothetical protein GOS_6861066 [marine metag...  42.0    0.019 
gb|ECJ48415.1|  hypothetical protein GOS_4142850 [marine metag...  41.6    0.021 
gb|EBX89300.1|  hypothetical protein GOS_6516549 [marine metag...  41.6    0.022 
gb|ECY32883.1|  hypothetical protein GOS_2378293 [marine metag...  41.6    0.023 
gb|EBY29098.1|  hypothetical protein GOS_6434065 [marine metag...  41.6    0.023 
gb|EBV64910.1|  hypothetical protein GOS_6872165 [marine metag...  41.6    0.023 
gb|ECB53563.1|  hypothetical protein GOS_3989619 [marine metag...  41.6    0.024 
gb|ECN22861.1|  hypothetical protein GOS_3277612 [marine metag...  41.6    0.024 
gb|ECL31043.1|  hypothetical protein GOS_3906772 [marine metag...  41.6    0.024 
gb|EBO14569.1|  hypothetical protein GOS_8129042 [marine metag...  41.6    0.025 
gb|EBM48654.1|  hypothetical protein GOS_8402117 [marine metag...  41.6    0.026 
gb|EBZ93847.1|  hypothetical protein GOS_3364732 [marine metag...  41.2    0.027 
gb|EBC56208.1|  hypothetical protein GOS_50002 [marine metagen...  41.2    0.029 
gb|EBP71240.1|  hypothetical protein GOS_7866704 [marine metag...  41.2    0.033 
gb|EBF73167.1|  hypothetical protein GOS_9549867 [marine metag...  40.8    0.040 
gb|ECA05706.1|  hypothetical protein GOS_6394462 [marine metag...  40.8    0.043 
gb|EBN05477.1|  hypothetical protein GOS_8309363 [marine metag...  40.4    0.044 
gb|ECL10052.1|  hypothetical protein GOS_4715540 [marine metag...  40.4    0.048 
gb|ECJ64229.1|  hypothetical protein GOS_3533860 [marine metag...  40.4    0.051 
gb|EBV65977.1|  hypothetical protein GOS_6870468 [marine metag...  40.4    0.053 
gb|EBM16506.1|  hypothetical protein GOS_8452757 [marine metag...  40.0    0.063 
gb|ECD50361.1|  hypothetical protein GOS_3202061 [marine metag...  40.0    0.065 
gb|EBV78798.1|  hypothetical protein GOS_6850679 [marine metag...  40.0    0.066 
gb|EDI02037.1|  hypothetical protein GOS_502300 [marine metage...  40.0    0.067 
gb|ECC42595.1|  hypothetical protein GOS_3952615 [marine metag...  40.0    0.072 
gb|EDF21109.1|  hypothetical protein GOS_995372 [marine metage...  40.0    0.074 
gb|EBX24013.1|  hypothetical protein GOS_6619055 [marine metag...  39.7    0.076 
gb|ECR57052.1|  hypothetical protein GOS_4638058 [marine metag...  39.7    0.079 
gb|EDJ30593.1|  hypothetical protein GOS_1718958 [marine metag...  39.7    0.081 
gb|EDE00714.1|  hypothetical protein GOS_1206379 [marine metag...  39.7    0.095 
gb|ECI80290.1|  hypothetical protein GOS_3357137 [marine metag...  39.3    0.11  
gb|EDI23578.1|  hypothetical protein GOS_466127 [marine metage...  39.3    0.11  
gb|EBL53005.1|  hypothetical protein GOS_8556422 [marine metag...  39.3    0.11  
gb|EBA62758.1|  hypothetical protein GOS_578 [marine metagenome]   39.3    0.11  
gb|ECM71110.1|  hypothetical protein GOS_5324512 [marine metag...  39.3    0.12  
gb|ECO14355.1|  hypothetical protein GOS_3154837 [marine metag...  39.3    0.13  
gb|EBD84828.1|  hypothetical protein GOS_9862495 [marine metag...  38.9    0.14  
gb|EDJ10798.1|  hypothetical protein GOS_1753547 [marine metag...  38.9    0.14  
gb|EBC65144.1|  hypothetical protein GOS_35455 [marine metagen...  38.9    0.14  
gb|EDE33180.1|  hypothetical protein GOS_1149056 [marine metag...  38.9    0.15  
gb|EDF61955.1|  hypothetical protein GOS_923461 [marine metage...  38.9    0.15  
gb|ECH05405.1|  hypothetical protein GOS_3315345 [marine metag...  38.9    0.15  
gb|EDG90605.1|  hypothetical protein GOS_700343 [marine metage...  38.9    0.16  
gb|ECU71726.1|  hypothetical protein GOS_3276940 [marine metag...  38.5    0.20  
gb|EBQ58919.1|  hypothetical protein GOS_7728363 [marine metag...  38.1    0.23  
gb|ECH78678.1|  hypothetical protein GOS_3883468 [marine metag...  38.1    0.24  
gb|EBQ39593.1|  hypothetical protein GOS_7757854 [marine metag...  38.1    0.26  
gb|EBB01098.1|  hypothetical protein GOS_303750 [marine metage...  37.7    0.30  
gb|EDH57437.1|  hypothetical protein GOS_580836 [marine metage...  37.7    0.36  
gb|EBH07060.1|  hypothetical protein GOS_9325689 [marine metag...  37.4    0.37  
gb|EDE30162.1|  hypothetical protein GOS_1154391 [marine metag...  37.4    0.39  
gb|ECS43734.1|  hypothetical protein GOS_4707944 [marine metag...  37.4    0.39  
gb|EDE48981.1|  hypothetical protein GOS_1121832 [marine metag...  37.4    0.39  
gb|ECQ85367.1|  hypothetical protein GOS_4012359 [marine metag...  37.4    0.41  
gb|EBQ29828.1|  hypothetical protein GOS_7772297 [marine metag...  37.4    0.45  
gb|EBL72475.1|  hypothetical protein GOS_8524583 [marine metag...  37.0    0.50  
gb|EDJ24238.1|  hypothetical protein GOS_1730226 [marine metag...  37.0    0.51  
gb|ECG11666.1|  hypothetical protein GOS_3567320 [marine metag...  37.0    0.53  
gb|EDG55943.1|  hypothetical protein GOS_761054 [marine metage...  36.6    0.64  
gb|ECN91610.1|  hypothetical protein GOS_4045949 [marine metag...  36.6    0.64  
gb|EBQ40545.1|  hypothetical protein GOS_7756527 [marine metag...  36.6    0.64  
gb|EDE23960.1|  hypothetical protein GOS_1165396 [marine metag...  36.2    0.92  
gb|ECY30155.1|  hypothetical protein GOS_2382816 [marine metag...  36.2    0.97  
gb|ECU10896.1|  hypothetical protein GOS_3312185 [marine metag...  35.8    1.2   
gb|ECX20090.1|  hypothetical protein GOS_2581104 [marine metag...  35.4    1.4   
gb|EBB03845.1|  hypothetical protein GOS_299134 [marine metage...  35.4    1.5   
gb|EBE73783.1|  hypothetical protein GOS_9713503 [marine metag...  35.4    1.7   
gb|EDI47906.1|  hypothetical protein GOS_425783 [marine metage...  35.4    1.7   
gb|EDI21189.1|  hypothetical protein GOS_469883 [marine metage...  35.0    2.0   
gb|EBX60025.1|  hypothetical protein GOS_6562062 [marine metag...  35.0    2.0   
gb|EBF03180.1|  hypothetical protein GOS_9663959 [marine metag...  35.0    2.2   
gb|EBD97241.1|  hypothetical protein GOS_9841982 [marine metag...  34.7    2.7   
gb|EDI22871.1|  hypothetical protein GOS_467200 [marine metage...  34.7    3.1   
gb|EDD80419.1|  hypothetical protein GOS_1241778 [marine metag...  34.3    3.8   
gb|EBQ37048.1|  hypothetical protein GOS_7761601 [marine metag...  34.3    3.9   
gb|EDI08376.1|  hypothetical protein GOS_491400 [marine metage...  34.3    4.0   
gb|ECG99460.1|  hypothetical protein GOS_3541103 [marine metag...  33.9    4.7   
gb|ECP91362.1|  hypothetical protein GOS_4203748 [marine metag...  33.5    5.9   
gb|EDD37363.1|  hypothetical protein GOS_1315369 [marine metag...  33.5    6.0   
gb|ECY35063.1|  hypothetical protein GOS_2374854 [marine metag...  33.5    6.5   
gb|EBC29939.1|  hypothetical protein GOS_92627 [marine metagen...  33.5    6.7   
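
The hit summary above can be filtered by E-value without reading every line by hand. The following is a minimal sketch, assuming the lines keep the layout shown above (identifier, description, bit score, E-value); the names `HIT_LINE`, `parse_hits` and `filter_by_evalue` and the 1e-3 cut-off are illustrative choices, not part of the BLAST report or of the annotation protocol.

```python
import re

# One hit line: identifier, free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\d+(?:\.\d+)?(?:e[+-]?\d+)?)\s*$"
)


def parse_hits(lines):
    """Return (identifier, bit_score, e_value) for every line that looks like a hit."""
    hits = []
    for line in lines:
        match = HIT_LINE.match(line)
        if match:
            ident, _description, score, evalue = match.groups()
            hits.append((ident, float(score), float(evalue)))
    return hits


def filter_by_evalue(hits, max_evalue=1e-3):
    """Keep hits whose E-value is at or below the chosen (assumed) threshold."""
    return [hit for hit in hits if hit[2] <= max_evalue]


# Three lines copied from the summary above.
sample = [
    "gb|EBV33417.1|  hypothetical protein GOS_6922163 [marine metag...  53.9    4e-06 ",
    "gb|ECM05537.1|  hypothetical protein GOS_4408269 [marine metag...  46.2    0.001 ",
    "gb|EBC29939.1|  hypothetical protein GOS_92627 [marine metagen...  33.5    6.7   ",
]

hits = parse_hits(sample)
kept = filter_by_evalue(hits)
for ident, score, evalue in kept:
    print(ident, score, evalue)
```

With the assumed cut-off of 1e-3, the first two sample lines are kept and the last one (E-value 6.7) is dropped. A different threshold can be passed as `max_evalue`.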

ALIGNMENTS
>gb|ECQ56051.1| hypothetical protein GOS_5169218 [marine metagenome]
Length=279

 Score =  512 bits (1319),  Expect = 4e-144, Method: Compositional matrix adjust.
 Identities = 277/277 (100%), Positives = 277/277 (100%), Gaps = 0/277 (0%)

Query  1    TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI  60
            TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI
Sbjct  1    TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI  60

Query  61   DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT  120
            DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT
Sbjct  61   DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT  120

Query  121  ESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTT  180
            ESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTT
Sbjct  121  ESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTT  180

Query  181  GNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAER  240
            GNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAER
Sbjct  181  GNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAER  240

Query  241  IQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTL  277
            IQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTL
Sbjct  241  IQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTL  277


 Score =  210 bits (534),  Expect = 3e-53, Method: Compositional matrix adjust.
 Identities = 109/147 (74%), Positives = 130/147 (88%), Gaps = 0/147 (0%)

Query  134  TLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSAISAGEGI  193
            TLDF+V D  ITL GD++GSAT+TNLGD T++ TI ANS+ALGTDTTGN+V+ ++AGEGI
Sbjct  1    TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI  60

Query  194  DVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIVGAMFSSNT  253
            DVSG GSE AT+T+SAEDAT SNKGIASFD+TDFTVSSG VTVNAER+QDIVGAM  SNT
Sbjct  61   DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT  120

Query  254  ESGISVTYEDSDGTIDLDVSDPTLSLQ  280
            ESGI+VTYEDSDGT+D +V+DP ++L 
Sbjct  121  ESGITVTYEDSDGTLDFNVADPVITLS  147
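
The two alignments reported for ECQ56051.1 cover query positions 1-277 and 134-280, the second one matching subject positions 1-147. Their statistics lines can be turned into numbers for comparison. This is a small sketch, assuming the statistics line has the format printed above; `STAT` and `alignment_stats` are hypothetical helper names.

```python
import re

# Matches e.g. "Identities = 109/147"; the percentage in parentheses is recomputed.
STAT = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def alignment_stats(line):
    """Map each statistic name to (count, aligned_length, percent)."""
    stats = {}
    for name, count, total in STAT.findall(line):
        count, total = int(count), int(total)
        stats[name] = (count, total, 100.0 * count / total)
    return stats


# Statistics line of the second alignment, copied from the report above.
line = " Identities = 109/147 (74%), Positives = 130/147 (88%), Gaps = 0/147 (0%)"
stats = alignment_stats(line)
for name, (count, total, percent) in stats.items():
    print(f"{name}: {count}/{total} = {percent:.1f}%")
```

The recomputed percentages agree with the rounded values in the report (74% identities, 88% positives, 0% gaps).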


>gb|EBI55730.1| hypothetical protein GOS_9072598 [marine metagenome]
Length=441

 Score =  508 bits (1307),  Expect = 9e-143, Method: Compositional matrix adjust.
 Identities = 273/279 (97%), Positives = 276/279 (98%), Gaps = 0/279 (0%)

Query  1    TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI  60
            TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATI ANSVALGTDTTGNF+ADLTAGEGI
Sbjct  39   TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATIAANSVALGTDTTGNFIADLTAGEGI  98

Query  61   DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT  120
            DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT
Sbjct  99   DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT  158

Query  121  ESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTT  180
            ESGI+VTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTT
Sbjct  159  ESGISVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTT  218

Query  181  GNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAER  240
            GNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSS DVTVNAER
Sbjct  219  GNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSRDVTVNAER  278

------------------------------------------------------------------------------------------------------

d) BLASTx versus SWISSPROT

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

sp|P18305.2|Y43R_IIV6  RecName: Full=Uncharacterized protein 443R  72.8    3e-12
sp|Q5SSG8.1|MUC21_HUMAN  RecName: Full=Mucin-21; Short=MUC-21;...  63.5    2e-09
sp|Q6GDE9.1|SRAP_STAAR  RecName: Full=Serine-rich adhesin for ...  63.2    2e-09
sp|Q868Z9.2|PPN_DROME  RecName: Full=Papilin; Flags: Precursor     62.4    4e-09
sp|Q6G620.1|SRAP_STAAS  RecName: Full=Serine-rich adhesin for ...  62.0    5e-09
sp|Q8NUJ3.1|SRAP_STAAW  RecName: Full=Serine-rich adhesin for ...  62.0    5e-09
sp|Q8VQ99.1|SRAP_STAAU  RecName: Full=Serine-rich adhesin for ...  60.5    1e-08
sp|Q7A362.1|SRAP_STAAN  RecName: Full=Serine-rich adhesin for ...  60.1    2e-08
sp|Q4L9P0.1|SRAP_STAHJ  RecName: Full=Serine-rich adhesin for ...  59.3    3e-08
sp|Q2FUW1.1|SRAP_STAA8  RecName: Full=Serine-rich adhesin for ...  58.2    7e-08
sp|Q2FDK5.1|SRAP_STAA3  RecName: Full=Serine-rich adhesin for ...  58.2    7e-08
sp|Q5HCP3.1|SRAP_STAAC  RecName: Full=Serine-rich adhesin for ...  58.2    7e-08
sp|P35828.4|SLAP_CAUCR  RecName: Full=S-layer protein; AltName...  57.0    1e-07
sp|P33666.2|YDBA_ECOLI  RecName: Full=Putative uncharacterized...  55.8    3e-07
sp|Q8QZQ8.1|261R_IIV6  RecName: Full=Uncharacterized protein 261R  55.1    6e-07
sp|P25062.1|CSG_HALVO  RecName: Full=Cell surface glycoprotein...  55.1    6e-07
sp|Q9P6S0.3|YJP1_SCHPO  RecName: Full=Putative cell agglutinat...  53.5    2e-06
sp|P50401.1|GUXA_CELFI  RecName: Full=Exoglucanase A; AltName:...  53.1    2e-06
sp|P76347.3|YEEJ_ECOLI  RecName: Full=Uncharacterized protein ...  52.4    4e-06
sp|Q9JMS3.1|YUAQ_ECOLI  RecName: Full=Uncharacterized protein ...  52.4    4e-06
sp|Q196W9.1|VF396_IIV3  RecName: Full=Uncharacterized protein ...  51.2    8e-06
sp|Q8TGE1.1|AWA1_YEAST  RecName: Full=Cell wall protein AWA1; ...  50.8    1e-05
sp|Q8X8V7.2|YEEJ_ECO57  RecName: Full=Uncharacterized protein ...  50.4    1e-05
sp|Q18DN4.1|HMU_HALWD  RecName: Full=Halomucin; Flags: Precursor   50.1    2e-05
sp|Q7Z5P9.2|MUC19_HUMAN  RecName: Full=Mucin-19; Short=MUC-19;...  47.8    9e-05
sp|P08399.2|PHXR5_MOUSE  RecName: Full=Putative per-hexamer re...  47.8    9e-05
sp|Q12459.2|PRM7_YEAST  RecName: Full=Pheromone-regulated prot...  47.4    1e-04
sp|Q9JMS5.1|YUAO_ECOLI  RecName: Full=Uncharacterized protein ...  47.4    1e-04
sp|P28968.1|GP2_EHV1B  RecName: Full=Glycoprotein gp2; AltName...  47.4    1e-04
sp|Q08860.1|FLIC_SHIFL  RecName: Full=Flagellin                    47.0    2e-04
sp|Q54KD5.1|Y7399_DICDI  RecName: Full=P17/29C-like protein DD...  47.0    2e-04
sp|P29760.1|AMYI_YEAST  RecName: Full=Glucoamylase S2; AltName...  46.6    2e-04
sp|P04065.2|AMYH_YEAST  RecName: Full=Glucoamylase S1; AltName...  45.4    4e-04
sp|Q02910.2|CPN_DROME  RecName: Full=Calphotin                     45.1    6e-04
sp|P25927.2|BIGA_SALTY  RecName: Full=Putative surface-exposed...  44.3    0.001
sp|Q9C0Y2.1|YKL1_SCHPO  RecName: Full=Putative cell agglutinat...  43.9    0.001
sp|Q86S05.1|LIG_DROME  RecName: Full=Protein lingerer              43.5    0.002
sp|P50899.1|GUXB_CELFI  RecName: Full=Exoglucanase B; AltName:...  43.5    0.002
sp|Q52657.2|OMPA_RICCN  RecName: Full=Outer membrane protein A...  43.5    0.002
sp|Q4WLB9.1|YA090_ASPFU  RecName: Full=Uncharacterized protein...  43.5    0.002
sp|Q54GV8.1|Y8625_DICDI  RecName: Full=Uncharacterized transme...  42.4    0.004
sp|Q8TFG9.2|YL61_SCHPO  RecName: Full=Uncharacterized serine/t...  42.0    0.005
sp|P40889.2|YJW5_YEAST  RecName: Full=Y' element ATP-dependent...  42.0    0.005
sp|P40434.1|YIR7_YEAST  RecName: Full=Y' element ATP-dependent...  42.0    0.005
sp|Q9Z6U5.1|PMP21_CHLPN  RecName: Full=Probable outer membrane...  41.6    0.006
sp|P07897.2|PGCA_RAT  RecName: Full=Aggrecan core protein; Alt...  41.6    0.006
sp|Q1EAR5.2|CHI2_COCIM  RecName: Full=Endochitinase 2; Flags: ...  40.8    0.011
sp|Q9Z812.2|PMP20_CHLPN  RecName: Full=Probable outer membrane...  40.8    0.011
sp|P35827.2|SLAP_CAMFE  RecName: Full=S-layer protein; AltName...  40.8    0.011
sp|P08640.2|MUC1_YEAST  RecName: Full=Flocculation protein FLO...  40.4    0.014
sp|P24216.5|FLID_ECOLI  RecName: Full=Flagellar hook-associate...  40.4    0.014
sp|Q8T2I5.1|LYSG3_DICDI  RecName: Full=Probable GH family 25 l...  40.4    0.014
sp|C7G046.1|Y6969_DICDI  RecName: Full=von Willebrand factor A...  40.0    0.019
sp|Q6S6W0.1|GP2_EHV1V  RecName: Full=Glycoprotein gp2; AltName...  40.0    0.019
sp|Q9NZW4.2|DSPP_HUMAN  RecName: Full=Dentin sialophosphoprote...  39.7    0.025
sp|P47033.1|PRY3_YEAST  RecName: Full=Cell wall protein PRY3; ...  39.7    0.025
sp|Q9C105.1|YKT4_SCHPO  RecName: Full=Chitinase-like protein P...  39.3    0.032
sp|Q05164.2|HPF1_YEAST  RecName: Full=Haze protective factor 1...  39.3    0.032
sp|Q6GIK4.1|CLFA_STAAR  RecName: Full=Clumping factor A; AltNa...  39.3    0.032
sp|Q9XD84.1|TIBA_ECOLX  RecName: Full=Adhesin/invasin tibA aut...  39.3    0.032
sp|P50400.1|GUND_CELFI  RecName: Full=Endoglucanase D; AltName...  39.3    0.032
sp|P22698.1|SPG7_DICDI  RecName: Full=Spore germination protei...  39.3    0.032
sp|Q9Y520.2|BA2L2_HUMAN  RecName: Full=Protein BAT2-like 2; Al...  38.9    0.042
sp|Q09624.3|LOV1_CAEEL  RecName: Full=Location of vulva defect...  38.9    0.042
sp|P58297.2|FLID_ECO57  RecName: Full=Flagellar hook-associate...  38.9    0.042
sp|Q04433.1|FIT1_YEAST  RecName: Full=Facilitator of iron tran...  38.9    0.042
sp|P15921.1|OMPA_RICRI  RecName: Full=Outer membrane protein A...  38.9    0.042
sp|P03764.2|STF_LAMBD  RecName: Full=Side tail fiber protein       38.5    0.055
sp|Q04893.1|YM96_YEAST  RecName: Full=Uncharacterized protein ...  38.5    0.055
sp|P18127.1|ICEN_XANCT  RecName: Full=Ice nucleation protein       38.5    0.055
sp|Q6ZRS2.2|SRCAP_HUMAN  RecName: Full=Helicase SRCAP; AltName...  38.1    0.072
sp|Q6FT26.1|DSE2_CANGA  RecName: Full=Protein DSE2; AltName: F...  37.7    0.093
sp|Q9HM69.2|CSG_HALSA  RecName: Full=Cell surface glycoprotein...  37.7    0.093
sp|P47179.1|DAN4_YEAST  RecName: Full=Cell wall protein DAN4; ...  37.7    0.093
sp|P96989.1|OMPB_RICTY  RecName: Full=Outer membrane protein B...  37.7    0.093
sp|Q9QX47.1|SON_MOUSE  RecName: Full=Protein SON                   37.7    0.093
sp|O94854.3|K0754_HUMAN  RecName: Full=Uncharacterized protein...  37.4    0.12 
sp|Q09165.2|DIG1_CAEEL  RecName: Full=Mesocentin; Flags: Precu...  37.4    0.12 
sp|P04949.2|FLIC_ECOLI  RecName: Full=Flagellin                    37.4    0.12 
sp|Q9Z899.2|PMP6_CHLPN  RecName: Full=Probable outer membrane ...  37.4    0.12 
sp|P24488.1|HAP2_SCHPO  RecName: Full=Transcriptional activato...  37.4    0.12 
sp|P53214.1|MTL1_YEAST  RecName: Full=Protein MTL1; AltName: F...  37.4    0.12 
sp|Q8CWX2.1|SYFB_STRMU  RecName: Full=Phenylalanyl-tRNA synthe...  37.0    0.16 
sp|P98088.3|MUC5A_HUMAN  RecName: Full=Mucin-5AC; Short=MUC-5A...  36.6    0.21 
sp|P15320.1|HLYA_SERMA  RecName: Full=Hemolysin; Flags: Precursor  36.6    0.21 
sp|Q09788.1|YA9A_SCHPO  RecName: Full=Uncharacterized serine-r...  36.6    0.21 
sp|Q9Z880.2|PMP18_CHLPN  RecName: Full=Probable outer membrane...  36.6    0.21 
sp|A3A8Q4.2|P2C18_ORYSJ  RecName: Full=Probable protein phosph...  36.2    0.27 
sp|Q32KG4.1|RGAG1_MOUSE  RecName: Full=Retrotransposon gag dom...  36.2    0.27 
sp|P42835.1|EGT2_YEAST  RecName: Full=Protein EGT2; AltName: F...  36.2    0.27 
sp|O88799.1|ZAN_MOUSE  RecName: Full=Zonadhesin; Flags: Precursor  36.2    0.27 
sp|P38844.1|DSE2_YEAST  RecName: Full=Protein DSE2; AltName: F...  36.2    0.27 
sp|P54681.2|RTOA_DICDI  RecName: Full=Protein rtoA; AltName: F...  36.2    0.27 
sp|A6ZXT5.2|PRM7_YEAS7  RecName: Full=Pheromone-regulated prot...  35.8    0.35 
sp|P54197.2|CHI2_COCP7  RecName: Full=Endochitinase 2; Flags: ...  35.8    0.35 
sp|P52143.3|YPJA_ECOLI  RecName: Full=Uncharacterized outer me...  35.8    0.35 
sp|P39180.3|AG43_ECOLI  RecName: Full=Antigen 43; Short=AG43; ...  35.8    0.35 
sp|P12255.4|FHAB_BORPE  RecName: Full=Filamentous hemagglutinin    35.8    0.35 
sp|P18583.3|SON_HUMAN  RecName: Full=Protein SON; AltName: Ful...  35.8    0.35 
sp|Q03155.1|AIDA_ECOLX  RecName: Full=AIDA-I autotransporter; ...  35.8    0.35 
sp|C4L8X0.1|PNP_TOLAT  RecName: Full=Polyribonucleotide nucleo...  35.4    0.46 
sp|A4XBH2.1|CH602_SALTO  RecName: Full=60 kDa chaperonin 2; Al...  35.4    0.46 
sp|Q5UQ50.1|COLL6_MIMIV  RecName: Full=Collagen-like protein 6     35.4    0.46 
sp|P07856.2|SERI1_BOMMO  RecName: Full=Sericin 1; AltName: Ful...  35.4    0.46 
sp|P53278.1|YG3A_YEAST  RecName: Full=Uncharacterized protein ...  35.4    0.46 
sp|Q12140.1|BSC1_YEAST  RecName: Full=Bypass of stop codon pro...  35.4    0.46 
sp|P76072.2|STFR_ECOLI  RecName: Full=Side tail fiber protein ...  35.4    0.46 
sp|Q0RRL9.2|CH601_FRAAA  RecName: Full=60 kDa chaperonin 1; Al...  35.0    0.61 
sp|Q07591.1|EAE_CITFR  RecName: Full=Intimin; AltName: Full=At...  35.0    0.61 
sp|P46739.1|NFAA_ECOLX  RecName: Full=Non-fimbrial adhesin 1; ...  35.0    0.61 
sp|Q12218.1|TIR4_YEAST  RecName: Full=Cell wall protein TIR4; ...  35.0    0.61 
sp|P68343.1|VGP3_EBVA8  RecName: Full=Envelope glycoprotein GP...  35.0    0.61 
sp|Q8ZRF8.1|YAIT_SALTY  RecName: Full=Uncharacterized protein ...  35.0    0.61 
sp|Q07888.2|YL067_YEAST  RecName: Full=Y' element ATP-dependen...  34.7    0.79 
sp|B9E371.1|LEU1_CLOK1  RecName: Full=2-isopropylmalate syntha...  34.7    0.79 
sp|Q12215.1|WSC3_YEAST  RecName: Full=Cell wall integrity and ...  34.7    0.79 
sp|Q9KKA3.2|OMPB_RICCN  RecName: Full=Outer membrane protein B...  34.7    0.79 
sp|P11248.1|PRM2_RAT  RecName: Full=Protamine-2; AltName: Full...  34.7    0.79 
sp|A1S467.1|PNP_SHEAM  RecName: Full=Polyribonucleotide nucleo...  34.7    0.79 
sp|P97399.2|DSPP_MOUSE  RecName: Full=Dentin sialophosphoprote...  34.3    1.0  
sp|P32623.3|CRH2_YEAST  RecName: Full=Probable glycosidase CRH...  34.3    1.0  
sp|Q7VL91.1|PNP_HAEDU  RecName: Full=Polyribonucleotide nucleo...  34.3    1.0  
sp|P35829.1|SLAP_LACAC  RecName: Full=S-layer protein; AltName...  34.3    1.0  
sp|Q9ZFG9.1|ALGE7_AZOVI  RecName: Full=Poly(beta-D-mannuronate...  34.3    1.0  
sp|Q7M4S9.1|YB113_YEAST  RecName: Full=Uncharacterized protein...  33.9    1.3  
sp|Q70E73.3|RAPH1_HUMAN  RecName: Full=Ras-associated and plec...  33.9    1.3  
sp|Q9W539.4|HR4_DROME  RecName: Full=Hormone receptor 4; Short...  33.9    1.3  
sp|P36110.1|PRY2_YEAST  RecName: Full=Protein PRY2; AltName: F...  33.9    1.3  
sp|P15636.1|API_ACHLY  RecName: Full=Protease 1; AltName: Full...  33.9    1.3  
sp|P46590.1|ALS1_CANAL  RecName: Full=Agglutinin-like protein ...  33.9    1.3  
sp|P07067.1|VG37_BPT2  Long tail fiber protein p37 (Protein Gp...  33.9    1.3  
sp|P36909.1|CHIT_STRLI  RecName: Full=Chitinase C; Flags: Prec...  33.9    1.3  
sp|P40954.2|CHI3_CANAL  RecName: Full=Chitinase 3; Flags: Prec...  33.9    1.3  
sp|C3N5E1.1|TRM1_SULIA  RecName: Full=N(2),N(2)-dimethylguanos...  33.5    1.8  
sp|Q99208.2|YL066_YEAST  RecName: Full=Y' element ATP-dependen...  33.5    1.8  
sp|A8M497.1|CH602_SALAI  RecName: Full=60 kDa chaperonin 2; Al...  33.5    1.8  
sp|Q0C593.1|RECR_HYPNA  RecName: Full=Recombination protein recR   33.5    1.8  
sp|Q03099.1|YMN3_YEAST  RecName: Full=Y' element ATP-dependent...  33.5    1.8  
sp|P53345.1|YRF13_YEAST  RecName: Full=Y' element ATP-dependen...  33.5    1.8  
sp|Q9HGI7.2|ERF3_CANMA  RecName: Full=Eukaryotic peptide chain...  33.5    1.8  
sp|P53819.1|YRF16_YEAST  RecName: Full=Y' element ATP-dependen...  33.5    1.8  
sp|Q02817.2|MUC2_HUMAN  RecName: Full=Mucin-2; Short=MUC-2; Al...  33.5    1.8  
sp|Q04537.1|PER_DROSR  RecName: Full=Period circadian protein      33.5    1.8  
sp|Q9ZLX4.1|DPO3B_HELPJ  RecName: Full=DNA polymerase III subu...  33.5    1.8  
sp|P40105.2|YRF12_YEAST  RecName: Full=Y' element ATP-dependen...  33.5    1.8  
sp|A1TSY0.1|Y3513_ACIAC  RecName: Full=UPF0271 protein Aave_3513   33.1    2.3  
sp|Q8RXE5.1|WNK10_ARATH  RecName: Full=Probable serine/threoni...  33.1    2.3  
sp|Q8DBU9.1|PNP_VIBVU  RecName: Full=Polyribonucleotide nucleo...  33.1    2.3  
sp|Q685J3.1|MUC17_HUMAN  RecName: Full=Mucin-17; Short=MUC-17;...  33.1    2.3  
sp|Q9KJ75.2|PTMCB_STRMU  RecName: Full=PTS system mannitol-spe...  33.1    2.3  
sp|P42272.3|FLIC1_PROMI  RecName: Full=Flagellin 1                 33.1    2.3  
sp|Q50833.2|CSG_METVO  RecName: Full=S-layer protein; AltName:...  33.1    2.3  
sp|A5F913.1|PNP_VIBC3  RecName: Full=Polyribonucleotide nucleo...  33.1    2.3  
sp|A6W5Y6.1|CH601_KINRD  RecName: Full=60 kDa chaperonin 1; Al...  33.1    2.3  
sp|Q9KU76.1|PNP_VIBCH  RecName: Full=Polyribonucleotide nucleo...  33.1    2.3  
sp|Q91012.2|T22D1_CHICK  RecName: Full=TSC22 domain family pro...  32.7    3.0  
sp|B4F2C3.1|PNP_PROMH  RecName: Full=Polyribonucleotide nucleo...  32.7    3.0  
sp|A6U483.1|TCAA_STAA2  RecName: Full=Membrane-associated prot...  32.7    3.0  
sp|Q6DG03.1|DMTF1_DANRE  RecName: Full=Cyclin-D-binding Myb-li...  32.7    3.0  
sp|Q4K6J1.1|MURD_PSEF5  RecName: Full=UDP-N-acetylmuramoylalan...  32.7    3.0  
sp|Q6CZT3.1|PEL2_ERWCT  RecName: Full=Pectate lyase 2; AltName...  32.7    3.0  
sp|Q6G6W4.1|TCAA_STAAS  RecName: Full=Membrane-associated prot...  32.7    3.0  
sp|Q6GE78.1|TCAA_STAAR  RecName: Full=Membrane-associated prot...  32.7    3.0  
sp|Q28139.2|NCKX1_BOVIN  RecName: Full=Sodium/potassium/calciu...  32.7    3.0  
sp|Q8NV48.1|TCAA_STAAW  RecName: Full=Membrane-associated prot...  32.7    3.0  
sp|P17260.1|KRE1_YEAST  RecName: Full=Protein KRE1; AltName: F...  32.7    3.0  
sp|Q7A3X6.1|TCAA_STAAN  RecName: Full=Membrane-associated prot...  32.7    3.0  
sp|P38058.1|CBPA_CLOCL  RecName: Full=Cellulose-binding protei...  32.7    3.0  
sp|Q8I7T3.1|SADA_DICDI  RecName: Full=Substrate-adhesion molec...  32.7    3.0  
sp|Q8T2A1.1|COLB_DICDI  RecName: Full=Colossin-B; Flags: Precu...  32.7    3.0  
sp|B2FPB2.1|SECA_STRMK  RecName: Full=Protein translocase subu...  32.3    3.9  
sp|Q9UKN1.2|MUC12_HUMAN  RecName: Full=Mucin-12; Short=MUC-12;...  32.3    3.9  
sp|Q9KI14.1|SDRF_STAEP  RecName: Full=Serine-aspartate repeat-...  32.3    3.9  
sp|Q64761.1|FIB1_ADEG1  RecName: Full=Fiber protein 1              32.3    3.9  
sp|P44602.1|HXUA1_HAEIN  RecName: Full=Heme/hemopexin-binding ...  32.3    3.9  
sp|P28871.1|CARP2_CANAL  RecName: Full=Candidapepsin-2; AltNam...  32.3    3.9  
sp|P13952.1|CCNB_SPISO  RecName: Full=G2/mitotic-specific cycl...  32.3    3.9  
sp|Q9NAX4.1|SP65_DICDI  RecName: Full=Spore coat protein SP65;...  32.3    3.9  
sp|Q869T7.1|PAKF_DICDI  RecName: Full=Serine/threonine-protein...  32.3    3.9  
sp|B3MKS0.1|IHOG_DROAN  RecName: Full=Interference hedgehog; F...  32.0    5.1  
sp|A9HW07.1|LEU1_BORPD  RecName: Full=2-isopropylmalate syntha...  32.0    5.1  
sp|Q9UIF9.4|BAZ2A_HUMAN  RecName: Full=Bromodomain adjacent to...  32.0    5.1  
sp|Q1RJQ6.1|Y327_RICBR  RecName: Full=Uncharacterized protein ...  32.0    5.1  
sp|Q8QZQ7.1|VF396_IIV6  RecName: Full=Uncharacterized protein ...  32.0    5.1  
sp|P32768.4|FLO1_YEAST  RecName: Full=Flocculation protein FLO...  32.0    5.1  
sp|Q5WFT6.1|SYP_BACSK  RecName: Full=Prolyl-tRNA synthetase; A...  32.0    5.1  
sp|Q8C0T5.2|SI1L1_MOUSE  RecName: Full=Signal-induced prolifer...  32.0    5.1  
sp|Q92176.3|COR1A_BOVIN  RecName: Full=Coronin-1A; AltName: Fu...  32.0    5.1  
sp|P39712.2|FLO9_YEAST  RecName: Full=Flocculation protein FLO...  32.0    5.1  
sp|P38537.1|SLAP_BACSH  RecName: Full=Surface-layer 125 kDa pr...  32.0    5.1  
sp|Q60106.1|XANP_XANS2  RecName: Full=Xanthomonalisin; AltName...  32.0    5.1  
sp|Q96J92.1|WNK4_HUMAN  RecName: Full=Serine/threonine-protein...  32.0    5.1  
sp|O35412.1|SI1L1_RAT  RecName: Full=Signal-induced proliferat...  32.0    5.1  
sp|Q54JA3.1|SIBE_DICDI  RecName: Full=Integrin beta-like prote...  32.0    5.1  
sp|Q8BJ34.2|LKAP_MOUSE  RecName: Full=Limkain-b1                   31.6    6.7  
sp|Q96D42.2|TIMD1_HUMAN  RecName: Full=Hepatitis A virus cellu...  31.6    6.7  
sp|Q5Z1F9.1|CH601_NOCFA  RecName: Full=60 kDa chaperonin 1; Al...  31.6    6.7  
sp|Q70JS2.2|KELC_ANOST  RecName: Full=Ring canal kelch homolog...  31.6    6.7  
sp|Q8K3T3.1|PER1_SPAJD  RecName: Full=Period circadian protein...  31.6    6.7  
sp|Q13183.1|S13A2_HUMAN  RecName: Full=Solute carrier family 1...  31.6    6.7  
sp|P47632.1|CH60_MYCGE  RecName: Full=60 kDa chaperonin; AltNa...  31.6    6.7  
sp|Q9H0E9.1|BRD8_HUMAN  RecName: Full=Bromodomain-containing p...  31.6    6.7  
sp|P38900.1|YH19_YEAST  RecName: Full=Putative uncharacterized...  31.6    6.7  
sp|Q54PA8.1|Y2080_DICDI  RecName: Full=Uncharacterized protein...  31.6    6.7  
sp|Q03EE2.1|RPOA_PEDPA  RecName: Full=DNA-directed RNA polymer...  31.6    6.7  
sp|Q0I2T0.1|PNP_HAES1  RecName: Full=Polyribonucleotide nucleo...  31.6    6.7  
sp|B0TQ97.1|PNP_SHEHH  RecName: Full=Polyribonucleotide nucleo...  31.2    8.7  
sp|O75952.2|CABYR_HUMAN  RecName: Full=Calcium-binding tyrosin...  31.2    8.7  
sp|A1T577.1|CH602_MYCVP  RecName: Full=60 kDa chaperonin 2; Al...  31.2    8.7  
sp|Q2JFC5.1|CH601_FRASC  RecName: Full=60 kDa chaperonin 1; Al...  31.2    8.7  
sp|Q12QG9.2|PNP_SHEDO  RecName: Full=Polyribonucleotide nucleo...  31.2    8.7  
sp|Q12WE6.1|DNAK_METBU  RecName: Full=Chaperone protein dnaK; ...  31.2    8.7  
sp|Q9A8J6.1|GLYA_CAUCR  RecName: Full=Serine hydroxymethyltran...  31.2    8.7  
sp|Q9Z898.2|PMP7_CHLPN  RecName: Full=Probable outer membrane ...  31.2    8.7  
sp|Q08673.1|SRL1_YEAST  RecName: Full=Cell wall protein SRL1; ...  31.2    8.7  
sp|Q00298.1|CUTI_BOTFU  RecName: Full=Cutinase; AltName: Full=...  31.2    8.7  
sp|P24088.3|YRF11_YEAST  RecName: Full=Y' element ATP-dependen...  31.2    8.7  
sp|P47178.1|DAN1_YEAST  RecName: Full=Cell wall protein DAN1; ...  31.2    8.7  
sp|P36027.1|MID2_YEAST  RecName: Full=Cell wall integrity sens...  31.2    8.7  
sp|O13559.1|YRF14_YEAST  RecName: Full=Y' element ATP-dependen...  31.2    8.7  
sp|O35973.1|PER1_MOUSE  RecName: Full=Period circadian protein...  31.2    8.7  
sp|O85805.1|CR1BE_BACTU  RecName: Full=Pesticidal crystal prot...  31.2    8.7  
sp|A5G0F8.1|OXAA_ACICJ  RecName: Full=Inner membrane protein oxaA  31.2    8.7  

ALIGNMENTS
>sp|P18305.2|Y43R_IIV6 RecName: Full=Uncharacterized protein 443R
Length=2432

 Score = 72.8 bits (177),  Expect = 3e-12
 Identities = 82/322 (25%), Positives = 143/322 (44%), Gaps = 47/322 (14%)
 Frame = +1

Query  31    ITLGGDLSGSAT-----VTNLGDATLT--ATITANSVALGTDTTGNFVADLTAGEGIDVS  189
             I L GDL GS T     V + G  TLT  A ++ NS  +G+ +T +   +LT G G+ +S
Sbjct  1900  IQLAGDLGGSGTTASSPVISSGAITLTKMANLSGNSQIIGSGSTSSSPVNLTLGSGLQIS  1959

Query  190   GG--GSENATITVSAEDATSSN-----KGIASFDSTDFTVSSGAVTV----NAERVQDIV  336
             G      +AT+TV    AT+         +    +T  T+++GA+T+    N      I+
Sbjct  1960  GTVLSVNSATLTVPPATATTIGGIEMLGDLTGSVATAPTIAAGAITLAKMANLSGNSQII  2019

Query  337   GAMVGSNTESGITV--TYEDSDGTLDFNVADPV-----------ITLSGDVAGS---ATM  468
             G+   ++T + +T+    + S   L  N A              I + GD+ GS   A  
Sbjct  2020  GSSSTTSTPTNLTLGSGLQISGTVLSVNSATLTVPPATATTIGGIEMLGDLTGSVATAPT  2079

Query  469   TNLGDVTIS--TTIQANSIALGTDTTGNYVSAISAGEGIDVSGSGSETATVTISAEDATD  642
                G +T+S    +  NS  +G+ +T +  + ++ G G+ +SG+     + T++   AT 
Sbjct  2080  VATGAITLSKMANLSGNSQIIGSSSTTSTPTNLTLGSGLQISGTVLSVNSATLTVPPATA  2139

Query  643   SNKG-------IASFDATDFTVSSGDVTV----NAERIQDIVGAMFSSNTESGISVTYED  789
             +  G       +    AT  TV++G +T+    N      ++G+  ++ + + IS+    
Sbjct  2140  TTIGGIEMLGDLTGSVATAPTVAAGAITLAKMANLSGTSQLIGSSSTTTSPANISLGSTL  2199

Query  790   SDGTIDLDVSDPTLSLQAMSQV  855
                   L V+  TL L   S V
Sbjct  2200  QMSGTTLSVNTSTLMLLVPSSV  2221


 Score = 67.8 bits (164),  Expect = 8e-11
 Identities = 73/307 (23%), Positives = 144/307 (46%), Gaps = 39/307 (12%)
 Frame = +1

Query  31    ITLGGDLSGSATVTNLGDATLT----ATITANSVALGTDTTGNFVADLTAGEGIDVSGG-  195
             I L GDL+GS+    +    +T    A ++ NS  +G+ +T +   +LT G G+ +SG  
Sbjct  1373  IQLSGDLTGSSISPTVAAGAITLAKMANLSGNSQIIGSSSTTSSPTNLTLGSGLQISGTV  1432

Query  196   -GSENATITVSAEDATSSN-----KGIASFDSTDFTVSSGAVTV----NAERVQDIVGAM  345
                 +AT+TV    AT+         +    +T  TV++GA+T+    N      I+G+ 
Sbjct  1433  LSVNSATLTVPPATATTIGGIEMLGDLTGSVATAPTVAAGAITLAKMANLSGSSQIIGSS  1492

Query  346   VGSNTESGITV--TYEDSDGTLDFNVADPV----ITLSGDVAGS---ATMTNLGDVTIS-  495
               ++  + +T+    + +   L  + A       I + GD+ GS   A     G +T++ 
Sbjct  1493  STTSAPTNLTLGSGLQITSTVLSISAATSSTLGGIEMLGDLTGSVATAPTVAAGAITLAK  1552

Query  496   -TTIQANSIALGTDTTGNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKG------  654
                +  NS  +G+ +T +  + ++ G G+ +SG+     + T++   AT +  G      
Sbjct  1553  MANLSGNSQIIGSSSTASTPTNLTLGSGLQISGTVLSVNSATLTVPPATATTIGGIEMLG  1612

Query  655   -IASFDATDFTVSSGDVTV----NAERIQDIVGAMFSSNTESGISV-TYEDSDGTIDLDV  816
              +    AT  TV++G +T+    N      I+G+  +++T + +++ +     GT+ L V
Sbjct  1613  DLTGSVATAPTVAAGAITLAKMANLSGNSQIIGSSSTASTPTNLTLGSGLQISGTV-LSV  1671

Query  817   SDPTLSL  837
             +  TL++
Sbjct  1672  NSATLTV  1678


 Score = 65.5 bits (158),  Expect = 4e-10
 Identities = 78/314 (24%), Positives = 146/314 (46%), Gaps = 47/314 (14%)
 Frame = +1

Query  31    ITLGGDLSGS-ATVTNLGDATLT----ATITANSVALGTDTTGNFVADLTAGEGIDVSGG  195
             I + GDL+GS AT   +    +T    A ++ NS  +G+ +T +   +LT G G+ +SG 
Sbjct  1527  IEMLGDLTGSVATAPTVAAGAITLAKMANLSGNSQIIGSSSTASTPTNLTLGSGLQISGT  1586

Query  196   --GSENATITVSAEDATSSN-----KGIASFDSTDFTVSSGAVTV----NAERVQDIVGA  342
                  +AT+TV    AT+         +    +T  TV++GA+T+    N      I+G+
Sbjct  1587  VLSVNSATLTVPPATATTIGGIEMLGDLTGSVATAPTVAAGAITLAKMANLSGNSQIIGS  1646

Query  343   MVGSNTESGITV--TYEDSDGTLDFNVADPV-----------ITLSGDVAGS---ATMTN  474
                ++T + +T+    + S   L  N A              I + GD+ GS   A    
Sbjct  1647  SSTASTPTNLTLGSGLQISGTVLSVNSATLTVPPATATTIGGIEMLGDLTGSVATAPTVA  1706

Query  475   LGDVTIS--TTIQANSIALGTDTTGNYVSAISAGEGIDVSGSGSETATVTISAEDATDSN  648
              G +T++    +  NS  +G+ +T +  + ++ G G+ +SG+     + T++   AT + 
Sbjct  1707  AGAITLAKMANLSGNSQIIGSSSTASTPTNLTLGSGLQISGTVLSINSATLTVPPATATT  1766

Query  649   KG-------IASFDATDFTVSSGDVTV----NAERIQDIVGAMFSSNTESGISV-TYEDS  792
              G       +    AT  TV++G +T+    N      I+G+  +++T + +++ +    
Sbjct  1767  IGGIEMLGDLTGSVATAPTVAAGAITLAKMANLSGNSQIIGSSSTASTPTNLTLGSGLQI  1826

Query  793   DGTIDLDVSDPTLS  834
              GTI L+V+  +LS
Sbjct  1827  SGTI-LNVNTTSLS  1839


 Score = 58.2 bits (139),  Expect = 7e-08
 Identities = 60/210 (28%), Positives = 96/210 (45%), Gaps = 27/210 (12%)
 Frame = +1

Query  31   ITLGGDLSG--SATVTNLGDATLT--ATITANSVALGTDTTGNFVADLTAGEGIDVSGGG  198
            + L GDLSG  SA V   G  TLT  A +T+ S  +G+ +T    + L+ G  + +SG  
Sbjct  776  VQLSGDLSGVASAPVITTGAITLTKMANLTSTSSLIGSSSTSTSPSQLSLGSNLQISG--  833

Query  199  SENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNTESGITV  378
                T+ V+    TSS  G      T  T+S   V    + +       +G++  +   V
Sbjct  834  ---TTLDVN----TSSLSG-TFLPLTGGTMSGNIVIPTGDLISITDAPTIGTSAANKAYV  885

Query  379  TYEDSDGTLDFNVADPV---ITLSGDVAG-SATMTNL--GDVTIS--TTIQANSIALGTD  534
                 D  +  N    V   I L+GD+ G SAT+  +  G VT+S    +      +G+ 
Sbjct  886  -----DANITPNATSTVLGKIQLAGDLLGSSATLPTISAGAVTLSKMANLSTTMSLIGSS  940

Query  535  TTGNYVSAISAGEGIDVSGSGSETATVTIS  624
            +T N VS +S G  + +SG+  +  T ++S
Sbjct  941  STSNLVSQLSLGSNLQISGTTLDVNTSSLS  970


 Score = 58.2 bits (139),  Expect = 7e-08
 Identities = 57/233 (24%), Positives = 108/233 (46%), Gaps = 34/233 (14%)
 Frame = +1

Query  31    ITLGGDLSGS-ATVTNLGDATLT----ATITANSVALGTDTTGNFVADLTAGEGIDVSGG  195
             I + GDL+GS AT   +    +T    A ++ NS  +G+ +T +   +LT G G+ +SG 
Sbjct  1608  IEMLGDLTGSVATAPTVAAGAITLAKMANLSGNSQIIGSSSTASTPTNLTLGSGLQISGT  1667

Query  196   GSENATITVSAEDATSSNKG-------IASFDSTDFTVSSGAVTV----NAERVQDIVGA  342
                  + T++   AT++  G       +    +T  TV++GA+T+    N      I+G+
Sbjct  1668  VLSVNSATLTVPPATATTIGGIEMLGDLTGSVATAPTVAAGAITLAKMANLSGNSQIIGS  1727

Query  343   MVGSNTESGITVT--YEDSDGTLDFNVADPV-----------ITLSGDVAGS---ATMTN  474
                ++T + +T+    + S   L  N A              I + GD+ GS   A    
Sbjct  1728  SSTASTPTNLTLGSGLQISGTVLSINSATLTVPPATATTIGGIEMLGDLTGSVATAPTVA  1787

Query  475   LGDVTIS--TTIQANSIALGTDTTGNYVSAISAGEGIDVSGSGSETATVTISA  627
              G +T++    +  NS  +G+ +T +  + ++ G G+ +SG+     T ++S+
Sbjct  1788  AGAITLAKMANLSGNSQIIGSSSTASTPTNLTLGSGLQISGTILNVNTTSLSS  1840


 Score = 56.6 bits (135),  Expect = 2e-07
 Identities = 65/284 (22%), Positives = 121/284 (42%), Gaps = 69/284 (24%)
 Frame = +1

Query  31    ITLGGDLSGSATVTNLGDATLT----ATITANSVALGTDTTGNFVADLTAGEGIDVSGGG  198
             I L GDL+GS+T   +    +T    A ++ NS  +G+ +T +   +LT G G+ +SG  
Sbjct  1055  IQLSGDLTGSSTSPTIAAGAITLVKMANLSGNSQIIGSSSTASTPTNLTLGSGLQISGTV  1114

Query  199   SENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNTESGITV  378
                 + T++   AT++  G                    E + D+ G++  + T      
Sbjct  1115  LSVNSATLTVPPATATTIG------------------GIEMLGDLTGSVATAPT------  1150

Query  379   TYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSA  558
                         VA   ITL       A M NL           NS  +G+ +T +  + 
Sbjct  1151  ------------VATGAITL-------AKMANL---------SGNSQIIGSSSTTSTPTN  1182

Query  559   ISAGEGIDVSGSGSETATVTISAEDATDSNKG-------IASFDATDFTVSSGDVTV---  708
             ++ G G+ +SG+     + T++   AT ++ G       +    AT  TV++G +T+   
Sbjct  1183  LTLGSGLQISGTVLSVNSATLTVPPATSTSLGGIEMLGDLTGSVATAPTVAAGAITLAKM  1242

Query  709   -NAERIQDIVGAMFSSNTESGISV-TYEDSDGTIDLDVSDPTLS  834
              N      I+G+  +++T   +++  +    GT+ L+V+  +LS
Sbjct  1243  ANLSGNSQIIGSSSTASTPVNLTLGNFLQMTGTV-LNVNSSSLS  1285


 Score = 53.5 bits (127),  Expect = 2e-06
 Identities = 87/314 (27%), Positives = 143/314 (45%), Gaps = 51/314 (16%)
 Frame = +1

Query  31    ITLGGDLSGS-ATVTNLGDATLT----ATITANSVALGTDTTGNFVADLTAGEGIDVSGG  195
             I + GDL+GS AT   +    +T    A ++ NS  +G+ +T +   +LT G G+ +SG 
Sbjct  1135  IEMLGDLTGSVATAPTVATGAITLAKMANLSGNSQIIGSSSTTSTPTNLTLGSGLQISGT  1194

Query  196   GSENATITVSAEDATSSNKG-------IASFDSTDFTVSSGAVTV----NAERVQDIVGA  342
                  + T++   ATS++ G       +    +T  TV++GA+T+    N      I+G+
Sbjct  1195  VLSVNSATLTVPPATSTSLGGIEMLGDLTGSVATAPTVAAGAITLAKMANLSGNSQIIGS  1254

Query  343   MVGSNTESGITV-TYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTIST---TIQA  510
                ++T   +T+  +    GT+  NV     +LSG     +  T  G++ I T      A
Sbjct  1255  SSTASTPVNLTLGNFLQMTGTV-LNVNSS--SLSGTFLPLSGGTMSGNIVIPTGDLISIA  1311

Query  511   NSIALGTDTTGN-YVSA--ISAGEG--------IDVSGSGSETATVTIS-AEDATDSNKG  654
             ++  +GT      YV A  ISA           I +SG    T+T T+   + AT S +G
Sbjct  1312  DAPTVGTSAANKAYVDAQIISATPNATTTTLGKIQLSGDFDSTSTATVPIIKSATSSIQG  1371

Query  655   --IASFDATDF----TVSSGDVTV----NAERIQDIVGAMFSSNTESGISVTYEDS---D  795
                 S D T      TV++G +T+    N      I+G+  SS T S  ++T        
Sbjct  1372  KIQLSGDLTGSSISPTVAAGAITLAKMANLSGNSQIIGS--SSTTSSPTNLTLGSGLQIS  1429

Query  796   GTIDLDVSDPTLSL  837
             GT+ L V+  TL++
Sbjct  1430  GTV-LSVNSATLTV  1442


 Score = 40.4 bits (93),  Expect = 0.014
 Identities = 66/268 (24%), Positives = 115/268 (42%), Gaps = 39/268 (14%)
 Frame = +1

Query  31   ITLGGDLSGS--ATVTNLGDATLTATITANSVA--LGTDTTGNFVADLTAGEGIDVSGGG  198
            I L GD+SG+  A V + G  TL+     N+V+  +G+  T     +++ G  + ++G  
Sbjct  391  IQLSGDISGTAIAPVVSPGAITLSKMANLNAVSNLIGSSNTNVTPTNISLGSNLQMTG--  448

Query  199  SENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNTESGITV  378
                T+ V+    +       SF S      SG + + +  +  I  A V     S    
Sbjct  449  ---TTLNVNLTSLS------GSFLSLLGGTMSGNIIIPSGDLISIADAPVSGT--SAANK  497

Query  379  TYEDSDGTLDF--NVADPV---ITLSGDVAG-SATMTNL--GDVTIS--TTIQANSIALG  528
            +Y DS   ++   N    V   I L+GD+ G SAT   +  G +T+S    + + S  +G
Sbjct  498  SYVDSQIIVNATPNATSTVLGKIQLTGDLLGSSATFPTVAPGAITLSKLANLSSPSKLIG  557

Query  529  TDTTGNYVSAISAGEGIDVSGS--------GSETATVTISAEDATD-SNKGIASFDATDF  681
            + +T +  + I+ G  + +SG+         + T   TIS       SN G  +   T +
Sbjct  558  SGSTSSSPANITLGTSLSMSGTSLNVVPTFSNPTFNGTISGTAVLGVSNGGTGNSTLTGY  617

Query  682  TVSSGD---VTVNAERIQDIVGAMFSSN  756
             V +G      V +  + ++ GA+ S N
Sbjct  618  VVGNGTAPFTAVTSIPVSNVNGAVQSVN  645


>sp|Q5SSG8.1|MUC21_HUMAN RecName: Full=Mucin-21; Short=MUC-21; AltName: Full=Epiglycanin; 
Flags: Precursor
Length=566

 Score = 63.5 bits (153),  Expect = 2e-09
 Identities = 76/250 (30%), Positives = 112/250 (44%), Gaps = 18/250 (7%)
 Frame = +1

Query  52   SGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGIDVSGGGSENATITVSAE  231
            SG++T TN  ++  T+     +    + TT +  +  T  E    S G     T T S  
Sbjct  205  SGASTATN-SESRTTSNGAGTATNSESSTTSSGASTATNSESSTPSSGAG---TATNSES  260

Query  232  DATSSNKGIASFDSTDFTVSSGAVTV-NAERVQDIVGAMVGSNTESGITVTYEDSDGTLD  408
              TSS  G A+ +S   TVSSG  TV N+E      GA   +N+ES  T +  ++    D
Sbjct  261  STTSSGAGTAT-NSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSD  319

Query  409  FNVADPVITLSGDVAGSATMTNLGDVT---ISTTIQANSIAL--GTDTTGNYVSAISAGE  573
             +      + + +   S T +     T    STT    S A   G+ TT +  S  +  E
Sbjct  320  SSTTSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSGSSTTSSGTSTATNSE  379

Query  574  GIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTV-NAERIQDIVGAMFS  750
               VS SG+ TAT   ++E +T S+    + ++   TVSSG  T  N+E      GA  +
Sbjct  380  SSTVS-SGASTAT---TSESSTTSSGASTATNSESSTVSSGASTATNSESSTTSSGA--N  433

Query  751  SNTESGISVT  780
            + T SG SVT
Sbjct  434  TATNSGSSVT  443


 Score = 56.6 bits (135),  Expect = 2e-07
 Identities = 80/287 (27%), Positives = 115/287 (40%), Gaps = 34/287 (11%)
 Frame = +1

Query  52   SGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGIDVSGGGSENATITVSAE  231
            SG++T TN   +T ++     S A  +D++       TA    D S   SE +T T S  
Sbjct  115  SGASTATNSESSTPSS---GASTATNSDSSTTSSGASTATNS-DSSTTSSEASTATNSES  170

Query  232  DATSSNKGIASFDSTDFTVSSGAVTV-NAERVQDIVGAMVGSNTESGI------TVTYED  390
              TSS    A+ +S   TVSS A T  N+E      GA   +N+ES        T T  +
Sbjct  171  STTSSGASTAT-NSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSE  229

Query  391  SDGTLDFNVADPVITLSGDVAGSATMTNLGDVTIS------TTIQANSIALGTDTTGNYV  552
            S  T            S   +G+ T TN    T S      T  ++++++ G  T  N  
Sbjct  230  SSTTSSGASTATNSESSTPSSGAGTATNSESSTTSSGAGTATNSESSTVSSGISTVTNSE  289

Query  553  SAI-SAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVT-------V  708
            S+  S+G     + + SE++T +  A  AT+S+    S  A+  T S    T        
Sbjct  290  SSTPSSGAN---TATNSESSTTSSGANTATNSDSSTTSSGASTATNSESSTTSSGASTAT  346

Query  709  NAERIQDIVGAMF-----SSNTESGISVTYEDSDGTIDLDVSDPTLS  834
            N+E      GA       SS T SG S        T+    S  T S
Sbjct  347  NSESSTTSSGASTATNSGSSTTSSGTSTATNSESSTVSSGASTATTS  393


 Score = 48.5 bits (114),  Expect = 5e-05
 Identities = 46/190 (24%), Positives = 77/190 (40%), Gaps = 12/190 (6%)
 Frame = +1

Query  205  NATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNTESGITVTY  384
            ++ I+  A  AT+S   + S   +  T+S  +VT N        G  + +N+E   T + 
Sbjct  35   SSVISSGASTATNSGSSVTSSGVSTATISGSSVTSN--------GVSIVTNSEFHTTSSG  86

Query  385  EDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVT---ISTTIQANSIALGTDTTGNYVS  555
              +    +F+ A   I+++ +   S T +     T    ST     S A  +D++    S
Sbjct  87   ISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSSGASTATNSDSSTTS-S  145

Query  556  AISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIV  735
              S     D S + SE +T T S    T S    A+   +    S      N+E      
Sbjct  146  GASTATNSDSSTTSSEASTATNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSS  205

Query  736  GAMFSSNTES  765
            GA  ++N+ES
Sbjct  206  GASTATNSES  215


 Score = 46.2 bits (108),  Expect = 3e-04
 Identities = 46/188 (24%), Positives = 82/188 (43%), Gaps = 15/188 (7%)
 Frame = +1

Query  52   SGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGIDVSGGGSENATITVSAE  231
            SG +TVTN   +T ++   AN+      +T +  A+         +  G+  AT   ++E
Sbjct  280  SGISTVTNSESSTPSSG--ANTATNSESSTTSSGANTATNSDSSTTSSGASTAT---NSE  334

Query  232  DATSSNKGIASFDSTDFTVSSGAVTV-NAERVQDIVGAMVGSNTESGITVTYEDSDGTLD  408
             +T+S+    + +S   T SSGA T  N+       G    +N+ES        S G   
Sbjct  335  SSTTSSGASTATNSESSTTSSGASTATNSGSSTTSSGTSTATNSESSTV-----SSGAST  389

Query  409  FNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSAISAGEGIDVS  588
               ++   T S    G++T TN    T+S+     + +  + T+    +A ++G  +  +
Sbjct  390  ATTSESSTTSS----GASTATNSESSTVSSGASTATNSESSTTSSGANTATNSGSSVTSA  445

Query  589  GSGSETAT  612
            GSG+   T
Sbjct  446  GSGTAALT  453
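
The hit tables in these BLAST reports share one layout: an identifier, a truncated description, then the bit score and the E-value as the last two columns. A minimal Python sketch for reading such lines and keeping only hits below an E-value cutoff is given here. The names `Hit`, `parse_hit_line` and `select_hits` are hypothetical helpers introduced for illustration, not part of BLAST or of the original annotation, and the cutoff used is only an example.

```python
from typing import Iterable, List, NamedTuple, Optional


class Hit(NamedTuple):
    """One row of a BLAST 'Sequences producing significant alignments' table."""
    accession: str
    description: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Optional[Hit]:
    """Parse one summary line; return None for headers, alignments, blanks."""
    # The last two whitespace-separated fields are the bit score and E-value.
    parts = line.rstrip().rsplit(None, 2)
    if len(parts) != 3:
        return None
    head, bits_text, evalue_text = parts
    # Some BLAST versions print very small E-values as 'e-100' (no mantissa).
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    try:
        bits, evalue = float(bits_text), float(evalue_text)
    except ValueError:
        return None  # not a hit row (e.g. the column header line)
    accession, _, description = head.partition(" ")
    return Hit(accession, description.strip(), bits, evalue)


def select_hits(lines: Iterable[str], max_evalue: float) -> List[Hit]:
    """Keep parsed hits whose E-value is at or below max_evalue."""
    hits = (parse_hit_line(line) for line in lines)
    return [h for h in hits if h is not None and h.evalue <= max_evalue]


if __name__ == "__main__":
    sample = [
        "Sequences producing significant alignments:                       (Bits)  Value",
        "sp|P15921.1|OMPA_RICRI  RecName: Full=Outer membrane protein A...  38.9    0.042",
        "sp|P04949.2|FLIC_ECOLI  RecName: Full=Flagellin                    37.4    0.12 ",
    ]
    for hit in select_hits(sample, max_evalue=0.05):
        print(hit.accession, hit.bits, hit.evalue)
```

Splitting from the right keeps the parser independent of the description width, which varies because BLAST truncates long descriptions with "...".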

----------------------------------------------------------------------------------------------------------

e) BLASTx versus NR
   
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|ZP_05023899.1|  haemagglutination activity domain protein ...  79.7    4e-13
gb|EFA77040.1|  hypothetical protein PPL_09793 [Polysphondyliu...  75.1    9e-12
ref|ZP_01265299.1|  hypothetical protein PU1002_01715 [Candida...  74.3    2e-11
ref|XP_002067709.1|  GK12566 [Drosophila willistoni] >gb|EDW78...  73.6    3e-11
ref|NP_149906.1|  443R [Invertebrate iridescent virus 6] >sp|P...  72.8    5e-11
gb|EFA78206.1|  hypothetical protein PPL_08856 [Polysphondyliu...  72.0    8e-11
ref|ZP_02733723.1|  FG-GAP repeat protein [Gemmata obscuriglob...  72.0    8e-11
gb|EFA77044.1|  IPT/TIG domain-containing protein [Polysphondy...  71.2    1e-10
gb|EFA77038.1|  hypothetical protein PPL_09791 [Polysphondyliu...  70.9    2e-10
ref|ZP_00054112.2|  COG5295: Autotransporter adhesin [Magnetos...  68.2    1e-09
ref|XP_002143028.1|  YALI0C06391p [Yarrowia lipolytica] >emb|C...  67.4    2e-09
ref|ZP_06320006.1|  predicted protein [Staphylococcus aureus s...  66.6    3e-09
ref|ZP_05741152.1|  outer membrane autotransporter barrel doma...  66.6    3e-09
ref|YP_537932.1|  cell surface antigen Sca3 [Rickettsia bellii...  66.6    3e-09
gb|EFA85227.1|  hypothetical protein PPL_02227 [Polysphondyliu...  66.2    4e-09
ref|YP_266348.1|  hypothetical protein SAR11_0932 [Candidatus ...  66.2    4e-09
ref|ZP_01080684.1|  hypothetical protein RS9917_01402 [Synecho...  66.2    4e-09
ref|XP_001989588.1|  GH18720 [Drosophila grimshawi] >gb|EDV926...  65.9    6e-09
ref|YP_001235214.1|  filamentous haemagglutinin outer membrane...  65.9    6e-09
ref|ZP_05135070.1|  outer membrane autotransporter barrel doma...  65.5    7e-09
ref|YP_002235020.1|  putative haemagglutinin-related autotrans...  65.5    7e-09
ref|ZP_01263915.1|  hypothetical protein PU1002_01756 [Candida...  65.5    7e-09
gb|AAD39531.2|AF149108_1  outer membrane protein A [Rickettsia...  65.5    7e-09
ref|XP_001475581.1|  PREDICTED: similar to mucin 3 [Mus musculus]  65.1    1e-08
ref|YP_402123.1|  hypothetical protein SDY_0423 [Shigella dyse...  65.1    1e-08
ref|ZP_01155761.1|  type I secretion target repeat protein [Oc...  65.1    1e-08
ref|ZP_01015063.1|  putative RTX toxin [Rhodobacterales bacter...  65.1    1e-08
ref|YP_002882789.1|  Ig domain protein group 1 domain protein ...  64.7    1e-08
ref|ZP_05081208.1|  hemagglutination activity domain protein [...  64.7    1e-08
ref|YP_818756.1|  hypothetical protein LEUM_1286 [Leuconostoc ...  64.7    1e-08
ref|YP_003498212.1|  hypothetical protein G2583_0601 [Escheric...  64.3    2e-08
gb|EEY55778.1|  mucin-like protein [Phytophthora infestans T30-4]  64.3    2e-08
ref|ZP_05938252.1|  putative RTX family exoprotein [Escherichi...  64.3    2e-08
ref|ZP_03006563.1|  BNR/Asp-box repeat domain protein [Escheri...  64.3    2e-08
ref|ZP_02823062.1|  large repetitive protein [Escherichia coli...  64.3    2e-08
ref|ZP_02785361.1|  large repetitive protein [Escherichia coli...  64.3    2e-08
ref|NP_308569.1|  hypothetical protein ECs0542 [Escherichia co...  64.3    2e-08
ref|ZP_01050750.1|  hypothetical protein MED134_12246 [Dokdoni...  64.3    2e-08
ref|ZP_06652476.1|  conserved hypothetical protein [Escherichi...  63.9    2e-08
ref|ZP_05058881.1|  Putative Ig domain family [Verrucomicrobia...  63.9    2e-08
ref|XP_518341.2|  PREDICTED: similar to KMQK697 [Pan troglodytes]  63.9    2e-08
dbj|BAH14734.1|  unnamed protein product [Homo sapiens]            63.5    3e-08
dbj|BAG63124.1|  unnamed protein product [Homo sapiens]            63.5    3e-08
emb|CAQ07653.1|  chromosome 6 open reading frame 205 [Homo sap...  63.5    3e-08
ref|NP_001010909.2|  mucin-21 precursor [Homo sapiens] >emb|CA...  63.5    3e-08
ref|ZP_01288681.1|  Flagellin-like:transferase hexapeptide rep...  63.5    3e-08
sp|Q5SSG8.1|MUC21_HUMAN  RecName: Full=Mucin-21; Short=MUC-21;...  63.5    3e-08
gb|AAI07479.1|  MUC21 protein [Homo sapiens]                       63.5    3e-08
ref|ZP_06317771.1|  predicted protein [Staphylococcus aureus s...  63.2    4e-08
ref|ZP_05603213.1|  predicted protein [Staphylococcus aureus s...  63.2    4e-08
ref|ZP_05600561.1|  serine-rich repeat-containing protein [Sta...  63.2    4e-08
ref|ZP_03562365.1|  serine-rich repeat-containing protein [Sta...  63.2    4e-08
ref|XP_001981456.1|  GG12068 [Drosophila erecta] >gb|EDV53326....  63.2    4e-08
ref|ZP_01256728.1|  Polymorphic membrane protein [Psychroflexu...  63.2    4e-08
ref|YP_001015925.1|  hypothetical protein NATL1_21051 [Prochlo...  63.2    4e-08
ref|YP_042074.1|  serine-rich repeat-containing protein [Staph...  63.2    4e-08
gb|EEU38055.1|  hypothetical protein NECHADRAFT_84492 [Nectria...  62.8    5e-08
ref|YP_003147782.1|  hypothetical protein Kkor_2606 [Kangiella...  62.8    5e-08
ref|XP_002043022.1|  GM16292 [Drosophila sechellia] >gb|EDW491...  62.8    5e-08
ref|ZP_01900113.1|  fibronectin type III domain protein [Morit...  62.8    5e-08
ref|YP_001530167.1|  YadA domain-containing protein [Desulfoco...  62.8    5e-08
ref|YP_001758884.1|  outer membrane adhesin like proteiin [She...  62.8    5e-08
ref|ZP_01385866.1|  Polymorphic membrane protein, Chlamydia:Ha...  62.8    5e-08
ref|YP_051355.1|  putative hemagglutinin/hemolysin-related pro...  62.8    5e-08
ref|YP_003398101.1|  Hemagluttinin domain protein [Acidaminoco...  62.4    6e-08
ref|NP_001163761.1|  papilin, isoform G [Drosophila melanogast...  62.4    6e-08
ref|NP_788751.2|  papilin, isoform F [Drosophila melanogaster]...  62.4    6e-08
ref|NP_001163760.1|  papilin, isoform C [Drosophila melanogast...  62.4    6e-08
ref|ZP_03569332.1|  outer membrane autotransporter barrel doma...  62.4    6e-08
ref|YP_001584088.1|  outer membrane autotransporter [Burkholde...  62.4    6e-08
ref|NP_788752.2|  papilin, isoform E [Drosophila melanogaster]...  62.4    6e-08
ref|YP_394461.1|  hypothetical protein Suden_1952 [Sulfurimona...  62.4    6e-08
ref|NP_508295.1|  hypothetical protein H02F09.3 [Caenorhabditi...  62.4    6e-08
ref|YP_379052.1|  VCBS [Chlorobium chlorochromatii CaD3] >gb|A...  62.4    6e-08
ref|XP_504116.1|  YALI0E18722p [Yarrowia lipolytica] >emb|CAG7...  62.4    6e-08
ref|ZP_00999666.1|  hypothetical protein OB2597_13888 [Oceanic...  62.4    6e-08
ref|ZP_06647699.1|  conserved hypothetical protein [Escherichi...  62.0    8e-08
emb|CBA31994.1|  hypothetical protein [Curvibacter putative sy...  62.0    8e-08
ref|ZP_03582893.1|  outer membrane autotransporter barrel doma...  62.0    8e-08
ref|YP_002411290.1|  adhesin for cattle intestine colonization...  62.0    8e-08
ref|ZP_05102368.1|  outer membrane autotransporter barrel [Ros...  62.0    8e-08
ref|YP_002297071.1|  S-layer protein [Rhodospirillum centenum ...  62.0    8e-08
ref|YP_001778971.1|  outer membrane autotransporter [Burkholde...  62.0    8e-08
ref|ZP_02732458.1|  outer membrane autotransporter barrel doma...  62.0    8e-08
ref|YP_001524831.1|  hypothetical protein AZC_1915 [Azorhizobi...  62.0    8e-08
ref|YP_001816063.1|  hypothetical protein BamMC406_6074 [Burkh...  62.0    8e-08
ref|YP_840389.1|  outer membrane autotransporter [Burkholderia...  62.0    8e-08
ref|YP_838560.1|  outer membrane autotransporter [Burkholderia...  62.0    8e-08
ref|YP_626365.1|  Outer membrane autotransporter barrel [Burkh...  62.0    8e-08
emb|CAA87091.1|  secreted acid phosphatase 2 (SAP2) [Leishmani...  62.0    8e-08
ref|YP_044654.1|  putative cell wall-anchored protein [Staphyl...  62.0    8e-08
ref|NP_647392.1|  hypothetical protein MW2575 [Staphylococcus ...  62.0    8e-08
ref|YP_866596.1|  filamentous haemagglutinin outer membrane pr...  62.0    8e-08
ref|XP_002669339.1|  predicted protein [Naegleria gruberi] >gb...  61.6    1e-07
ref|YP_002153711.1|  hypothetical glycine-rich autotransporter...  61.6    1e-07
dbj|BAG61436.1|  unnamed protein product [Homo sapiens]            61.6    1e-07
ref|XP_002070501.1|  GK10999 [Drosophila willistoni] >gb|EDW81...  61.6    1e-07
ref|YP_001229522.1|  Cna B domain-containing protein [Geobacte...  61.6    1e-07
ref|YP_864707.1|  filamentous haemagglutinin outer membrane pr...  61.6    1e-07
emb|CAI18456.1|  chromosome 6 open reading frame 205 [Homo sap...  61.6    1e-07
ref|YP_526272.1|  hypothetical protein Sde_0798 [Saccharophagu...  61.6    1e-07
ref|NP_286230.1|  RTX family exoprotein [Escherichia coli O157...  61.6    1e-07
ref|ZP_04864506.1|  cell wall-anchored protein [Staphylococcus...  61.2    1e-07
ref|YP_002761875.1|  hypothetical protein GAU_2363 [Gemmatimon...  61.2    1e-07
ref|XP_002121463.1|  PREDICTED: similar to zymogen granule mem...  61.2    1e-07
ref|ZP_02925085.1|  Outer membrane autotransporter barrel [Ver...  61.2    1e-07
ref|ZP_01471389.1|  cell wall surface anchor family protein [S...  61.2    1e-07
gb|AAT27425.1|  HmwA [Haemophilus influenzae]                      61.2    1e-07
ref|YP_374310.1|  VCBS [Chlorobium luteolum DSM 273] >gb|ABB23...  61.2    1e-07
ref|YP_264883.1|  hypothetical protein Psyc_1601 [Psychrobacte...  61.2    1e-07
ref|ZP_01312631.1|  flagellin-like [Desulfuromonas acetoxidans...  61.2    1e-07
emb|CAQ51084.1|  serine-rich adhesin for platelets (Staphyloco...  60.8    2e-07
ref|YP_003131163.1|  hypothetical protein Huta_2263 [Halorhabd...  60.8    2e-07
ref|YP_002261122.1|  hemagglutinin-related protein [Ralstonia ...  60.8    2e-07
ref|ZP_03131891.1|  Parallel beta-helix repeat protein [Chthon...  60.8    2e-07
ref|ZP_01879993.1|  Large exoprotein [Roseovarius sp. TM1035] ...  60.8    2e-07
ref|YP_929130.1|  putative outer membrane adhesin like protein...  60.8    2e-07
ref|NP_723377.1|  mucin related 29B [Drosophila melanogaster] ...  60.8    2e-07
ref|XP_383364.1|  hypothetical protein FG03188.1 [Gibberella z...  60.8    2e-07
ref|ZP_06340651.1|  predicted protein [Staphylococcus aureus s...  60.5    2e-07
ref|ZP_05687620.1|  cell wall-anchored protein [Staphylococcus...  60.5    2e-07
ref|YP_002429549.1|  filamentous hemagglutinin family outer me...  60.5    2e-07
ref|ZP_01265301.1|  hypothetical protein PU1002_01725 [Candida...  60.5    2e-07
ref|XP_452176.1|  unnamed protein product [Kluyveromyces lacti...  60.5    2e-07
sp|Q8VQ99.1|SRAP_STAAU  RecName: Full=Serine-rich adhesin for ...  60.5    2e-07
gb|AAQ88781.1|  KMQK697 [Homo sapiens] >emb|CAQ08321.1| chromo...  60.5    2e-07
ref|YP_324654.1|  VCBS [Anabaena variabilis ATCC 29413] >gb|AB...  60.5    2e-07
ref|YP_003453433.1|  hypothetical protein AZL_f01290 [Azospiri...  60.1    3e-07
ref|YP_003371758.1|  peptidase domain protein [Pirellula stale...  60.1    3e-07
ref|ZP_05692361.1|  hypothetical protein SAJG_01441 [Staphyloc...  60.1    3e-07
ref|XP_002421148.1|  hypothetical GPI-anchored protein, putati...  60.1    3e-07
ref|ZP_05053626.1|  type I secretion target GGXGXDXXX repeat p...  60.1    3e-07
ref|YP_001773965.1|  outer membrane autotransporter [Burkholde...  60.1    3e-07
ref|YP_001682728.1|  hemolysin-type calcium-binding region [Ca...  60.1    3e-07
ref|YP_555132.1|  adhesin HecA [Burkholderia xenovorans LB400]...  60.1    3e-07
ref|YP_378932.1|  parallel beta-helix repeat-containing protei...  60.1    3e-07
ref|NP_373178.1|  serine-threoinine rich antigen [Staphylococc...  60.1    3e-07
ref|YP_003430313.1|  conserved hypothetical secreted protein [...  59.7    4e-07
ref|ZP_05929958.1|  outer membrane autotransporter barrel doma...  59.7    4e-07
ref|YP_003091321.1|  Fibronectin type III domain protein [Pedo...  59.7    4e-07
ref|ZP_05157135.1|  outermembrane transporter [Brucella abortu...  59.7    4e-07
gb|EER11577.1|  dentin sialophosphoprotein precursor, putative...  59.7    4e-07
ref|YP_002253278.1|  hemagglutinin-related (transposon inactiv...  59.7    4e-07
ref|YP_001981111.1|  Putative Ig domain family [Cellvibrio jap...  59.7    4e-07
ref|ZP_02731030.1|  FG-GAP repeat protein [Gemmata obscuriglob...  59.7    4e-07
ref|ZP_01256273.1|  hypothetical protein P700755_32769 [Psychr...  59.7    4e-07
ref|YP_001858800.1|  YadA domain-containing protein [Burkholde...  59.7    4e-07
gb|AAO84907.1|  extracellular matrix protein papilin 2 [Drosop...  59.7    4e-07
gb|AAG37995.1|AF205357_1  extracellular matrix protein papilin...  59.7    4e-07
gb|AAO84908.1|  extracellular matrix protein papilin 3 [Drosop...  59.7    4e-07
ref|ZP_01313540.1|  Hemolysin-type calcium-binding region [Des...  59.7    4e-07
ref|YP_002550799.1|  Ca 2+ binding protein [Agrobacterium viti...  59.3    5e-07
ref|ZP_03582534.1|  cable pili-associated 22 kDa adhesin prote...  59.3    5e-07
ref|XP_002105231.1|  GD18032 [Drosophila simulans] >gb|EDX1473...  59.3    5e-07
ref|XP_002058703.1|  GJ14166 [Drosophila virilis] >gb|EDW58671...  59.3    5e-07
ref|ZP_01546562.1|  fat protein-possibly involved in cell-cell...  59.3    5e-07
ref|YP_777327.1|  adhesin [Burkholderia ambifaria AMMD] >gb|AB...  59.3    5e-07
ref|YP_555134.1|  filamentous haemagglutinin / adhesin [Burkho...  59.3    5e-07
ref|YP_324144.1|  Na-Ca exchanger/integrin-beta4 [Anabaena var...  59.3    5e-07
ref|YP_252241.1|  hypothetical protein SH0326 [Staphylococcus ...  59.3    5e-07
ref|YP_065841.1|  hypothetical protein DP2105 [Desulfotalea ps...  59.3    5e-07
ref|YP_001633853.1|  polymorphic outer membrane protein [Chlor...  59.3    5e-07
ref|XP_002649170.1|  hypothetical protein DDB_G0295727 [Dictyo...  58.9    7e-07
ref|XP_002344095.1|  PREDICTED: hypothetical protein [Homo sap...  58.9    7e-07
ref|XP_002344035.1|  PREDICTED: hypothetical protein [Homo sap...  58.9    7e-07
ref|XP_002344011.1|  PREDICTED: hypothetical protein [Homo sap...  58.9    7e-07
ref|XP_002343937.1|  PREDICTED: hypothetical protein [Homo sap...  58.9    7e-07
ref|ZP_03982856.1|  surface protein from Gram-positive cocci [...  58.9    7e-07
ref|XP_002118974.1|  hypothetical protein TRIADDRAFT_62951 [Tr...  58.9    7e-07
ref|YP_001970885.1|  putative glycine-rich autotransporter pro...  58.9    7e-07
ref|YP_001655098.1|  hypothetical protein MAE_00840 [Microcyst...  58.9    7e-07
ref|YP_001528499.1|  hemolysin-type calcium-binding region [De...  58.9    7e-07
ref|XP_002344067.1|  PREDICTED: hypothetical protein XP_002344...  58.9    7e-07
ref|XP_001716029.1|  PREDICTED: hypothetical protein [Homo sap...  58.9    7e-07
ref|XP_002343962.1|  PREDICTED: hypothetical protein [Homo sap...  58.9    7e-07
ref|XP_001131329.1|  PREDICTED: hypothetical protein [Homo sap...  58.9    7e-07
emb|CAJ72183.1|  unknown protein [Candidatus Kuenenia stuttgar...  58.9    7e-07
ref|NP_442017.1|  hypothetical protein slr0364 [Synechocystis ...  58.9    7e-07
ref|ZP_01156956.1|  hypothetical protein OG2516_13621 [Oceanic...  58.9    7e-07
ref|ZP_00154578.2|  COG3210: Large exoproteins involved in hem...  58.9    7e-07
ref|YP_003520902.1|  YeeJ [Pantoea ananatis LMG 20103] >gb|ADD...  58.5    9e-07
ref|ZP_06377375.1|  cell wall anchor domain-containing protein...  58.5    9e-07
ref|ZP_04430034.1|  predicted polymerase with PALM domain, HD ...  58.5    9e-07
ref|YP_002381730.1|  adhesin for cattle intestine colonization...  58.5    9e-07
ref|XP_002605321.1|  hypothetical protein BRAFLDRAFT_89036 [Br...  58.5    9e-07
ref|YP_002299309.1|  hypothetical protein RC1_3132 [Rhodospiri...  58.5    9e-07
ref|YP_001944163.1|  Hemolysin-type calcium-binding region [Ch...  58.5    9e-07
ref|YP_001875229.1|  outer membrane autotransporter [Elusimicr...  58.5    9e-07
ref|YP_857205.1|  structural toxin protein RtxA [Aeromonas hyd...  58.5    9e-07
ref|ZP_01471398.1|  hypothetical protein RS9916_36837 [Synecho...  58.5    9e-07
ref|YP_378930.1|  parallel beta-helix repeat-containing protei...  58.5    9e-07
ref|YP_778249.1|  hypothetical protein Bamb_6371 [Burkholderia...  58.5    9e-07
ref|YP_912749.1|  hemolysin-type calcium-binding region [Chlor...  58.5    9e-07
ref|ZP_06468055.1|  hypothetical protein BC1003DRAFT_4277 [Bur...  58.2    1e-06
ref|ZP_06328375.1|  LPXTG-domain-containing protein cell wall ...  58.2    1e-06
ref|YP_002954310.1|  hypothetical protein DMR_29330 [Desulfovi...  58.2    1e-06
ref|ZP_05028293.1|  FG-GAP repeat domain protein [Microcoleus ...  58.2    1e-06
ref|ZP_02735782.1|  autotransporter-associated beta strand rep...  58.2    1e-06
ref|ZP_02165618.1|  iron-regulated protein FrpC [Hoeflea photo...  58.2    1e-06
ref|YP_002979946.1|  filamentous hemagglutinin family outer me...  58.2    1e-06
ref|YP_001333587.1|  hypothetical protein NWMN_2553 [Staphyloc...  58.2    1e-06
emb|CAM75348.1|  conserved hypothetical protein [Magnetospiril...  58.2    1e-06
ref|YP_001812417.1|  filamentous haemagglutinin outer membrane...  58.2    1e-06
ref|XP_454605.1|  unnamed protein product [Kluyveromyces lacti...  58.2    1e-06
ref|YP_528790.1|  hypothetical protein Sde_3323 [Saccharophagu...  58.2    1e-06
ref|YP_578284.1|  Outer membrane autotransporter barrel [Nitro...  58.2    1e-06
ref|YP_501439.1|  hypothetical protein SAOUHSC_02990 [Staphylo...  58.2    1e-06
ref|YP_495223.1|  cell wall anchor domain-containing protein [...  58.2    1e-06
ref|YP_187464.1|  LPXTG cell wall surface anchor family protei...  58.2    1e-06
ref|YP_132263.1|  hypotetical protein [Photobacterium profundu...  58.2    1e-06
ref|YP_672707.1|  outer membrane autotransporter [Mesorhizobiu...  58.2    1e-06
ref|ZP_05993630.1|  outer membrane transporter [Brucella suis ...  57.8    2e-06
ref|YP_003144743.1|  hypothetical protein Shel_23850 [Slackia ...  57.8    2e-06
ref|ZP_03573542.1|  cable pili-associated 22 kDa adhesin prote...  57.8    2e-06
ref|ZP_05063323.1|  outer membrane autotransporter barrel doma...  57.8    2e-06
ref|ZP_05051116.1|  type I secretion target GGXGXDXXX repeat p...  57.8    2e-06
ref|ZP_03132319.1|  autotransporter-associated beta strand rep...  57.8    2e-06
ref|XP_001956560.1|  GF24532 [Drosophila ananassae] >gb|EDV393...  57.8    2e-06
ref|YP_001863381.1|  filamentous haemagglutinin outer membrane...  57.8    2e-06
ref|YP_001770157.1|  structural toxin protein RtxA [Methylobac...  57.8    2e-06
gb|EDN63499.1|  pathogen-related protein [Saccharomyces cerevi...  57.8    2e-06
ref|ZP_04943446.1|  hypothetical protein BCPG_05009 [Burkholde...  57.8    2e-06
ref|ZP_01385852.1|  Haemagluttinin:Filamentous haemagglutinin-...  57.8    2e-06
ref|YP_944152.1|  cadherin domain-containing protein [Psychrom...  57.8    2e-06
ref|YP_434380.1|  outer membrane protein domain-containing pro...  57.8    2e-06
ref|YP_001129948.1|  putative outer membrane adhesin like prot...  57.8    2e-06
dbj|BAI54878.1|  conserved hypothetical protein [Escherichia c...  57.4    2e-06
ref|ZP_06198513.1|  LPXTG cell wall surface anchor family prot...  57.4    2e-06
ref|ZP_05808520.1|  conserved hypothetical protein [Mesorhizob...  57.4    2e-06
ref|ZP_05665807.1|  cell wall surface adhesion protein [Entero...  57.4    2e-06
gb|EEU41266.1|  hypothetical protein NECHADRAFT_83506 [Nectria...  57.4    2e-06
ref|YP_003130980.1|  GLUG domain protein [Halorhabdus utahensi...  57.4    2e-06
ref|YP_003104968.1|  outer membrane autotransporter [Brucella ...  57.4    2e-06
ref|YP_002908876.1|  hypothetical protein bglu_2g12590 [Burkho...  57.4    2e-06
ref|ZP_03999305.1|  hypothetical protein HborDRAFT_2098 [Halog...  57.4    2e-06
ref|ZP_04002571.1|  hemolysin family calcium-binding protein [...  57.4    2e-06
ref|YP_002760255.1|  hypothetical protein GAU_0743 [Gemmatimon...  57.4    2e-06
ref|ZP_03131084.1|  autotransporter-associated beta strand rep...  57.4    2e-06
ref|XP_002067635.1|  GK24882 [Drosophila willistoni] >gb|EDW78...  57.4    2e-06
ref|ZP_02882131.1|  YadA  domain protein [Burkholderia gramini...  57.4    2e-06
ref|YP_001758870.1|  outer membrane adhesin like proteiin [She...  57.4    2e-06
ref|ZP_02146938.1|  hypothetical protein RGBS107_08210 [Phaeob...  57.4    2e-06
ref|ZP_02162962.1|  probable aggregation factor core protein M...  57.4    2e-06
ref|XP_001594406.1|  hypothetical protein SS1G_04213 [Scleroti...  57.4    2e-06
ref|XP_001479012.1|  PREDICTED: similar to Muc3 protein [Mus m...  57.4    2e-06
gb|EDL19282.1|  mCG6879 [Mus musculus]                             57.4    2e-06
gb|ABL74378.1|  antifreeze protein [Marinomonas primoryensis]      57.4    2e-06
ref|ZP_01546133.1|  hypothetical protein SIAM614_18504 [Stappi...  57.4    2e-06
ref|YP_439968.1|  serine protease [Burkholderia thailandensis ...  57.4    2e-06
ref|NP_752300.1|  RTX family exoprotein A gene [Escherichia co...  57.4    2e-06
ref|YP_132258.1|  hypothetical protein PBPRB0585 [Photobacteri...  57.4    2e-06
ref|ZP_06608989.1|  putative lipoprotein [Actinomyces odontoly...  57.0    3e-06
ref|YP_003372592.1|  outer membrane adhesin like proteiin [Pir...  57.0    3e-06
ref|ZP_06105928.1|  predicted protein [Brucella melitensis bv....  57.0    3e-06
ref|XP_002552892.1|  KLTH0D03894p [Lachancea thermotolerans] >...  57.0    3e-06
gb|EEH53646.1|  predicted protein [Micromonas pusilla CCMP1545]    57.0    3e-06
ref|ZP_03738758.1|  hypothetical protein DthioDRAFT_3186 [Desu...  57.0    3e-06
ref|YP_002234743.1|  cable pilus associated adhesin protein [B...  57.0    3e-06
ref|YP_001751229.1|  hypothetical protein PputW619_4380 [Pseud...  57.0    3e-06
ref|ZP_02835031.1|  VCBS repeat-containing protein [Salmonella...  57.0    3e-06
ref|ZP_02735519.1|  outer membrane autotransporter barrel doma...  57.0    3e-06
ref|YP_002147273.1|  ShdA [Salmonella enterica subsp. enterica...  57.0    3e-06
ref|ZP_02380443.1|  Haemagluttinin domain protein [Burkholderi...  57.0    3e-06
ref|ZP_02168020.1|  iron-regulated protein FrpC [Hoeflea photo...  57.0    3e-06
ref|ZP_02149336.1|  hypothetical protein RG210_05237 [Phaeobac...  57.0    3e-06
ref|ZP_01904316.1|  surface adhesion protein, putative [Roseob...  57.0    3e-06
gb|AAT36485.1|  cable pili-associated 22 kDa adhesin protein [...  57.0    3e-06
ref|NP_996054.1|  mucin 68Ca [Drosophila melanogaster] >gb|AAS...  57.0    3e-06
sp|P35828.4|SLAP_CAUCR  RecName: Full=S-layer protein; AltName...  57.0    3e-06
ref|NP_419823.1|  S-layer protein RsaA [Caulobacter crescentus...  57.0    3e-06
gb|ADA80274.1|  Flagellar hook-length control protein fliK [St...  56.6    3e-06
gb|EEY56322.1|  adhesin protein, putative [Phytophthora infest...  56.6    3e-06
ref|ZP_04633986.1|  Leucyl aminopeptidase [Yersinia frederikse...  56.6    3e-06
ref|ZP_04430035.1|  hemagluttinin repeat protein [Planctomyces...  56.6    3e-06
ref|YP_002255988.1|  hemagglutinin-related autotransporter pro...  56.6    3e-06
ref|YP_002277221.1|  outer membrane autotransporter barrel dom...  56.6    3e-06
ref|YP_001603737.1|  putative autotransporter protein [Glucona...  56.6    3e-06
ref|YP_001458201.1|  autotransporter (AT) family porin [Escher...  56.6    3e-06
ref|ZP_01851894.1|  VCBS [Planctomyces maris DSM 8797] >gb|EDL...  56.6    3e-06
ref|YP_001290870.1|  HMW2A, high molecular weight adhesin 2 [H...  56.6    3e-06
ref|ZP_04947089.1|  Large exoprotein involved in heme utilizat...  56.6    3e-06
ref|YP_758001.1|  outer membrane autotransporter [Maricaulis m...  56.6    3e-06
ref|YP_572118.1|  putative hemagglutinin/hemolysin-related pro...  56.6    3e-06
ref|YP_001279954.1|  hypothetical protein PsycPRwf_1054 [Psych...  56.6    3e-06
gb|AAF36548.1|AF193880_1  82-kDa surface lipoprotein precursor...  56.6    3e-06
gb|AAA20524.1|  adhesin [Haemophilus influenzae]                   56.6    3e-06
ref|NP_103401.1|  serine proteinase [Mesorhizobium loti MAFF30...  56.6    3e-06
ref|NP_521309.1|  putative hemagglutinin-related protein [Rals...  56.6    3e-06
ref|ZP_01078491.1|  Autotransporter adhesin [Marinomonas sp. M...  56.6    3e-06
ref|ZP_01063545.1|  hypothetical protein MED222_04835 [Vibrio ...  56.6    3e-06
ref|ZP_00988886.1|  hypothetical protein V12B01_12555 [Vibrio ...  56.6    3e-06
ref|ZP_03001679.1|  EntS/YbdA MFS transporter [Escherichia col...  56.6    3e-06
ref|YP_001864885.1|  filamentous haemagglutinin outer membrane...  56.6    3e-06
ref|YP_721957.1|  hypothetical protein Tery_2255 [Trichodesmiu...  56.6    3e-06
ref|XP_956146.1|  hypothetical protein NCU04373 [Neurospora cr...  56.6    3e-06
ref|ZP_06625140.1|  LPXTG-motif cell wall anchor domain protei...  56.2    4e-06
emb|CBA31110.1|  hypothetical protein [Curvibacter putative sy...  56.2    4e-06
ref|ZP_03582326.1|  YadA C- domain protein [Burkholderia multi...  56.2    4e-06
ref|ZP_03583873.1|  filamentous hemagglutinin [Burkholderia mu...  56.2    4e-06
ref|ZP_03130708.1|  autotransporter-associated beta strand rep...  56.2    4e-06
ref|ZP_05025718.1|  hypothetical protein MC7420_5332 [Microcol...  56.2    4e-06
ref|XP_002098701.1|  GE10512 [Drosophila yakuba] >gb|EDW98413....  56.2    4e-06
ref|XP_002048593.1|  GJ11268 [Drosophila virilis] >gb|EDW70935...  56.2    4e-06
ref|YP_001997940.1|  Haemagluttinin domain protein [Chlorobacu...  56.2    4e-06
ref|ZP_01977497.1|  von Willebrand factor, type A [Vibrio chol...  56.2    4e-06
ref|ZP_01978841.1|  Gp5 C- repeat (3 copies) family [Vibrio ch...  56.2    4e-06
gb|ABQ76136.1|  probable cell surface adhesin [uncultured halo...  56.2    4e-06
ref|ZP_01745428.1|  outer membrane autotransporter barrel [Sag...  56.2    4e-06
ref|ZP_01748756.1|  outer membrane autotransporter barrel [Sag...  56.2    4e-06
ref|ZP_01748621.1|  outer membrane autotransporter barrel [Sag...  56.2    4e-06
ref|YP_001059062.1|  putative outer membrane protein [Burkhold...  56.2    4e-06
ref|ZP_01619412.1|  hypothetical protein L8106_15695 [Lyngbya ...  56.2    4e-06
ref|YP_001047125.1|  Ig domain-containing protein [Methanocull...  56.2    4e-06
ref|ZP_01385307.1|  Outer membrane autotransporter barrel [Chl...  56.2    4e-06
gb|AAY28517.1|  biofilm-associated protein [Staphylococcus xyl...  56.2    4e-06
gb|AAT27426.1|  HmwA [Haemophilus influenzae]                      56.2    4e-06
ref|NP_105314.1|  hypothetical protein mll4444 [Mesorhizobium ...  56.2    4e-06
ref|ZP_01125333.1|  Large exoprotein involved in heme utilizat...  56.2    4e-06
ref|YP_809209.1|  hypothetical protein LACR_1259 [Lactococcus ...  56.2    4e-06
ref|YP_001116438.1|  filamentous haemagglutinin outer membrane...  56.2    4e-06
ref|ZP_06465284.1|  hypothetical protein BC1003DRAFT_1505 [Bur...  55.8    6e-06
ref|ZP_06229558.1|  Uncharacterized protein with a C-terminal ...  55.8    6e-06
ref|ZP_05875194.1|  outer membrane autotransporter [Brucella a...  55.8    6e-06
ref|ZP_05820179.1|  outer membrane transporter [Brucella abort...  55.8    6e-06
ref|ZP_05923514.1|  gram-positive cocci surface protein [Enter...  55.8    6e-06
ref|ZP_05591403.1|  serine protease [Burkholderia thailandensi...  55.8    6e-06
ref|ZP_05190428.1|  outermembrane transporter [Brucella abortu...  55.8    6e-06
ref|ZP_05160231.1|  outermembrane transporter [Brucella abortu...  55.8    6e-06
ref|XP_002546966.1|  predicted protein [Candida tropicalis MYA...  55.8    6e-06
ref|ZP_04595674.1|  autotransporter-associated beta strand rep...  55.8    6e-06
ref|ZP_03786675.1|  outer membrane autotransporter barrel doma...  55.8    6e-06
ref|YP_002395437.1|  hypothetical protein VS_II0855 [Vibrio sp...  55.8    6e-06
ref|YP_002252897.1|  hemagglutinin-related protein [Ralstonia ...  55.8    6e-06
ref|YP_002158801.1|  iron-regulated protein FrpC [Vibrio fisch...  55.8    6e-06
ref|ZP_03132320.1|  autotransporter-associated beta strand rep...  55.8    6e-06
ref|YP_002027242.1|  outer membrane autotransporter barrel dom...  55.8    6e-06
ref|XP_001955051.1|  GF16440 [Drosophila ananassae] >gb|EDV436...  55.8    6e-06
ref|YP_001932123.1|  outermembrane transporter [Brucella abort...  55.8    6e-06
ref|YP_001807094.1|  filamentous haemagglutinin outer membrane...  55.8    6e-06
ref|ZP_02684108.1|  VCBS repeat-containing protein [Salmonella...  55.8    6e-06
ref|YP_001524420.1|  putative serine protease [Azorhizobium ca...  55.8    6e-06
gb|ACX39892.1|  conserved hypothetical protein [Escherichia co...  55.8    6e-06
ref|YP_371516.1|  outer membrane autotransporter barrel [Burkh...  55.8    6e-06
ref|YP_222973.1|  outermembrane transporter [Brucella abortus ...  55.8    6e-06
ref|YP_151729.1|  large repetitive protein [Salmonella enteric...  55.8    6e-06
ref|NP_742967.1|  surface adhesion protein, putative [Pseudomo...  55.8    6e-06
ref|YP_123109.1|  hypothetical protein lpp0779 [Legionella pne...  55.8    6e-06
emb|CAA61860.1|  AOF1001 [Saccharomyces cerevisiae]                55.8    6e-06
ref|NP_519896.1|  hemagglutinin-related protein [Ralstonia sol...  55.8    6e-06
sp|P33666.2|YDBA_ECOLI  RecName: Full=Putative uncharacterized...  55.8    6e-06
pir||C48399  ABC-type transport protein ydbA.2 - Escherichia c...  55.8    6e-06
ref|ZP_01004603.1|  hypothetical protein SKA53_00375 [Loktanel...  55.8    6e-06
ref|ZP_00943049.1|  Hypothetical Protein RRSL_04357 [Ralstonia...  55.8    6e-06
ref|YP_776459.1|  hemagluttinin domain-containing protein [Bur...  55.8    6e-06
ref|ZP_06121231.1|  Hemolysin-type calcium-binding region [Cau...  55.5    8e-06
ref|ZP_04861025.1|  LOW QUALITY PROTEIN: conserved hypothetica...  55.5    8e-06
ref|YP_003018619.1|  von Willebrand factor type A [Pectobacter...  55.5    8e-06
ref|YP_003002761.1|  Ig family protein [Dickeya zeae Ech1591] ...  55.5    8e-06
ref|ZP_04631240.1|  Type V secretory pathway, adhesin AidA [Ye...  55.5    8e-06
ref|ZP_04465870.1|  HMW1A, high molecular weight adhesin 1 [Ha...  55.5    8e-06
ref|XP_002136422.1|  GA22193 [Drosophila pseudoobscura pseudoo...  55.5    8e-06
ref|YP_002143963.1|  putative surface-exposed virulence protei...  55.5    8e-06
ref|XP_002023996.1|  GL22810 [Drosophila persimilis] >gb|EDW29...  55.5    8e-06
ref|XP_001746730.1|  hypothetical protein [Monosiga brevicolli...  55.5    8e-06
ref|YP_001589589.1|  hypothetical protein SPAB_03407 [Salmonel...  55.5    8e-06
ref|YP_001525801.1|  outer membrane autotransporter barrel pro...  55.5    8e-06
ref|ZP_01898501.1|  probable aggregation factor core protein M...  55.5    8e-06
emb|CAM76579.1|  Autotransporter adhesin [Magnetospirillum gry...  55.5    8e-06
ref|ZP_04916557.1|  RTX protein [Vibrio cholerae RC385] >gb|ED...  55.5    8e-06
ref|XP_952825.1|  hypothetical protein [Theileria annulata str...  55.5    8e-06
ref|YP_065840.1|  hypothetical protein DP2104 [Desulfotalea ps...  55.5    8e-06
ref|ZP_01058341.1|  hypothetical protein MED193_12118 [Roseoba...  55.5    8e-06
ref|ZP_00950718.1|  probable extracellular nuclease [Croceibac...  55.5    8e-06
ref|ZP_00958302.1|  putative RTX family exoprotein [Roseovariu...  55.5    8e-06
ref|XP_717775.1|  hypothetical protein CaO19.5401 [Candida alb...  55.5    8e-06
ref|ZP_00604835.1|  Surface protein from Gram-positive cocci, ...  55.5    8e-06
gb|ACX47353.1|  UpaH [Escherichia coli]                            55.1    1e-05
ref|ZP_05967614.2|  exoprotein, RTX family [Enterobacter cance...  55.1    1e-05
ref|YP_003370899.1|  flagellin domain protein [Pirellula stale...  55.1    1e-05
ref|YP_003513183.1|  hypothetical protein Snas_4445 [Stackebra...  55.1    1e-05
ref|ZP_03834028.1|  putative hemagglutinin/hemolysin-related p...  55.1    1e-05
ref|XP_002156876.1|  PREDICTED: hypothetical protein, partial ...  55.1    1e-05
ref|YP_002395438.1|  hypothetical protein VS_II0856 [Vibrio sp...  55.1    1e-05
ref|ZP_03345598.1|  large repetitive protein [Salmonella enter...  55.1    1e-05
ref|ZP_05026641.1|  filamentous haemagglutinin family N-termin...  55.1    1e-05
ref|YP_001820837.1|  hypothetical protein Oter_3963 [Opitutus ...  55.1    1e-05
ref|ZP_03220539.1|  VCBS repeat-containing protein [Salmonella...  55.1    1e-05
ref|YP_002147625.1|  VCBS repeat-containing protein [Salmonell...  55.1    1e-05
ref|ZP_02664520.1|  VCBS repeat-containing protein [Salmonella...  55.1    1e-05
gb|ABZ10813.1|  putative mannoprotein precursor [Saccharomyces...  55.1    1e-05
ref|ZP_02141632.1|  VCBS repeat domain protein [Roseobacter li...  55.1    1e-05
ref|YP_001341907.1|  filamentous haemagglutinin outer membrane...  55.1    1e-05
ref|ZP_01898857.1|  iron-regulated protein FrpC [Moritella sp....  55.1    1e-05
ref|XP_369035.2|  hypothetical protein MGG_00209 [Magnaporthe ...  55.1    1e-05
ref|XP_001422108.1|  predicted protein [Ostreococcus lucimarin...  55.1    1e-05
ref|ZP_01747887.1|  hypothetical protein SSE37_14584 [Sagittul...  55.1    1e-05
gb|EAZ36212.1|  hypothetical protein OsJ_20530 [Oryza sativa J...  55.1    1e-05
ref|XP_001308251.1|  flocculin [Trichomonas vaginalis G3] >gb|...  55.1    1e-05
ref|YP_001774086.1|  YadA domain-containing protein [Burkholde...  55.1    1e-05
ref|NP_001057100.1|  Os06g0207500 [Oryza sativa (japonica cult...  55.1    1e-05
ref|ZP_01306795.1|  Flagellar capping protein [Oceanobacter sp...  55.1    1e-05
ref|ZP_01201706.1|  conserved hypothetical protein [Flavobacte...  55.1    1e-05
ref|XP_460364.1|  hypothetical protein DEHA0E25971g [Debaryomy...  55.1    1e-05
ref|NP_149724.1|  261R [Invertebrate iridescent virus 6] >sp|Q...  55.1    1e-05
gb|AAC47744.1|  histidine secretory acid phosphatase [Leishman...  55.1    1e-05
ref|YP_333623.1|  Hep_Hag family protein [Burkholderia pseudom...  55.1    1e-05
dbj|BAD35858.1|  lustrin A-like [Oryza sativa Japonica Group] ...  55.1    1e-05
ref|YP_003536097.1|  cell surface glycoprotein [Haloferax volc...  55.1    1e-05
ref|ZP_01013232.1|  hypothetical protein RB2654_10713 [Rhodoba...  55.1    1e-05
ref|YP_001266177.1|  glycoprotein [Pseudomonas putida F1] >gb|...  55.1    1e-05
gb|EFA85385.1|  hypothetical protein PPL_02388 [Polysphondyliu...  54.7    1e-05
gb|EFA84423.1|  hypothetical protein PPL_02455 [Polysphondyliu...  54.7    1e-05
gb|EFA83172.1|  hypothetical protein PPL_03962 [Polysphondyliu...  54.7    1e-05
emb|CBG25709.1|  large repetitive protein [Salmonella enterica...  54.7    1e-05
ref|ZP_05551480.1|  AT family autotransporter [Fusobacterium s...  54.7    1e-05
ref|ZP_05181713.1|  outer membrane autotransporter [Brucella s...  54.7    1e-05
ref|ZP_04653368.1|  VCBS repeat-containing protein [Salmonella...  54.7    1e-05
ref|ZP_04062338.1|  gram positive anchor domain protein [Strep...  54.7    1e-05
ref|ZP_06533908.1|  large repetitive protein [Salmonella enter...  54.7    1e-05
ref|XP_002162557.1|  PREDICTED: similar to predicted protein [...  54.7    1e-05
ref|ZP_03373741.1|  large repetitive protein [Salmonella enter...  54.7    1e-05
ref|YP_002244687.1|  large repetitive protein [Salmonella ente...  54.7    1e-05
ref|YP_002257120.1|  hemagglutinin-related protein [Ralstonia ...  54.7    1e-05
ref|YP_002130037.1|  Hemolysin-type calcium-binding region [Ph...  54.7    1e-05
ref|XP_002088786.1|  GE18759 [Drosophila yakuba] >gb|EDW88498....  54.7    1e-05
ref|XP_002066735.1|  GK24403 [Drosophila willistoni] >gb|EDW77...  54.7    1e-05
ref|YP_003241479.1|  S-layer domain protein [Geobacillus sp. Y...  54.7    1e-05
ref|YP_001925065.1|  GLUG domain protein [Methylobacterium pop...  54.7    1e-05
ref|ZP_02736367.1|  hypothetical protein GobsU_31439 [Gemmata ...  54.7    1e-05
ref|ZP_02573570.1|  VCBS repeat-containing protein [Salmonella...  54.7    1e-05
ref|YP_001666437.1|  hypothetical protein PputGB1_0186 [Pseudo...  54.7    1e-05
ref|YP_001639875.1|  hypothetical protein Mext_2409 [Methyloba...  54.7    1e-05
gb|ABX46037.1|  biofilm associated protein A [Salmonella enter...  54.7    1e-05
ref|ZP_01767991.1|  protein YbcL [Burkholderia pseudomallei 30...  54.7    1e-05
ref|ZP_01763937.1|  Hep_Hag family [Burkholderia pseudomallei ...  54.7    1e-05
ref|YP_001414862.1|  outer membrane autotransporter [Parvibacu...  54.7    1e-05
ref|YP_001860454.1|  filamentous haemagglutinin outer membrane...  54.7    1e-05
ref|YP_838849.1|  hemagluttinin domain-containing protein [Bur...  54.7    1e-05
ref|ZP_01449617.1|  calcium binding hemolysin protein, putativ...  54.7    1e-05
ref|NP_518236.1|  putative hemagglutinin-related protein [Rals...  54.7    1e-05
ref|NP_771354.1|  hypothetical protein blr4714 [Bradyrhizobium...  54.7    1e-05
ref|NP_457158.1|  large repetitive protein [Salmonella enteric...  54.7    1e-05
ref|ZP_00942782.1|  Hemolysin [Ralstonia solanacearum UW551] >...  54.7    1e-05
ref|YP_623013.1|  hemagluttinin motif-containing protein [Burk...  54.7    1e-05
ref|ZP_05568133.1|  cell wall surface anchor family protein [E...  54.3    2e-05
ref|YP_003307788.1|  outer membrane autotransporter barrel dom...  54.3    2e-05
ref|ZP_04574338.1|  outer membrane protein [Fusobacterium sp. ...  54.3    2e-05
ref|ZP_05113823.1|  type I secretion target GGXGXDXXX repeat p...  54.3    2e-05
ref|YP_002540967.1|  outer membrane pathogenesis protein [Agro...  54.3    2e-05
ref|ZP_03587694.1|  outer membrane autotransporter barrel doma...  54.3    2e-05
ref|YP_002293540.1|  putative adhesin [Escherichia coli SE11] ...  54.3    2e-05
ref|ZP_03127993.1|  autotransporter-associated beta strand rep...  54.3    2e-05
ref|YP_001463325.1|  putative invasin [Escherichia coli E24377...  54.3    2e-05
ref|ZP_02043819.1|  hypothetical protein ACTODO_00671 [Actinom...  54.3    2e-05
ref|ZP_01788721.1|  HMW2A, high molecular weight adhesin 2 [Ha...  54.3    2e-05
ref|ZP_01225313.1|  OmpA-like transmembrane domain protein [ma...  54.3    2e-05
ref|NP_487386.1|  hypothetical protein all3346 [Nostoc sp. PCC...  54.3    2e-05
ref|YP_840446.1|  YadA domain-containing protein [Burkholderia...  54.3    2e-05
ref|XP_382228.1|  hypothetical protein FG02052.1 [Gibberella z...  54.3    2e-05
ref|ZP_06295927.1|  filamentous hemagglutinin family outer mem...  53.9    2e-05
ref|ZP_06057948.1|  LOW QUALITY PROTEIN: cell-surface adhesin ...  53.9    2e-05
dbj|BAI44118.1|  chitin binding protein 4 [Magnaporthe oryzae]     53.9    2e-05
ref|YP_003113492.1|  Ricin B lectin [Catenulispora acidiphila ...  53.9    2e-05
ref|ZP_05318012.1|  Hep_Hag family protein [Neisseria sicca AT...  53.9    2e-05
ref|ZP_05314563.1|  hypothetical protein NAL212DRAFT_0735 [Nit...  53.9    2e-05
ref|YP_003009652.1|  cell wall/surface repeat protein [Paeniba...  53.9    2e-05
ref|YP_002963471.1|  hypothetical protein MexAM1_META1p2412 [M...  53.9    2e-05
ref|ZP_04715501.1|  hypothetical protein AmacA2_10885 [Alterom...  53.9    2e-05
ref|ZP_06544254.1|  putative surface-exposed virulence protein...  53.9    2e-05
ref|ZP_06536558.1|  putative surface-exposed virulence protein...  53.9    2e-05
ref|YP_002731204.1|  hemagglutination activity domain protein ...  53.9    2e-05
ref|YP_002431456.1|  Hemagluttinin repeat-containing protein [...  53.9    2e-05
ref|ZP_03377252.1|  putative surface-exposed virulence protein...  53.9    2e-05
ref|ZP_03365273.1|  large repetitive protein [Salmonella enter...  53.9    2e-05
ref|ZP_03364961.1|  putative surface-exposed virulence protein...  53.9    2e-05
ref|ZP_03359502.1|  putative surface-exposed virulence protein...  53.9    2e-05
ref|ZP_03346259.1|  putative surface-exposed virulence protein...  53.9    2e-05
ref|XP_002431788.1|  papilin, putative [Pediculus humanus corp...  53.9    2e-05
ref|XP_001358426.2|  GA17283 [Drosophila pseudoobscura pseudoo...  53.9    2e-05
ref|YP_002153626.1|  putative haemagglutinin-related autotrans...  53.9    2e-05
ref|XP_002075754.1|  GK12364 [Drosophila willistoni] >gb|EDW86...  53.9    2e-05
ref|ZP_03032525.1|  EntS/YbdA MFS transporter [Escherichia col...  53.9    2e-05
ref|YP_001940725.1|  Large exoprotein involved in heme utiliza...  53.9    2e-05
ref|YP_001896988.1|  YadA domain protein [Burkholderia phytofi...  53.9    2e-05
ref|ZP_02146514.1|  Hemolysin-type calcium-binding region [Pha...  53.9    2e-05
ref|ZP_02161238.1|  probable extracellular nuclease [Kordia al...  53.9    2e-05
ref|YP_001474531.1|  fibronectin type III domain-containing pr...  53.9    2e-05
ref|ZP_01255577.1|  putative adhesion lipoprotein [Psychroflex...  53.9    2e-05
ref|ZP_01890233.1|  probable extracellular nuclease [unidentif...  53.9    2e-05
ref|ZP_04970754.1|  AT family autotransporter [Fusobacterium n...  53.9    2e-05
ref|ZP_01748755.1|  possible serine protease/outer membrane au...  53.9    2e-05
ref|YP_001667091.1|  hypothetical protein PputGB1_0845 [Pseudo...  53.9    2e-05
emb|CAL58530.1|  Haemagluttinin motif:Hep_Hag (ISS) [Ostreococ...  53.9    2e-05
ref|YP_791365.1|  hypothetical protein PA14_40260 [Pseudomonas...  53.9    2e-05
ref|YP_560399.1|  putative membrane-anchored cell surface prot...  53.9    2e-05
gb|AAD34846.1|AF139831_1  proline/threonine-rich protein [Salm...  53.9    2e-05
ref|YP_367062.1|  YadA/haemagluttinin like protein [Burkholder...  53.9    2e-05
ref|NP_951340.1|  cadherin domain/calx-beta domain-containing ...  53.9    2e-05
gb|AAD29677.1|AF133185_1  BigA [Salmonella enterica subsp. ent...  53.9    2e-05
ref|NP_602353.1|  hypothetical protein FN1526 [Fusobacterium n...  53.9    2e-05
ref|NP_870154.1|  aggregation factor core protein MAFp3, isofo...  53.9    2e-05
ref|NP_774564.1|  hypothetical protein bll7924 [Bradyrhizobium...  53.9    2e-05
ref|YP_001229544.1|  hypothetical protein Gura_0762 [Geobacter...  53.9    2e-05
ref|ZP_01090946.1|  hypothetical protein DSM3645_11227 [Blasto...  53.9    2e-05
ref|ZP_00952559.1|  Glycosyl hydrolase, BNR repeat [Oceanicaul...  53.9    2e-05
ref|YP_002799917.1|  Flagellin [Azotobacter vinelandii DJ] >gb...  53.9    2e-05
ref|ZP_06487552.1|  putative filamentous hemagglutinin-like pr...  53.5    3e-05
ref|ZP_06226942.1|  hypothetical protein BC1002DRAFT_3518 [Bur...  53.5    3e-05
ref|ZP_05788423.1|  poly(beta-D-mannuronate) C5 epimerase 3 [S...  53.5    3e-05
emb|CBA31990.1|  hypothetical protein [Curvibacter putative sy...  53.5    3e-05
ref|YP_003136084.1|  filamentous haemagglutinin family outer m...  53.5    3e-05
ref|XP_002457976.1|  hypothetical protein SORBIDRAFT_03g024470...  53.5    3e-05
ref|XP_002495805.1|  ZYRO0C03432p [Zygosaccharomyces rouxii] >...  53.5    3e-05
ref|YP_002907635.1|  YadA C-terminal domain protein [Burkholde...  53.5    3e-05
ref|YP_002895843.1|  cell surface protein [Burkholderia pseudo...  53.5    3e-05
ref|ZP_03967408.1|  possible filamentous hemagglutinin outer m...  53.5    3e-05
ref|ZP_05969952.1|  large repetitive protein [Enterobacter can...  53.5    3e-05
ref|ZP_03573352.1|  YadA C- domain protein [Burkholderia multi...  53.5    3e-05
ref|YP_002413033.1|  putative invasin [Escherichia coli UMN026...  53.5    3e-05
ref|ZP_05040212.1|  haemagglutination activity domain protein ...  53.5    3e-05
ref|XP_001918934.1|  PREDICTED: similar to predicted protein [...  53.5    3e-05
ref|YP_002370541.1|  filamentous hemagglutinin family outer me...  53.5    3e-05
ref|YP_001743213.1|  putative invasin [Escherichia coli SMS-3-...  53.5    3e-05
ref|ZP_02886528.1|  Haemagluttinin domain protein [Burkholderi...  53.5    3e-05
ref|YP_001778668.1|  hemagluttinin domain-containing protein [...  53.5    3e-05
ref|YP_001718911.1|  hypothetical protein YPK_0145 [Yersinia p...  53.5    3e-05
ref|ZP_03219707.1|  autotransporter beta-domain protein [Salmo...  53.5    3e-05
ref|YP_001770167.1|  outer membrane adhesin like proteiin [Met...  53.5    3e-05
ref|ZP_02376969.1|  outer membrane autotransporter barrel doma...  53.5    3e-05
ref|YP_001656254.1|  putative peptidase [Microcystis aeruginos...  53.5    3e-05
ref|YP_001526149.1|  putative outer membrane autotransporter b...  53.5    3e-05
ref|XP_001550327.1|  hypothetical protein BC1G_11535 [Botryoti...  53.5    3e-05
ref|ZP_01902639.1|  hypothetical protein RAZWK3B_18928 [Roseob...  53.5    3e-05
ref|ZP_01985302.1|  conserved hypothetical protein [Vibrio har...  53.5    3e-05
ref|YP_001583685.1|  hemolysin-type calcium-binding region [Bu...  53.5    3e-05
ref|YP_001861848.1|  filamentous haemagglutinin outer membrane...  53.5    3e-05
ref|ZP_01441112.1|  Parallel beta-helix repeat [Roseovarius sp...  53.5    3e-05
ref|NP_048362.1|  hypothetical protein [Paramecium bursaria Ch...  53.5    3e-05
gb|AAD56660.1|AF180944_1  HmwA [Haemophilus influenzae]            53.5    3e-05
gb|AAF69025.1|  outer membrane-like protein [Pseudomonas putida]   53.5    3e-05
ref|NP_588031.3|  sequence orphan [Schizosaccharomyces pombe] ...  53.5    3e-05
ref|XP_501711.1|  YALI0C11165p [Yarrowia lipolytica] >emb|CAG8...  53.5    3e-05
emb|CAC34385.1|  scaB cellulosomal scaffoldin protein precurso...  53.5    3e-05
ref|ZP_01004590.1|  S-layer protein RsaA [Loktanella vestfolde...  53.5    3e-05
ref|ZP_01312628.1|  flagellin-like [Desulfuromonas acetoxidans...  53.5    3e-05
ref|XP_384749.1|  hypothetical protein FG04573.1 [Gibberella z...  53.5    3e-05
ref|ZP_06662776.1|  yeeJ protein [Escherichia coli B088] >gb|E...  53.1    4e-05
ref|ZP_06489989.1|  putative filamentous hemagglutinin-like pr...  53.1    4e-05
gb|EFA85100.1|  hypothetical protein PPL_02097 [Polysphondyliu...  53.1    4e-05
ref|ZP_06231123.1|  Hemolysin-type calcium-binding protein [De...  53.1    4e-05
ref|YP_002387467.1|  adhesin [Escherichia coli IAI1] >emb|CAQ9...  53.1    4e-05
ref|ZP_05084791.1|  serine proteinase, putative [Pseudovibrio ...  53.1    4e-05
ref|ZP_03266013.1|  outer membrane autotransporter barrel doma...  53.1    4e-05
ref|ZP_02381193.1|  hypothetical protein BuboB_25965 [Burkhold...  53.1    4e-05
ref|YP_001290532.1|  HMW1A, high molecular weight adhesin 1 [H...  53.1    4e-05
ref|YP_001221637.1|  hypothetical protein CMM_0897 [Clavibacte...  53.1    4e-05
ref|YP_001032441.1|  cell wall surface anchor family protein [...  53.1    4e-05
ref|YP_558167.1|  polymorphic membrane protein, filamentous ha...  53.1    4e-05
ref|ZP_01260106.1|  fibronectin type III domain protein [Vibri...  53.1    4e-05
ref|YP_502900.1|  PT repeat-containing protein [Methanospirill...  53.1    4e-05
ref|YP_207120.1|  RTX repeat-containing calcium-binding cytoto...  53.1    4e-05
emb|CAI77661.1|  adhesin [Haemophilus influenzae]                  53.1    4e-05
ref|YP_064252.1|  hypothetical protein DP0516 [Desulfotalea ps...  53.1    4e-05
ref|NP_866060.1|  extracellular nuclease [Rhodopirellula balti...  53.1    4e-05
ref|NP_523098.1|  hemagglutinin-related protein [Ralstonia sol...  53.1    4e-05
sp|P50401.1|GUXA_CELFI  RecName: Full=Exoglucanase A; AltName:...  53.1    4e-05
ref|YP_003193694.1|  probable aggregation factor core protein ...  53.1    4e-05
ref|YP_776250.1|  outer membrane autotransporter [Burkholderia...  53.1    4e-05
ref|YP_001117138.1|  outer membrane autotransporter [Burkholde...  53.1    4e-05
ref|YP_677947.1|  endoglucanase-like protein [Cytophaga hutchi...  53.1    4e-05
emb|CBK74256.1|  hypothetical protein [Butyrivibrio fibrisolve...  52.8    5e-05
ref|ZP_06356497.1|  conserved hypothetical protein [Citrobacte...  52.8    5e-05
ref|ZP_05815367.1|  AT family autotransporter [Fusobacterium s...  52.8    5e-05
ref|ZP_05730209.1|  putative hemagglutinin/hemolysin-related p...  52.8    5e-05
ref|ZP_04465871.1|  HMW1A, high molecular weight adhesin 1 [Ha...  52.8    5e-05
ref|ZP_03826552.1|  putative hemagglutinin/hemolysin-related p...  52.8    5e-05
ref|YP_002441035.1|  hypothetical protein PLES_34491 [Pseudomo...  52.8    5e-05
ref|YP_002299977.1|  putative Ig domain proteni [Rhodospirillu...  52.8    5e-05
ref|ZP_05051456.1|  Gp5 C-terminal domain repeat protein famil...  52.8    5e-05
ref|XP_002135105.1|  GA23868 [Drosophila pseudoobscura pseudoo...  52.8    5e-05
ref|ZP_05035754.1|  hypothetical protein S7335_2186 [Synechoco...  52.8    5e-05
ref|XP_002068286.1|  GK25518 [Drosophila willistoni] >gb|EDW79...  52.8    5e-05
ref|YP_001818606.1|  beta strand repeat-containing protein [Op...  52.8    5e-05
ref|ZP_02929561.1|  outer membrane autotransporter barrel doma...  52.8    5e-05
ref|ZP_02929491.1|  outer membrane autotransporter barrel doma...  52.8    5e-05
ref|ZP_02698053.1|  conserved hypothetical protein [Salmonella...  52.8    5e-05
ref|ZP_02669646.1|  VCBS repeat-containing protein [Salmonella...  52.8    5e-05
ref|ZP_02659272.1|  VCBS repeat-containing protein [Salmonella...  52.8    5e-05
ref|ZP_02654742.1|  ShdA [Salmonella enterica subsp. enterica ...  52.8    5e-05
ref|ZP_02511815.1|  polymorphic membrane protein, Filamentous ...  52.8    5e-05
ref|YP_001903729.1|  Putative filamentous hemagglutinin-relate...  52.8    5e-05
ref|YP_001525551.1|  hypothetical protein AZC_2635 [Azorhizobi...  52.8    5e-05
ref|YP_001402955.1|  putative invasin [Yersinia pseudotubercul...  52.8    5e-05
ref|ZP_01884202.1|  hypothetical protein PBAL39_25275 [Pedobac...  52.8    5e-05
ref|ZP_01717644.1|  hypothetical protein ALPR1_10680 [Algoriph...  52.8    5e-05
ref|YP_001583498.1|  hemagluttinin domain-containing protein [...  52.8    5e-05
ref|XP_001203871.1|  PREDICTED: hypothetical protein, partial ...  52.8    5e-05
ref|ZP_01386642.1|  hypothetical protein CferDRAFT_0595 [Chlor...  52.8    5e-05
ref|YP_434381.1|  RTX toxins and related Ca2+-binding protein ...  52.8    5e-05
ref|YP_257280.1|  calcium-binding outer membrane-like protein ...  52.8    5e-05
ref|YP_217677.1|  large repetitive protein [Salmonella enteric...  52.8    5e-05
ref|NP_250565.1|  hypothetical protein PA1874 [Pseudomonas aer...  52.8    5e-05
ref|NP_637389.1|  YapH protein [Xanthomonas campestris pv. cam...  52.8    5e-05
ref|NP_267481.1|  hypothetical protein L159364 [Lactococcus la...  52.8    5e-05
ref|XP_953626.1|  inositol phosphatase [Theileria annulata] >e...  52.8    5e-05
ref|YP_066910.1|  hypothetical protein DPPB56 [Desulfotalea ps...  52.8    5e-05
ref|NP_522741.1|  putative hemagglutinin/hemolysin-related pro...  52.8    5e-05
ref|ZP_01167669.1|  type I secretion target repeat protein [Oc...  52.8    5e-05
ref|YP_003196627.1|  hypothetical protein RB2501_02025 [Robigi...  52.8    5e-05
ref|XP_641455.1|  hypothetical protein DDB_G0279995 [Dictyoste...  52.8    5e-05
ref|YP_001865257.1|  filamentous haemagglutinin outer membrane...  52.8    5e-05
ref|ZP_05970979.2|  exoprotein, RTX family [Enterobacter cance...  52.4    6e-05
ref|ZP_06182281.1|  conserved hypothetical protein [Vibrio alg...  52.4    6e-05
ref|ZP_05790280.1|  endoglucanase [Synechococcus sp. WH 8109] ...  52.4    6e-05
emb|CBA28114.1|  hypothetical protein [Curvibacter putative sy...  52.4    6e-05
ref|ZP_05650921.1|  beta-1,4-N-acetylmuramoylhydrolase [Entero...  52.4    6e-05
ref|ZP_05161098.1|  outer membrane autotransporter [Brucella s...  52.4    6e-05
ref|YP_003045088.1|  adhesin [Escherichia coli B str. REL606] ...  52.4    6e-05
emb|CAQ32398.1|  yeeJ [Escherichia coli BL21(DE3)]                 52.4    6e-05
ref|YP_003073219.1|  hemagglutination activity domain protein ...  52.4    6e-05
ref|YP_003035906.1|  Ig domain protein group 1 domain protein ...  52.4    6e-05
ref|ZP_04624092.1|  hypothetical protein ykris0001_40660 [Yers...  52.4    6e-05
ref|ZP_04625921.1|  hypothetical protein ykris0001_38190 [Yers...  52.4    6e-05
ref|YP_002908456.1|  Adhesin HecA family protein [Burkholderia...  52.4    6e-05
ref|ZP_04573109.1|  LOW QUALITY PROTEIN: outer membrane protei...  52.4    6e-05
ref|YP_003124727.1|  conserved repeat domain protein [Chitinop...  52.4    6e-05
ref|YP_002796906.1|  Na-Ca exchanger/integrin-beta4 [Laribacte...  52.4    6e-05
ref|XP_002155114.1|  PREDICTED: similar to mucin 2, partial [H...  52.4    6e-05
ref|YP_002431457.1|  filamentous hemagglutinin family outer me...  52.4    6e-05
ref|YP_002398221.1|  adhesin [Escherichia coli ED1a] >emb|CAR0...  52.4    6e-05
ref|ZP_03450969.1|  hemagglutination activity domain protein [...  52.4    6e-05
ref|YP_002234769.1|  putative outer membrane autotransporter [...  52.4    6e-05
ref|ZP_03131310.1|  outer membrane autotransporter barrel doma...  52.4    6e-05
ref|ZP_05042311.1|  type I secretion target GGXGXDXXX repeat p...  52.4    6e-05
ref|ZP_02889384.1|  filamentous haemagglutinin family outer me...  52.4    6e-05
ref|YP_001707578.1|  cell-surface adhesin [Acinetobacter bauma...  52.4    6e-05
ref|ZP_02772856.1|  large repetitive protein [Escherichia coli...  52.4    6e-05
ref|ZP_03215894.1|  VCBS repeat-containing protein [Salmonella...  52.4    6e-05
ref|ZP_02511898.1|  cell surface protein [Burkholderia pseudom...  52.4    6e-05
ref|ZP_02487782.1|  cell surface protein [Burkholderia pseudom...  52.4    6e-05
ref|ZP_04897245.1|  putative cell surface protein [Burkholderi...  52.4    6e-05
ref|XP_001635740.1|  predicted protein [Nematostella vectensis...  52.4    6e-05
ref|YP_001185098.1|  putative outer membrane adhesin like prot...  52.4    6e-05
ref|XP_001367678.1|  PREDICTED: hypothetical protein [Monodelp...  52.4    6e-05
ref|YP_001058172.1|  cell surface protein [Burkholderia pseudo...  52.4    6e-05
ref|YP_965090.1|  putative outer membrane adhesin like proteii...  52.4    6e-05
ref|YP_958105.1|  Ig domain-containing protein [Marinobacter a...  52.4    6e-05
ref|YP_001812021.1|  filamentous haemagglutinin outer membrane...  52.4    6e-05
ref|YP_732521.1|  putative outer membrane adhesin like protein...  52.4    6e-05
ref|YP_669320.1|  putative autotransporter/adhesin [Escherichi...  52.4    6e-05
ref|YP_681898.1|  VBCS repeat-containing protein [Roseobacter ...  52.4    6e-05
ref|YP_606687.1|  outer membrane autotransporter [Pseudomonas ...  52.4    6e-05
ref|YP_557393.1|  filamentous haemagglutinin [Burkholderia xen...  52.4    6e-05
gb|AAY28518.1|  biofilm-associated protein [Staphylococcus sim...  52.4    6e-05
gb|AAS77299.1|  adhesin [Haemophilus influenzae]                   52.4    6e-05
gb|AAS77300.1|  adhesin [Haemophilus influenzae]                   52.4    6e-05
ref|AP_002584.1|  adhesin [Escherichia coli str. K-12 substr. ...  52.4    6e-05
ref|YP_355178.1|  parallel beta-helix repeat-containing transc...  52.4    6e-05
ref|YP_332688.1|  adhesin/hemolysin [Burkholderia pseudomallei...  52.4    6e-05
ref|YP_299189.1|  Outer membrane autotransporter barrel [Ralst...  52.4    6e-05
ref|NP_286229.1|  hypothetical protein Z0609 [Escherichia coli...  52.4    6e-05
ref|NP_416485.4|  probable adhesin [Escherichia coli str. K-12...  52.4    6e-05
ref|YP_072270.1|  Ig-like domain-containing protein [Yersinia ...  52.4    6e-05
ref|NP_061407.1|  hypothetical protein Fpla030 [Plasmid F] >sp...  52.4    6e-05
ref|NP_770203.1|  hypothetical protein bll3563 [Bradyrhizobium...  52.4    6e-05
ref|NP_308568.1|  hypothetical protein ECs0541 [Escherichia co...  52.4    6e-05
ref|ZP_01155283.1|  hypothetical protein OG2516_05278 [Oceanic...  52.4    6e-05
ref|ZP_00952558.1|  fibronectin type III domain protein [Ocean...  52.4    6e-05
ref|ZP_02497137.1|  possible adhesin/hemolysin [Burkholderia p...  52.4    6e-05
ref|ZP_00055704.1|  COG2931: RTX toxins and related Ca2+-bindi...  52.4    6e-05
ref|ZP_00143270.1|  FUSOBACTERIUM OUTER MEMBRANE PROTEIN FAMIL...  52.4    6e-05
ref|YP_003533721.1|  GLUG motif domain protein [Haloferax volc...  52.0    8e-05
gb|EFA85523.1|  hypothetical protein PPL_01480 [Polysphondyliu...  52.0    8e-05
ref|ZP_05436780.1|  EntS/YbdA MFS transporter [Escherichia sp....  52.0    8e-05
ref|ZP_05431697.1|  adhesin [Shigella sp. D9]                      52.0    8e-05
ref|ZP_03913717.1|  conserved hypothetical protein [Leuconosto...  52.0    8e-05
ref|ZP_03829256.1|  putative hemagglutinin/hemolysin-related p...  52.0    8e-05
ref|ZP_05113842.1|  Cadherin domain protein [Labrenzia alexand...  52.0    8e-05
ref|YP_002559933.1|  hypothetical protein MCCL_0530 [Macrococc...  52.0    8e-05
ref|ZP_03514060.1|  putative hemolysin-type calcium-binding pr...  52.0    8e-05
ref|XP_002588163.1|  hypothetical protein BRAFLDRAFT_68801 [Br...  52.0    8e-05
ref|XP_001916050.1|  PREDICTED: similar to mucin 17 [Equus cab...  52.0    8e-05
ref|XP_001953393.1|  GF17743 [Drosophila ananassae] >gb|EDV419...  52.0    8e-05
ref|ZP_02906872.1|  outer membrane autotransporter barrel doma...  52.0    8e-05
ref|ZP_02735422.1|  putative cell surface protein [Gemmata obs...  52.0    8e-05
ref|ZP_02692679.1|  hypothetical protein Epulo_05993 [Epulopis...  52.0    8e-05
ref|YP_002041952.1|  large repetitive protein [Salmonella ente...  52.0    8e-05
ref|ZP_02489027.1|  cell surface protein [Burkholderia pseudom...  52.0    8e-05
ref|ZP_02347257.1|  VCBS repeat-containing protein [Salmonella...  52.0    8e-05
ref|ZP_03164566.1|  conserved hypothetical protein [Salmonella...  52.0    8e-05
ref|XP_001384668.2|  hypothetical protein PICST_31865 [Pichia ...  52.0    8e-05
ref|ZP_01852946.1|  VCBS [Planctomyces maris DSM 8797] >gb|EDL...  52.0    8e-05
ref|ZP_01736038.1|  Flagellin and related hook-associated prot...  52.0    8e-05
ref|ZP_01693516.1|  hypothetical protein M23134_06203 [Microsc...  52.0    8e-05
ref|ZP_01627077.1|  hypothetical protein MGP2080_05280 [marine...  52.0    8e-05
ref|ZP_01551157.1|  Outer membrane autotransporter barrel [Sta...  52.0    8e-05
ref|ZP_01387100.1|  hypothetical protein CferDRAFT_0105 [Chlor...  52.0    8e-05
ref|ZP_01387101.1|  hypothetical protein CferDRAFT_0106 [Chlor...  52.0    8e-05
ref|ZP_01365226.1|  hypothetical protein PaerPA_01002342 [Pseu...  52.0    8e-05
ref|ZP_01215158.1|  putative RTX toxin [Psychromonas sp. CNPT3...  52.0    8e-05
gb|AAR21093.1|  surface layer protein SapA13 [Campylobacter fe...  52.0    8e-05
ref|XP_363942.1|  hypothetical protein MGG_01868 [Magnaporthe ...  52.0    8e-05
ref|YP_094680.1|  hypothetical protein lpg0644 [Legionella pne...  52.0    8e-05
ref|YP_191548.1|  hypothetical protein GOX1124 [Gluconobacter ...  52.0    8e-05
ref|YP_167926.1|  type I secretion target repeat-containing pr...  52.0    8e-05
ref|YP_108245.1|  putative outer membrane protein [Burkholderi...  52.0    8e-05
ref|ZP_01124329.1|  hypothetical protein WH7805_03991 [Synecho...  52.0    8e-05
ref|ZP_01038861.1|  outer membrane autotransporter barrel prot...  52.0    8e-05
ref|YP_825521.1|  serine/threonine protein kinase [Solibacter ...  52.0    8e-05
ref|ZP_06205203.1|  conserved repeat domain protein [Yersinia ...  51.6    1e-04
ref|ZP_06181928.1|  conserved hypothetical protein [Vibrio alg...  51.6    1e-04
gb|EEZ80565.1|  hypothetical protein Sup05_0352 [uncultured SU...  51.6    1e-04
ref|ZP_05962755.1|  outer membrane autotransporter [Brucella n...  51.6    1e-04
ref|ZP_05449790.1|  outer membrane autotransporter [Brucella n...  51.6    1e-04
gb|ACS91539.1|  biofilm associated protein [Salmonella enteric...  51.6    1e-04
ref|YP_003014568.1|  hypothetical protein Pjdr2_5878 [Paenibac...  51.6    1e-04
ref|ZP_04451137.1|  hypothetical protein GCWU000182_00418 [Abi...  51.6    1e-04
ref|ZP_04456648.1|  Invasin [Yersinia pestis Pestoides A] >gb|...  51.6    1e-04
ref|ZP_04519372.1|  Invasin [Yersinia pestis Nepal516] >gb|EEO...  51.6    1e-04
ref|YP_002407707.1|  putative autotransported outer membrane p...  51.6    1e-04
emb|CAR65428.1|  DEHA2B01452p [Debaryomyces hansenii]              51.6    1e-04
ref|XP_002013998.1|  GL23094 [Drosophila persimilis] >gb|EDW24...  51.6    1e-04
ref|YP_001874392.1|  hypothetical protein YPTS_3986 [Yersinia ...  51.6    1e-04
ref|ZP_04887939.1|  putative cell surface protein [Burkholderi...  51.6    1e-04
ref|YP_001817334.1|  flagellar hook-associated 2 domain-contai...  51.6    1e-04
ref|ZP_02925046.1|  hypothetical protein VspiD_00350 [Verrucom...  51.6    1e-04
ref|YP_001751905.1|  Na-Ca exchanger/integrin-beta4 [Pseudomon...  51.6    1e-04
ref|ZP_02657669.1|  autotransporter beta-domain protein [Salmo...  51.6    1e-04
ref|ZP_02470325.1|  cell surface protein [Burkholderia pseudom...  51.6    1e-04
ref|ZP_02417374.1|  cell surface protein [Burkholderia pseudom...  51.6    1e-04
ref|ZP_02306192.1|  invasin [Yersinia pestis biovar Antiqua st...  51.6    1e-04
gb|ABQ02469.1|  OmpB [Rickettsia sp. TwKM01]                       51.6    1e-04
ref|ZP_02162963.1|  predicted calcium-binding protein [Kordia ...  51.6    1e-04
ref|ZP_01905078.1|  Hemolysin-type calcium-binding region [Ros...  51.6    1e-04
ref|XP_001508418.1|  PREDICTED: similar to tumor protein p53 b...  51.6    1e-04
ref|YP_001227171.1|  hypothetical protein SynRCC307_0915 [Syne...  51.6    1e-04
ref|YP_001207838.1|  hemagglutinin-like protein [Bradyrhizobiu...  51.6    1e-04
ref|YP_001164639.1|  invasin [Yersinia pestis Pestoides F] >gb...  51.6    1e-04
ref|YP_001129959.1|  hypothetical protein Cvib_0435 [Prostheco...  51.6    1e-04
ref|ZP_01753166.1|  hypothetical protein RSK20926_13389 [Roseo...  51.6    1e-04
ref|ZP_01736037.1|  flagellin FliC [Marinobacter sp. ELB17] >g...  51.6    1e-04
ref|YP_001065409.1|  cell surface protein [Burkholderia pseudo...  51.6    1e-04
ref|YP_001863342.1|  filamentous haemagglutinin outer membrane...  51.6    1e-04
ref|XP_784086.2|  PREDICTED: hypothetical protein [Strongyloce...  51.6    1e-04
ref|YP_649519.1|  invasin [Yersinia pestis Nepal516] >gb|ABG19...  51.6    1e-04
ref|YP_653680.1|  putative invasin [Yersinia pestis Antiqua] >...  51.6    1e-04
gb|ABF47342.1|  outer membrane protein [Rickettsia massiliae]      51.6    1e-04
ref|XP_457029.1|  hypothetical protein DEHA0B01452g [Debaryomy...  51.6    1e-04
gb|AAP13318.1|  FliC [Escherichia coli]                            51.6    1e-04
gb|AAF34122.1|AF123719_1  OmpB [Rickettsia rhipicephali]           51.6    1e-04
gb|AAF34113.1|AF123710_1  OmpB [Rickettsia sp. Bar29]              51.6    1e-04
ref|YP_402122.1|  hypothetical protein SDY_0422 [Shigella dyse...  51.6    1e-04
ref|YP_379543.1|  VCBS [Chlorobium chlorochromatii CaD3] >gb|A...  51.6    1e-04
ref|YP_136902.1|  hypothetical protein rrnAC2381 [Haloarcula m...  51.6    1e-04
ref|NP_994594.1|  putative invasin [Yersinia pestis biovar Mic...  51.6    1e-04
ref|YP_304787.1|  hypothetical protein Mbar_A1242 [Methanosarc...  51.6    1e-04
ref|YP_420631.1|  large exoprotein [Magnetospirillum magneticu...  51.6    1e-04
ref|XP_501427.1|  YALI0C04136p [Yarrowia lipolytica] >emb|CAG8...  51.6    1e-04
ref|NP_671178.1|  hypothetical protein y3884 [Yersinia pestis ...  51.6    1e-04
ref|YP_001131787.1|  putative outer membrane adhesin like prot...  51.6    1e-04
ref|ZP_02222785.1|  invasin [Yersinia pestis biovar Orientalis...  51.6    1e-04
ref|ZP_01078758.1|  hypothetical protein MED121_03691 [Marinom...  51.6    1e-04
ref|ZP_01068033.1|  putative lipoprotein [Campylobacter jejuni...  51.6    1e-04
ref|ZP_01067585.1|  putative lipoprotein [Campylobacter jejuni...  51.6    1e-04
ref|YP_001608399.1|  hypothetical protein YpAngola_A4116 [Yers...  51.6    1e-04
ref|YP_001344458.1|  YadA domain-containing protein [Actinobac...  51.6    1e-04
ref|XP_637068.1|  hypothetical protein DDB_G0287863 [Dictyoste...  51.6    1e-04
ref|YP_003397924.1|  adhesin HecA family [Acidaminococcus ferm...  51.2    1e-04
gb|EFA78204.1|  zymogen granule membrane glycoprotein [Polysph...  51.2    1e-04
ref|YP_003225779.1|  protein of unknown function DUF1078 domai...  51.2    1e-04
ref|ZP_05587733.1|  filamentous haemagglutinin [Burkholderia t...  51.2    1e-04
ref|XP_002545510.1|  predicted protein [Candida tropicalis MYA...  51.2    1e-04
ref|YP_002947797.1|  outer membrane adhesin like proteiin [Var...  51.2    1e-04
emb|CAV30829.1|  Integrins alpha chain:Na-Ca exchanger/integri...  51.2    1e-04
ref|ZP_04600545.1|  hypothetical protein VEIDISOL_02003 [Veill...  51.2    1e-04
ref|YP_003308262.1|  outer membrane autotransporter barrel dom...  51.2    1e-04
ref|ZP_03823362.1|  possible hemagluttinin family protein [Aci...  51.2    1e-04
gb|ACN23801.1|  large exoprotein [Clostridium sp. enrichment c...  51.2    1e-04
ref|ZP_05114283.1|  Bacterial flagellin N-terminus domain prot...  51.2    1e-04
ref|ZP_03573300.1|  hemagglutinin family protein [Burkholderia...  51.2    1e-04
ref|ZP_03578952.1|  hemagglutinin family protein [Burkholderia...  51.2    1e-04
ref|ZP_03544208.1|  Pyrrolo-quinoline quinone [Comamonas testo...  51.2    1e-04
ref|ZP_05101534.1|  putative RTX family exoprotein [Roseobacte...  51.2    1e-04
ref|XP_002605759.1|  hypothetical protein BRAFLDRAFT_78026 [Br...  51.2    1e-04
ref|ZP_03271180.1|  Hemagluttinin domain protein [Burkholderia...  51.2    1e-04
ref|ZP_05075224.1|  flagellin core protein A [Rhodobacterales ...  51.2    1e-04
ref|ZP_05067757.1|  integrins alpha chain [Octadecabacter anta...  51.2    1e-04
ref|ZP_03132793.1|  hypothetical protein CfE428DRAFT_5961 [Cht...  51.2    1e-04
gb|EDV08318.1|  conserved hypothetical protein [Saccharomyces ...  51.2    1e-04
ref|YP_001867571.1|  FG-GAP repeat-containing protein [Nostoc ...  51.2    1e-04
ref|YP_001843358.1|  hypothetical protein LAF_0542 [Lactobacil...  51.2    1e-04
ref|YP_001821124.1|  Ig family protein [Opitutus terrae PB90-1...  51.2    1e-04
gb|EDP47110.1|  conserved hypothetical protein [Aspergillus fu...  51.2    1e-04
ref|XP_001843502.1|  papilin [Culex quinquefasciatus] >gb|EDS3...  51.2    1e-04
ref|ZP_02240675.1|  invasin [Yersinia pestis biovar Antiqua st...  51.2    1e-04
ref|YP_001449214.1|  hypothetical protein VIBHAR_07115 [Vibrio...  51.2    1e-04
ref|YP_001353306.1|  large exoproteins involved in heme utiliz...  51.2    1e-04
ref|ZP_01874732.1|  VCBS [Lentisphaera araneosa HTCC2155] >gb|...  51.2    1e-04
ref|ZP_01751650.1|  outer membrane autotransporter barrel [Ros...  51.2    1e-04
ref|ZP_01746634.1|  extracellular nuclease [Sagittula stellata...  51.2    1e-04
ref|XP_001268442.1|  CFEM domain protein [Aspergillus clavatus...  51.2    1e-04
ref|YP_001177834.1|  putative outer membrane adhesin like prot...  51.2    1e-04
ref|ZP_01466922.1|  vcbs [Stigmatella aurantiaca DW4/3-1] >gb|...  51.2    1e-04
ref|XP_001660360.1|  papilin [Aedes aegypti] >gb|EAT38304.1| p...  51.2    1e-04
ref|YP_654663.1|  hypothetical protein MIV091L [Invertebrate i...  51.2    1e-04
ref|YP_394226.1|  hemolysin-type calcium-binding region [Sulfu...  51.2    1e-04
ref|YP_111446.1|  membrane-anchored cell surface protein [Burk...  51.2    1e-04
ref|XP_365440.1|  hypothetical protein MGG_02142 [Magnaporthe ...  51.2    1e-04
gb|AAC79513.1|  histidine secretory acid phosphatase [Leishman...  51.2    1e-04
ref|YP_443237.1|  filamentous haemagglutinin [Burkholderia tha...  51.2    1e-04
ref|YP_438314.1|  Hep_Hag family protein [Burkholderia thailan...  51.2    1e-04
ref|YP_345866.1|  von Willebrand factor, type A [Pseudomonas f...  51.2    1e-04
ref|YP_335617.1|  haemagluttinin family protein [Burkholderia ...  51.2    1e-04
ref|YP_214648.1|  fiber [Prochlorococcus phage P-SSM4] >gb|AAX...  51.2    1e-04
gb|AAV85946.1|  serine-rich repeat protein 2 [Streptococcus ag...  51.2    1e-04
ref|YP_349485.1|  Outer membrane autotransporter barrel [Pseud...  51.2    1e-04
ref|YP_157998.1|  hypothetical protein ebA1795 [Aromatoleum ar...  51.2    1e-04
ref|XP_002630500.1|  Hypothetical protein CBG11242 [Caenorhabd...  51.2    1e-04
ref|ZP_00053058.1|  COG2931: RTX toxins and related Ca2+-bindi...  51.2    1e-04
gb|ADE35574.1|  periplasmic copper-binding protein [Methanohal...  50.8    2e-04
ref|XP_002732605.1|  PREDICTED: fibrillin 2-like, partial [Sac...  50.8    2e-04
ref|YP_003371378.1|  autotransporter-associated beta strand re...  50.8    2e-04
ref|YP_003370810.1|  autotransporter-associated beta strand re...  50.8    2e-04
ref|YP_003369879.1|  autotransporter-associated beta strand re...  50.8    2e-04
ref|YP_003360678.1|  Permease protein of ABC transporter syste...  50.8    2e-04
ref|ZP_06229723.1|  outer membrane autotransporter barrel prot...  50.8    2e-04
ref|ZP_06009076.1|  outermembrane transporter [Campylobacter f...  50.8    2e-04
ref|ZP_05864304.1|  conserved hypothetical protein [Lactobacil...  50.8    2e-04
ref|ZP_05634979.1|  hypothetical protein FulcA4_16209 [Fusobac...  50.8    2e-04
ref|YP_002913212.1|  YadA C-terminal domain protein [Burkholde...  50.8    2e-04
ref|YP_003157853.1|  Hemolysin-type calcium-binding region [De...  50.8    2e-04
ref|ZP_03827003.1|  large repetitive protein [Pectobacterium c...  50.8    2e-04
ref|ZP_03698036.1|  Dystroglycan-type cadherin domain protein ...  50.8    2e-04
ref|ZP_03630561.1|  conserved repeat domain protein [bacterium...  50.8    2e-04
ref|ZP_05124630.1|  hemolysin-type calcium-binding region [Rho...  50.8    2e-04
ref|XP_002162014.1|  PREDICTED: similar to Hkr1p, partial [Hyd...  50.8    2e-04
ref|XP_002173208.1|  predicted protein [Schizosaccharomyces ja...  50.8    2e-04
ref|ZP_05052030.1|  type I secretion target GGXGXDXXX repeat p...  50.8    2e-04
ref|ZP_03128642.1|  autotransporter-associated beta strand rep...  50.8    2e-04
ref|ZP_03156937.1|  Parallel beta-helix repeat protein [Cyanot...  50.8    2e-04
ref|XP_002036198.1|  GM16904 [Drosophila sechellia] >gb|EDW521...  50.8    2e-04
gb|ACF09468.1|  pentapeptide repeat protein [uncultured marine...  50.8    2e-04
ref|XP_001919759.1|  PREDICTED: hypothetical protein, partial ...  50.8    2e-04
ref|ZP_04886281.1|  haemagglutinin family protein [Burkholderi...  50.8    2e-04
ref|ZP_02926300.1|  Hemolysin-type calcium-binding region [Ver...  50.8    2e-04
ref|ZP_02918663.1|  hypothetical protein BIFDEN_01971 [Bifidob...  50.8    2e-04
ref|ZP_02884314.1|  Haemagluttinin domain protein [Burkholderi...  50.8    2e-04
ref|YP_001770209.1|  hypothetical protein M446_3385 [Methyloba...  50.8    2e-04
ref|ZP_02357438.1|  flagellar hook-associated protein 2 [Burkh...  50.8    2e-04
ref|YP_001569326.1|  hypothetical protein SARI_00238 [Salmonel...  50.8    2e-04
ref|ZP_04965784.1|  putative cell surface protein [Burkholderi...  50.8    2e-04
ref|ZP_04884722.1|  putative outer membrane protein [Burkholde...  50.8    2e-04
ref|YP_001208728.1|  hypothetical protein BRADO6923 [Bradyrhiz...  50.8    2e-04
ref|ZP_01811815.1|  VCBS repeat protein [Vibrionales bacterium...  50.8    2e-04
ref|YP_001166187.1|  outer membrane autotransporter [Novosphin...  50.8    2e-04
gb|ABL97205.1|  hypothetical cadherin domain containing protei...  50.8    2e-04
ref|ZP_01719786.1|  hypothetical protein ALPR1_21088 [Algoriph...  50.8    2e-04
ref|YP_001816158.1|  outer membrane autotransporter [Burkholde...  50.8    2e-04
ref|YP_001760057.1|  hypothetical protein Swoo_1677 [Shewanell...  50.8    2e-04
ref|YP_001761540.1|  dystroglycan-type cadherin domain-contain...  50.8    2e-04
ref|YP_866885.1|  hemolysin-type calcium-binding region [Magne...  50.8    2e-04
ref|YP_864656.1|  hemolysin-type calcium-binding region [Magne...  50.8    2e-04
ref|XP_001179201.1|  PREDICTED: hypothetical protein [Strongyl...  50.8    2e-04
ref|YP_787342.1|  autotransporter [Bordetella avium 197N] >emb...  50.8    2e-04
ref|ZP_01459797.1|  LigC [Stigmatella aurantiaca DW4/3-1] >gb|...  50.8    2e-04
ref|YP_002537291.1|  Hyalin [Geobacter sp. FRC-32] >gb|ACM2019...  50.8    2e-04
ref|YP_723033.1|  hemolysin-type calcium-binding region [Trich...  50.8    2e-04
ref|YP_656981.1|  cell surface glycoprotein [Haloquadratum wal...  50.8    2e-04
ref|ZP_01287913.1|  hypothetical protein MldDRAFT_0801 [delta ...  50.8    2e-04
ref|YP_561402.1|  hypothetical protein Sden_0384 [Shewanella d...  50.8    2e-04
ref|ZP_01224964.1|  cadherin domain protein [marine gamma prot...  50.8    2e-04
ref|ZP_01225522.1|  cadherin domain protein [marine gamma prot...  50.8    2e-04
gb|AAY28516.1|  biofilm-associated protein [Staphylococcus chr...  50.8    2e-04
gb|AAY28520.1|  biofilm-associated protein [Staphylococcus hyi...  50.8    2e-04
gb|AAR21091.1|  surface layer protein SapB11 [Campylobacter fe...  50.8    2e-04
gb|AAL13053.1|  platelet binding protein GspB [Streptococcus g...  50.8    2e-04
ref|YP_383900.1|  hypothetical protein Gmet_0933 [Geobacter me...  50.8    2e-04
ref|YP_102734.1|  outer membrane protein, putative [Burkholder...  50.8    2e-04
ref|NP_635195.1|  hypothetical protein MM_3171 [Methanosarcina...  50.8    2e-04
ref|YP_363834.1|  putative filamentous hemagglutinin-like prot...  50.8    2e-04
dbj|BAD06577.1|  cell wall protein Awa1p [Saccharomyces cerevi...  50.8    2e-04
sp|Q8TGE1.1|AWA1_YEAST  RecName: Full=Cell wall protein AWA1; ...  50.8    2e-04
ref|YP_001232159.1|  hypothetical protein Gura_3430 [Geobacter...  50.8    2e-04
ref|ZP_01035333.1|  VCBS [Roseovarius sp. 217] >gb|EAQ26408.1|...  50.8    2e-04
ref|YP_778342.1|  outer membrane autotransporter [Burkholderia...  50.8    2e-04
ref|YP_625461.1|  YadA-like [Burkholderia cenocepacia AU 1054]...  50.8    2e-04
dbj|BAI95396.1|  outer membrane autotransporter barrel [Sphing...  50.4    2e-04
ref|YP_003536185.1|  Muc19 precursor, putative [Haloferax volc...  50.4    2e-04
ref|YP_003500028.1|  putative factor [Escherichia coli O55:H7 ...  50.4    2e-04
ref|ZP_06469729.1|  adhesin [Burkholderia sp. CCGE1003] >gb|EF...  50.4    2e-04
ref|ZP_06394374.1|  Hep_Hag family protein [Neisseria mucosa A...  50.4    2e-04
ref|YP_003393095.1|  outer membrane adhesin like proteiin [Con...  50.4    2e-04
dbj|BAI56978.1|  conserved hypothetical protein [Escherichia c...  50.4    2e-04
ref|YP_003335203.1|  Ig family protein [Dickeya dadantii Ech58...  50.4    2e-04
ref|ZP_06144810.1|  hypothetical protein RflaF_16515 [Ruminoco...  50.4    2e-04
ref|ZP_05947222.1|  putative adhesin [Escherichia coli O157:H7...  50.4    2e-04
ref|ZP_05901173.1|  conserved hypothetical protein [Leptotrich...  50.4    2e-04
ref|ZP_05812301.1|  YadA domain protein [Mesorhizobium opportu...  50.4    2e-04
ref|ZP_05632756.1|  hypothetical protein FulcA4_04924 [Fusobac...  50.4    2e-04
ref|YP_003078518.1|  putative adhesin [Escherichia coli O157:H...  50.4    2e-04
ref|ZP_04820060.1|  hemagglutinin family protein [Burkholderia...  50.4    2e-04
ref|ZP_04781760.1|  possible polymorphic outer membrane protei...  50.4    2e-04
ref|XP_002546360.1|  predicted protein [Candida tropicalis MYA...  50.4    2e-04
ref|YP_003163785.1|  Autotransporter beta- domain protein [Lep...  50.4    2e-04
ref|ZP_05940785.1|  putative adhesin [Escherichia coli O157:H7...  50.4    2e-04
ref|ZP_03726641.1|  hypothetical protein ObacDRAFT_6982 [Opitu...  50.4    2e-04
ref|ZP_05125125.1|  Sperm-activating peptides family [Rhodobac...  50.4    2e-04
ref|ZP_05125254.1|  hemolysin-type calcium-binding protein [Rh...  50.4    2e-04
ref|ZP_05125693.1|  iron-regulated protein FrpC [Rhodobacterac...  50.4    2e-04
ref|ZP_03582276.1|  hemagglutinin family protein [Burkholderia...  50.4    2e-04
ref|ZP_02814407.2|  conserved hypothetical protein [Escherichi...  50.4    2e-04
ref|ZP_02926011.1|  hypothetical protein VspiD_05195 [Verrucom...  50.4    2e-04
ref|ZP_04903919.1|  haemagglutinin family protein [Burkholderi...  50.4    2e-04
ref|ZP_02795296.1|  conserved hypothetical protein [Escherichi...  50.4    2e-04
ref|YP_001745904.1|  haemagluttinin family protein [Escherichi...  50.4    2e-04
ref|ZP_02477263.1|  polymorphic membrane protein, Filamentous ...  50.4    2e-04
ref|ZP_02364545.1|  flagellar hook-associated protein 2 [Burkh...  50.4    2e-04
ref|YP_001683484.1|  outer membrane autotransporter [Caulobact...  50.4    2e-04
ref|YP_002429657.1|  Ig domain protein group 1 domain protein ...  50.4    2e-04
ref|YP_001499678.1|  Outer membrane protein rOmpB [Rickettsia ...  50.4    2e-04
ref|XP_001601735.1|  PREDICTED: similar to papilin [Nasonia vi...  50.4    2e-04
ref|ZP_04974875.1|  putative outer membrane protein [Burkholde...  50.4    2e-04
ref|YP_001223846.1|  hypothetical protein SynWH7803_0123 [Syne...  50.4    2e-04
ref|YP_001068101.1|  polymorphic membrane protein, filamentous...  50.4    2e-04
ref|ZP_04928291.1|  hypothetical protein PACG_00846 [Pseudomon...  50.4    2e-04
ref|YP_943727.1|  filamentous haemagglutinin outer membrane pr...  50.4    2e-04
ref|YP_944153.1|  hemagglutinin/hemolysin-related protein [Psy...  50.4    2e-04
gb|ABL74377.1|  antifreeze protein [Marinomonas primoryensis]      50.4    2e-04
ref|YP_484383.1|  flagellar hook-associated protein [Rhodopseu...  50.4    2e-04
ref|YP_464819.1|  hypothetical protein Adeh_1609 [Anaeromyxoba...  50.4    2e-04
gb|AAF34117.1|AF123714_1  OmpB [Rickettsia massiliae]              50.4    2e-04
gb|AAZ95526.1|  serine-rich repeat protein [Streptococcus agal...  50.4    2e-04
ref|YP_400354.1|  Integrins alpha chain [Synechococcus elongat...  50.4    2e-04
ref|NP_644412.1|  YapH protein [Xanthomonas axonopodis pv. cit...  50.4    2e-04
ref|NP_816909.1|  cell wall surface anchor family protein [Ent...  50.4    2e-04
ref|YP_420638.1|  Type V secretory pathway [Magnetospirillum m...  50.4    2e-04
ref|YP_170926.1|  hypothetical protein syc0216_c [Synechococcu...  50.4    2e-04
ref|NP_934339.1|  putative RTX protein [Vibrio vulnificus YJ01...  50.4    2e-04
ref|NP_288487.1|  putative invasin [Escherichia coli O157:H7 E...  50.4    2e-04
sp|Q8X8V7.2|YEEJ_ECO57  RecName: Full=Uncharacterized protein ...  50.4    2e-04
ref|NP_522101.1|  hemagglutinin-related protein [Ralstonia sol...  50.4    2e-04
ref|NP_310803.1|  hypothetical protein ECs2776 [Escherichia co...  50.4    2e-04
ref|YP_003196729.1|  hypothetical protein RB2501_02545 [Robigi...  50.4    2e-04
ref|ZP_00784977.1|  cell wall surface anchor family protein [S...  50.4    2e-04
ref|YP_821916.1|  hypothetical protein Acid_0626 [Solibacter u...  50.4    2e-04
ref|ZP_06351191.1|  YadA domain protein [Rhodomicrobium vannie...  50.1    3e-04
ref|YP_003372629.1|  autotransporter-associated beta strand re...  50.1    3e-04
ref|ZP_05887531.1|  probable RTX [Vibrio coralliilyticus ATCC ...  50.1    3e-04
ref|ZP_05741405.1|  conserved hypothetical protein [Silicibact...  50.1    3e-04
ref|YP_003191420.1|  Ig domain protein group 2 domain protein ...  50.1    3e-04
ref|ZP_05598314.1|  cell wall surface anchor protein [Enteroco...  50.1    3e-04
ref|ZP_05594399.1|  cell wall surface anchor family protein [E...  50.1    3e-04
ref|YP_003135936.1|  hypothetical protein Cyan8802_0130 [Cyano...  50.1    3e-04
ref|ZP_05422115.1|  predicted protein [Enterococcus faecalis T...  50.1    3e-04
ref|YP_003051754.1|  putative rhizobiocin/RTX toxin and hemoly...  50.1    3e-04
ref|YP_002896857.1|  Hep_Hag family protein [Burkholderia pseu...  50.1    3e-04
ref|YP_003307146.1|  hypothetical protein Sterm_0331 [Sebaldel...  50.1    3e-04
ref|XP_002537294.1|  conserved hypothetical protein [Ricinus c...  50.1    3e-04
ref|ZP_03611634.1|  hypothetical protein AM202_0050 [Actinobac...  50.1    3e-04
ref|ZP_03453432.1|  hemagglutinin [Burkholderia pseudomallei 5...  50.1    3e-04
ref|YP_002255439.1|  hypothetical hemagglutinin-related protei...  50.1    3e-04
ref|ZP_05053574.1|  Autotransporter beta-domain protein [Octad...  50.1    3e-04
ref|XP_001999822.1|  GI22868 [Drosophila mojavensis] >gb|EDW15...  50.1    3e-04
ref|YP_001913314.1|  YapH protein [Xanthomonas oryzae pv. oryz...  50.1    3e-04
ref|YP_001820411.1|  beta strand repeat-containing protein [Op...  50.1    3e-04
ref|ZP_02928862.1|  hypothetical protein VspiD_19470 [Verrucom...  50.1    3e-04
ref|ZP_02925551.1|  hypothetical protein VspiD_02885 [Verrucom...  50.1    3e-04
ref|YP_001792758.1|  outer membrane adhesin like proteiin [Lep...  50.1    3e-04
ref|YP_001707965.1|  bifunctional phage-like tail fibre protei...  50.1    3e-04
ref|ZP_01256020.1|  hypothetical protein P700755_21261 [Psychr...  50.1    3e-04
ref|XP_001603259.1|  PREDICTED: hypothetical protein [Nasonia ...  50.1    3e-04
ref|XP_001596494.1|  predicted protein [Sclerotinia sclerotior...  50.1    3e-04
ref|XP_001590523.1|  hypothetical protein SS1G_08263 [Scleroti...  50.1    3e-04
gb|EDN59060.1|  hypothetical protein SCY_2324 [Saccharomyces c...  50.1    3e-04
ref|YP_002566960.1|  Protein of unknown function DUF1628 [Halo...  50.1    3e-04
ref|YP_001356857.1|  hypothetical protein NIS_1393 [Nitratirup...  50.1    3e-04
ref|YP_001356856.1|  hypothetical protein NIS_1391 [Nitratirup...  50.1    3e-04
ref|ZP_01765153.1|  hypothetical protein BURPS305_5906 [Burkho...  50.1    3e-04
ref|ZP_01745429.1|  outer membrane autotransporter barrel [Sag...  50.1    3e-04
ref|YP_001066348.1|  putative outer membrane protein [Burkhold...  50.1    3e-04
ref|YP_001030808.1|  MoxR-like ATPases-like [Methanocorpusculu...  50.1    3e-04
ref|YP_001901191.1|  outer membrane autotransporter barrel dom...  50.1    3e-04
ref|YP_656868.1|  halomucin [Haloquadratum walsbyi DSM 16790] ...  50.1    3e-04
ref|ZP_01255327.1|  Hep_Hag family protein [Psychroflexus torq...  50.1    3e-04
emb|CAH61073.1|  S-layer-like protein, SllB [Bacillus sphaericus]  50.1    3e-04
gb|AAO19442.1|  BpaA [Burkholderia pseudomallei]                   50.1    3e-04
ref|YP_297134.1|  filamentous haemagglutinin adhesin HecA 20-r...  50.1    3e-04
ref|NP_765804.1|  streptococcal hemagglutinin protein [Staphyl...  50.1    3e-04
ref|XP_001823342.1|  hypothetical protein [Aspergillus oryzae ...  50.1    3e-04
ref|YP_451289.1|  YapH protein [Xanthomonas oryzae pv. oryzae ...  50.1    3e-04
ref|ZP_01004566.1|  hypothetical protein SKA53_00190 [Loktanel...  50.1    3e-04
ref|YP_001169330.1|  hypothetical protein Rsph17025_3141 [Rhod...  50.1    3e-04
ref|YP_611521.1|  Outer membrane autotransporter barrel [Ruege...  50.1    3e-04
ref|XP_001055778.2|  PREDICTED: rCG63077-like, partial [Rattus...  49.7    4e-04
gb|ADE39082.1|  flagellar hook-associated 2 domain protein [al...  49.7    4e-04
ref|XP_001920404.2|  PREDICTED: hypothetical protein, partial ...  49.7    4e-04
ref|ZP_06466808.1|  conserved hypothetical protein [Burkholder...  49.7    4e-04
ref|YP_003451319.1|  hypothetical protein AZL_b01120 [Azospiri...  49.7    4e-04
ref|YP_003315743.1|  hypothetical protein Sked_30080 [Sanguiba...  49.7    4e-04
ref|XP_002174158.1|  predicted protein [Schizosaccharomyces ja...  49.3    5e-04

ALIGNMENTS
>ref|ZP_05023899.1| haemagglutination activity domain protein [Microcoleus chthonoplastes 
PCC 7420]
 gb|EDX78139.1| haemagglutination activity domain protein [Microcoleus chthonoplastes 
PCC 7420]
Length=4083

 Score = 79.7 bits (195),  Expect = 4e-13
 Identities = 74/299 (24%), Positives = 135/299 (45%), Gaps = 35/299 (11%)
 Frame = +1

Query  1     TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI  180
             T D +    TIT+  +L+ +    N GDATLT+     +   GT T      +LT  +G 
Sbjct  2962  TADVEAKTVTITVENNLNTANVTANEGDATLTSKAGDINTINGTITATQGSVNLTTDQG-  3020

Query  181   DVSGGGSENATITVSAEDA-----TSSNKGIASFDST---------DFTVSSGAVTVNAE  318
             DV+    E  T+T++AE+       ++N+G A+  S            T + G+V +  +
Sbjct  3021  DVTTADVEAKTVTITAENNLNTANVTANEGDATLTSKAGDINTINGTITATQGSVNLTTD  3080

Query  319   R----VQDIVGAMVGSNTESGIT---VTYEDSDGTLDFNVADPVITLSGDVAGSATMTNL  477
             +      D+    V    E+ +    VT  + D TL     D + T++G +  +    NL
Sbjct  3081  QGDVTTADVEAKTVTITAENNLNTANVTANEGDATLTSKAGD-INTINGTITATQGSVNL  3139

Query  478   ----GDVTISTTIQANSIALGTDTTGNYVSAISAGEGI-------DVSGSGSETATVTIS  624
                 GDVT +  ++A ++ +  +   N  +  +  + +       DV+ +  E  TVTI+
Sbjct  3140  TTDQGDVT-TADVEAKTVTITAENNLNTANVTATEDAVNLTTNQGDVTTADVEAKTVTIT  3198

Query  625   AEDATDSNKGIASFDATDFTVSSGDVTVNAERIQDIVGAMFSSNTESGISVTYEDSDGT  801
             AE+  ++    A+ DA + T + GDVT      + +     ++ T + ++ T ED++ T
Sbjct  3199  AENNLNTANVTATEDAVNLTTNQGDVTTADVEAKTVTITAQNNLTTANVTATEEDANLT  3257


 Score = 67.8 bits (164),  Expect = 1e-09
 Identities = 72/284 (25%), Positives = 128/284 (45%), Gaps = 26/284 (9%)
 Frame = +1

Query  1     TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI  180
             T D +    TIT+  +L+ +    N GDATLT+     +   GT T      +LT  +G 
Sbjct  2900  TADVEAKTVTITVENNLNTANVTANEGDATLTSKAGDINTINGTITATQGSVNLTTDQG-  2958

Query  181   DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNT  360
             DV+    E  T+T++ E+  + N    + +  D T++S A  +N      I G +    T
Sbjct  2959  DVTTADVEAKTVTITVEN--NLNTANVTANEGDATLTSKAGDINT-----INGTITA--T  3009

Query  361   ESGITVTYEDSD-GTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDT  537
             +  + +T +  D  T D       IT   ++  +    N GD T+++     +   GT T
Sbjct  3010  QGSVNLTTDQGDVTTADVEAKTVTITAENNLNTANVTANEGDATLTSKAGDINTINGTIT  3069

Query  538   TGNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAE  717
                    ++  +G DV+ +  E  TVTI+AE+         + +  + T + GD T+ ++
Sbjct  3070  ATQGSVNLTTDQG-DVTTADVEAKTVTITAEN---------NLNTANVTANEGDATLTSK  3119

Query  718   R--IQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTLSLQA  843
                I  I G +  + T+  +++T +  D T   DV   T+++ A
Sbjct  3120  AGDINTINGTI--TATQGSVNLTTDQGDVT-TADVEAKTVTITA  3160


 Score = 65.5 bits (158),  Expect = 7e-09
 Identities = 84/321 (26%), Positives = 133/321 (41%), Gaps = 71/321 (22%)
 Frame = +1

Query  1     TLDFDVDDFTITLGGDLSGSATVTNLGDATLTA------------TITANSVALGTD---  135
             T D +    TIT   +L+ +    N GDATLT+            T T  SV L TD   
Sbjct  3086  TADVEAKTVTITAENNLNTANVTANEGDATLTSKAGDINTINGTITATQGSVNLTTDQGD  3145

Query  136   -------------------TTGNFVA-----DLTAGEGIDVSGGGSENATITVSAEDATS  243
                                 T N  A     +LT  +G DV+    E  T+T++AE+  +
Sbjct  3146  VTTADVEAKTVTITAENNLNTANVTATEDAVNLTTNQG-DVTTADVEAKTVTITAENNLN  3204

Query  244   SNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNTESGITVTYEDSDGTLDFNVAD  423
             +    A+ D+ + T + G VT      + +      + T + +T T ED++ T   +   
Sbjct  3205  TANVTATEDAVNLTTNQGDVTTADVEAKTVTITAQNNLTTANVTATEEDANLT---SATG  3261

Query  424   PVITLSGDVAGSATMTNL---------GDVTISTT-IQANSIALGTDTTGNYVSAISAGE  573
              + T  G +  S    NL         GDVT     +QAN+    T+ T    + +++  
Sbjct  3262  SINTSQGKIEASQGAANLTANDGEITTGDVTAQRAELQANNNLTTTNVTTEEDANLTSTT  3321

Query  574   GIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDV---TVNAERIQDIVGAM  744
             G   + +G  TAT            +G A+  AT+ +V++ D+   TV  E   +I  A 
Sbjct  3322  GDINTSNGKITAT------------QGAANLTATEGSVTTADLEAQTVQVEGNDNITTAN  3369

Query  745   FSSNTESGISVTYEDSDGTID  807
              +S T+  I+VT E   G+ID
Sbjct  3370  VTS-TDGAINVTSE--SGSID  3387


 Score = 64.7 bits (156),  Expect = 1e-08
 Identities = 70/275 (25%), Positives = 125/275 (45%), Gaps = 26/275 (9%)
 Frame = +1

Query  28    TITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGIDVSGGGSEN  207
             TIT   +L+ +    N GDATLT+     +   GT T      +LT  +G DV+    E 
Sbjct  2847  TITAENNLNTANVTANEGDATLTSKAGDINTINGTITATQGSVNLTTDQG-DVTTADVEA  2905

Query  208   ATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNTESGITVTYE  387
              T+T++ E+  + N    + +  D T++S A  +N      I G +    T+  + +T +
Sbjct  2906  KTVTITVEN--NLNTANVTANEGDATLTSKAGDINT-----INGTITA--TQGSVNLTTD  2956

Query  388   DSD-GTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSAIS  564
               D  T D       IT+  ++  +    N GD T+++     +   GT T       ++
Sbjct  2957  QGDVTTADVEAKTVTITVENNLNTANVTANEGDATLTSKAGDINTINGTITATQGSVNLT  3016

Query  565   AGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAER--IQDIVG  738
               +G DV+ +  E  TVTI+AE+         + +  + T + GD T+ ++   I  I G
Sbjct  3017  TDQG-DVTTADVEAKTVTITAEN---------NLNTANVTANEGDATLTSKAGDINTING  3066

Query  739   AMFSSNTESGISVTYEDSDGTIDLDVSDPTLSLQA  843
              +  + T+  +++T +  D T   DV   T+++ A
Sbjct  3067  TI--TATQGSVNLTTDQGDVT-TADVEAKTVTITA  3098


 Score = 40.8 bits (94),  Expect = 0.19
 Identities = 55/238 (23%), Positives = 94/238 (39%), Gaps = 25/238 (10%)
 Frame = +1

Query  55    GSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEG--------IDVSGGG-SEN  207
             G+ TV   G     + +TANS+    D       D   GE         ++  G   ++N
Sbjct  877   GAITVNTTGTTRFNSNVTANSLTTDDDPDN---PDNVGGETQLNGDVRTLETLGQTYNDN  933

Query  208   ATITVSAEDATSSNKGIASFDSTDFTVSSGA---VTVNAERVQDIVGAMVGSNTESGIT-  375
               +  +    +S N G+ SF+ T  +++ GA   +TV + R        +G++  +  T 
Sbjct  934   VRVDKNITLDSSQNNGVISFEGTLNSLNQGAKKNLTVTSGRGNITFTGEIGTSNATNATR  993

Query  376   --VTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQA--NSIALGTDTTG  543
                   +S GT  FN      +L+ D  G  T  N    T+ T  Q   +++ +  D T 
Sbjct  994   LGAITVNSSGTTRFNSNVRANSLTTDADGE-TQLNGNVTTVGTQGQTYNDNVRVDNDITL  1052

Query  544   NYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAE  717
             N ++  +  +     GS S   TV      A +  +  A+ +       SGD+T   E
Sbjct  1053  NSINNNANND----DGSISFKGTVNSRDSGAENGEENPATVNNLTINGGSGDITFTGE  1106


 Score = 40.8 bits (94),  Expect = 0.19
 Identities = 73/288 (25%), Positives = 113/288 (39%), Gaps = 44/288 (15%)
 Frame = +1

Query  4     LDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVAD--LTAGEG  177
             L  D  D TI     + GS   T LG+    A    N V +G++T      +  LTA E 
Sbjct  2203  LTVDAGDGTIEAKNIIGGS---TALGNLDFNAK-EINIVGIGSNTAAGVSTNTQLTA-ES  2257

Query  178   IDVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGS-  354
             I+ +G      T T +A D   +  G+ SF S++  +S       A+ ++   G   G+ 
Sbjct  2258  INFTGTTYNANTQTYTAADINVNGAGLTSFTSSNDDISFDGNLSLAQDIKVDTGTGNGNI  2317

Query  355   ---NTESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIAL  525
                   +G +   +   GT + N+ +  +  S +  G+ T+TN  DVT   +I A SI  
Sbjct  2318  YFQEAITGNSKNIKLLAGTGNINL-NGAVGSSENPVGNLTITNANDVTAVNSITAASITQ  2376

Query  526   GTDTTGNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVT  705
                         SAG G             T +     DS  G  +     F + +   T
Sbjct  2377  ------------SAGIG-------------TTTLNGVLDSRTGAINLTNNSFNIGTNIKT  2411

Query  706   VNAERIQDIVGAMFSSNTESGISVTYEDSDGTIDLD---VSDPTLSLQ  840
              NA+   +  GA+  +N+   + V      GTI L      D +L LQ
Sbjct  2412  NNADITFN--GAVTQTNSVEPVEVI--AGTGTITLSTWTAGDHSLRLQ  2455


 Score = 37.0 bits (84),  Expect = 2.8
 Identities = 71/326 (21%), Positives = 136/326 (41%), Gaps = 71/326 (21%)
 Frame = +1

Query  10    FDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGIDVS  189
             ++ +++ + LG  L   +T    GD T  A +       G D   + +  +  G G  + 
Sbjct  2610  YNAENYKVFLGSPLVSLSTGLGQGDITFNANVN------GIDDNEHGL-QVQPGTGKVLF  2662

Query  190   GG--GSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAE--RVQDIVGAMVGSN  357
              G  G+ +   +++  D T++N  +        TV   ++ VNA+  RV  ++    GS 
Sbjct  2663  NGEVGTIHPLDSLTIGDQTTTNADLIGSS----TVVFNSLRVNAQQTRVSGLIRTRGGSI  2718

Query  358   TESGITVTYEDS--DGTLDFN--------VADPVITLSGDVA-GSATMTNL------GDV  486
             + SG  +  ED+  D T + N            V +L  ++A GS    +L      G V
Sbjct  2719  SFSGHVILIEDTTFDTTFETNGLSEGEIVFGKTVSSLQENLADGSLRSFDLTLKPGAGKV  2778

Query  487   TISTTIQANSIA-------------LGTDTTGNYVS------AISAGEGIDVS------G  591
                  ++A S               L  +++G  ++         A EGI+++      G
Sbjct  2779  DFQGAVEAQSFGSEFSIPETQDLGRLVIESSGEVIAKEGETITTIAPEGINITADSINLG  2838

Query  592   SGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNAER--IQDIVGAMFSSNTES  765
             S  E  TVTI+AE+         + +  + T + GD T+ ++   I  I G +  + T+ 
Sbjct  2839  SNVEAKTVTITAEN---------NLNTANVTANEGDATLTSKAGDINTINGTI--TATQG  2887

Query  766   GISVTYEDSD-GTIDLDVSDPTLSLQ  840
              +++T +  D  T D++    T++++
Sbjct  2888  SVNLTTDQGDVTTADVEAKTVTITVE  2913


 Score = 36.6 bits (83),  Expect = 3.7
 Identities = 55/230 (23%), Positives = 92/230 (40%), Gaps = 46/230 (20%)
 Frame = +1

Query  55    GSATVTNLGDATLTATITANSVAL-GTD--TTGNFVADLTAGEGIDVSGGGSENATITVS  225
             G+A +T    +  TA + A +V + G D  TT N V        +    G  ++  +T  
Sbjct  3336  GAANLTATEGSVTTADLEAQTVQVEGNDNITTAN-VTSTDGAINVTSESGSIDSRDLTAK  3394

Query  226   AEDATSS-------NKGIASFDSTDFTVSSGAVTVNAERVQDIVGAM-VGSNTESGITVT  381
              E   S+       N    + D+   T   GA+ +N+E      GA+  G  T SG T  
Sbjct  3395  GETQESAVTLNAPGNITTGNIDTASETAQGGAIALNSE-----TGAIQSGDLTSSGAT--  3447

Query  382   YEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSAI  561
                  G  D  V+ P    +G++  S++  N   + +++ +       G  TTGN  S+ 
Sbjct  3448  -----GGGDVTVSAPNTITTGNIDCSSSNGNGCAIALTSEV-------GDITTGNLNSS-  3494

Query  562   SAGEGIDVSGSGSETATVTISAEDATDSNKGIASFDATDFTVSSGDVTVN  711
                     SG GS+T         +T S   +   +++    S G +T+N
Sbjct  3495  ------GASGGGSQTI--------STLSQIILGELNSSSTLGSPGQITIN  3530


>gb|EFA77040.1| hypothetical protein PPL_09793 [Polysphondylium pallidum PN500]
Length=4804

 Score = 75.1 bits (183),  Expect = 9e-12
 Identities = 75/280 (26%), Positives = 119/280 (42%), Gaps = 19/280 (6%)
 Frame = +1

Query  28    TITLGGDLSGSATVTNLGDATLTATITANSV-ALGTDTTGNFVADLTAGEGIDVSGGGSE  204
             T T G DL+   T T  GD+T T+  T  S  ++ T TTG+    ++   G   S   SE
Sbjct  4350  TSTTGDDLT---TSTISGDSTTTSLTTGESTTSVPTTTTGHESTSISTTGGS--SSTTSE  4404

Query  205   NATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNA--ERVQDIVGAMVGSNTESGITV  378
               T   +  D+T+SN    + DST  T +  + T  +  +    I+G    S T +    
Sbjct  4405  PITTLTTGGDSTTSNNSTTTGDSTISTTTGDSTTSTSGGDSTSSIIGGDSISTTLTTGEP  4464

Query  379   TYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSA  558
             T   S   +D   +  + T S D   S T T  GD+T STT   ++ +    TTG+  ++
Sbjct  4465  TITSSTTGIDSATSSAISTTSSD---STTSTTSGDLTTSTTSGDSTTSTIPTTTGDSTTS  4521

Query  559   ISAGEGIDVSGSGSET--------ATVTISAEDATDSNKGIASFDATDFTVSSGDVTVNA  714
              ++G+ I  + SG  T         T T S++  T +   ++S  + D T S+   T   
Sbjct  4522  TTSGDSIPSTTSGDLTTSTTSGDWTTSTTSSDLTTSTTSSVSSTTSGDSTTSTNSTTGGD  4581

Query  715   ERIQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTLS  834
                    G   +S T   ++++    D T      D T S
Sbjct  4582  LTTSTTSGDSSTSTTSGNLTISTTSGDLTTSTTNGDLTTS  4621


 Score = 68.2 bits (165),  Expect = 1e-09
 Identities = 80/298 (26%), Positives = 132/298 (44%), Gaps = 50/298 (16%)
 Frame = +1

Query  19    DDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGIDVSGGG  198
             D  T T  GDL+   T T  GD+T +   T    +  + T+G+ +   T+G+ +  S   
Sbjct  4487  DSTTSTTSGDLT---TSTTSGDSTTSTIPTTTGDSTTSTTSGDSIPSTTSGD-LTTSTTS  4542

Query  199   SENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQDIVGAMVGSNTESGITV  378
              +  T T S++  TS+   ++S  S D T S+ + T          G  + ++T SG   
Sbjct  4543  GDWTTSTTSSDLTTSTTSSVSSTTSGDSTTSTNSTT----------GGDLTTSTTSG---  4589

Query  379   TYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTTIQANSIALGTDTTGNYVSA  558
               + S  T   N+   + T SGD+    T T  GD+T STT   ++    T TTG+  S 
Sbjct  4590  --DSSTSTTSGNLT--ISTTSGDLT---TSTTNGDLTTSTTSGDST----TSTTGSATST  4638

Query  559   IS-----------AGEGIDVSGSGSETATVTISAEDAT--DSNKGIASFDATDFTVSSGD  699
              S           +G+    + SG  + + T SA  +T  DS   I + D T+ T +  D
Sbjct  4639  TSDDSITSTTSTTSGDSTTSTTSGDSSISTTNSATSSTSGDSTTSITT-DGTNST-TGDD  4696

Query  700   VTVNAERIQDIVGAMFSSNTESG------ISVTYEDSDGTIDLDVSDPTLSLQAMSQV  855
              +   + +    G ++S+ + SG      IS T      T +++ S PT S+   S++
Sbjct  4697  TSTTGDILTSTTGVIYSTTSTSGDFNSSMISTTSTVDSTTGEIN-STPTTSMITTSEL  4753


 Score = 62.8 bits (151),  Expect = 5e-08
 Identities = 75/300 (25%), Positives = 108/300 (36%), Gaps = 32/300 (10%)
 Frame = +1

Query  19    DDFTITLGGDLSGSATVTNLGDATLTATITANSVALGT-----------DTTGNFVADLT  165
             D  T T  GD + S T T  G    T + T     L T            TTG   +  T
Sbjct  4144  DSTTSTTTGDSTTSITSTTSGSDNSTTSTTGEMSTLTTGEMSTSTTGTNSTTGEHTSTST  4203

Query  166   AGEGIDVS----GGGSENATITVSAEDATSSNKG----IASFDSTDFTVSSGAVTVNAER  321
              GE    S    G  S + T+T S +  T+S  G     ++  +T    S+ + T     
Sbjct  4204  TGEDPSTSSTTGGDSSTSTTLTTSEQTITTSTTGGDLTTSNNSTTGGDTSTTSTTGEPTT  4263

Query  322   VQDIVGAMVGSNTESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTNLGDVTISTT  501
                      G +T S  + T E +  ++  + +    T  GD   S T T     TISTT
Sbjct  4264  ASTTTSTTGGDSTTSTTSTTGEPTTISITTSTSTTTSTTGGDSTTSTTSTTSEPTTISTT  4323

Query  502   IQANSIALGTDTTGN-------YVSAIS-AGEGIDVSG-SGSETATVTISAEDATDSNKG  654
                    L + TT         Y++  S  G+ +  S  SG  T T   + E  T     
Sbjct  4324  TSTTGGDLTSSTTSTTSEPTTVYLTTTSTTGDDLTTSTISGDSTTTSLTTGESTTSVPTT  4383

Query  655   IASFDATDFTVSSGDVTVNAERIQDIVGAMFSSNTESGISVTYEDSDGTIDLDVSDPTLS  834
                 ++T  + + G  +  +E I  +       +T S  S T  DS  TI     D T S
Sbjct  4384  TTGHESTSISTTGGSSSTTSEPITTLTTG--GDSTTSNNSTTTGDS--TISTTTGDSTTS  4439


 Score = 58.5 bits (140),  Expect = 9e-07
 Identities = 70/278 (25%), Positives = 111/278 (39%), Gaps = 33/278 (11%)
 Frame = +1

Query  1     TLDFDVDDFTITLGGDLSGSATVTNLGDATLTATITANSVALGTDTTGNFVADLTAGEGI  180
             T +F      IT    L+ S T T  G    T  +T+NS    T TT     D T G  I
Sbjct  3770  TGEFTTTGEPITTTTILTTSTTSTTGGT---TGGVTSNSTTSTTSTT--ISTDSTTGSTI  3824

Query  181   DVSGGGSENATITVSAEDATSSNKGIASFDSTDFTVSSGAVTVNAERVQ--------DIV  336
               SG  S+N+T + + E  TSS    ++   T  + +  A T   E              
Sbjct  3825  TTSG--SDNSTTSTTGELTTSSTSTTSTTGETSSSTTGVANTTTGEPTSTSTTGQDPSTS  3882

Query  337   GAMVGSNTESGITVTYEDSDGTLDFNVADPVITLSGDVAGSATMTN--LGDVTISTTIQA  510
              +  G +  +G T+T   SD +          + +G+   S T T+   G++++STT  A
Sbjct  3883  NSTTGGDLTTGSTITTSGSDNS--------TTSTTGEFTTSTTYTSSITGEISLSTTGGA  3934

Query  511   NSIALGTDTTGNYVSAISAGEGIDVSGSGSETATVTISAEDATDSNKGIASF-DATDFTV  687
             N+      TTG   S  +  +    S +G  +++ T      T     I+S  +    T 
Sbjct  3935  NT------TTGESTSTSTTSDSTTTSTTGEISSSTTGGTNTTTGEPTSISSTGENPSTTT  3988

Query  688   SSGDVTVNAERIQDIVGAMFSSNTESGISVTYEDSDGT  801
             S+ D T ++       G M S++T  G + T  +   T
Sbjct  3989  STTDSTTSSTSTTSTTGEM-STSTTGGTNTTTGEPTST  4025