GOS 1149010

From Metagenes
Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary!


Sequence
CAMERA AccNum : JCVI_READ_1091140850066
Annotathon code: GOS_1149010
Sample :
  • GPS :1°12'58s; 90°25'22w
  • Galapagos Islands: Devil's Crown, Floreana Island - Ecuador
  • Coastal (-2.2m, 25.5°C, 0.1-0.8 microns)
Authors
Team : COMSATSISB
Username : saadia_arshad
Annotated on : 2010-01-21 17:44:45
  • arshad saadia

Synopsis

Genomic Sequence

>JCVI_READ_1091140850066 GOS_1149010 Genomic DNA
TCTTGACGGGTTGCTTCATCTTCTGACATTACCATCACGCCCTCTTTTGAGAGGACTTCAAACGCAGGTAGAGAAGATGGCAGCTTTATTGGCATTTTGA
TAACTATTAAATACTTCTTGAAATTGTTGATCCAATTAATGAATTAAACTCCGAAGGTGTTTTTACTTGAGAGATTAAATCAGCTTGTACTGTTATTCCC
CAGGAAGCCATTGACTTATAAATTGGGTCTCTGTGCGCCATGGCTTCCCTATAGGTCCAGCGCACAAAATCATCGGGGTCTACGTTTTCAACAGAACACT
TATTTATCTTGAGATACTCTCTCCATTTTAGATCTAGAAAATCTGGGGCGTAACACATAGGTTTAGGGTTCTCGTCAAACCGTTTTATCAATTCATCTGT
ATGGTGGCTTGAGCCCTCAATCCACACCATAAGAGTATTTTCTGCCAACGTCTTTAAAACTGGATCATTGGGATCGTCAGGATTAACGACCTCACAGATT
GATCCACCTGTGTCACAGACAAAGTTAGGATAGCCATAAATTCTATTGGATTTATTAATGAAATTTGGGGTATCTAACAGTGCTTTTATTTCTGCAATAT
GATGCTGTGCCTGGCGTTTTTTATATTCTTCTATACTGAGCCCGCCTTTTCCCTGATTGCCTGGCTTGCCCAGATAATTGGAAAGCGGAGCAAGGTTATC
AAAGCTAATGTTTGACGATATATGAATAGAGTCACCTAGAAGTAATTCTCGCAAATAAGGCGATTTCATTGCGTCTTCTTTGTAGCTATCCTCTATATGC
TCTCCCATATACCTAGTACCAATGCGATAATCGATTGAATAATGGTACCACTCACCCGATTGACGCAGAATAT

Translation

[309 - 764/873]   indirect strand
>GOS_1149010 Translation [309-764   indirect strand]
NFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKTLAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDDF
VRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRSI

[ Warning ] 5' incomplete: does not start with a Methionine

Annotator commentaries

The metagenomic sequence assigned to me is GOS_1149010 (genomic DNA; Galapagos Islands: Devil's Crown, Floreana Island), given in full under "Genomic Sequence" above.


Its length is 873 bp.


step 1. ORF finding: I used SMS (Sequence Manipulation Suite) to search for ORFs. With the sequence as input, one ORF was detected. Candidates of varying length were possible; I first chose the longer one, but after running blastp and the domain searches I found that it was not a true ORF, so I took that into account.


a) The sequence is coding: it has an ORF on the indirect strand.

b) The ORF is considered a true ORF because its length is greater than 150.


step 3. Protein Molecular Weight

The Sequence Manipulation Suite: Results for 152 residue sequence "GOS_1149010 Translation [309-764 indirect strand]" starting "NFINKSNRIY".

The protein weighs 17.44 kilodaltons
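
The reported weight can be cross-checked with a few lines of Python. This is a minimal sketch and not the SMS implementation: it assumes standard average residue masses (in daltons) and adds one water molecule for the free termini. The function name `protein_mw_kda` is illustrative only.

```python
# Average residue masses in daltons (amino acid minus one water).
# Assumption: standard average-mass table; SMS may round differently.
AVERAGE_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # added once for the N- and C-terminal groups


def protein_mw_kda(sequence: str) -> float:
    """Return the average molecular weight of a protein in kilodaltons."""
    residues = sequence.strip().upper()
    total = sum(AVERAGE_RESIDUE_MASS[aa] for aa in residues) + WATER_MASS
    return total / 1000.0


# Translation [309-764, indirect strand] as given in this annotation.
ORF_PROTEIN = (
    "NFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKTLAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDDF"
    "VRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRSI"
)

if __name__ == "__main__":
    print(len(ORF_PROTEIN), "residues")
    print(f"{protein_mw_kda(ORF_PROTEIN):.2f} kDa")
```

Small differences from the SMS figure are possible, depending on the mass table and rounding used.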


step 4. Protein domain: I searched PROSITE, CDD, PFAM, INTERPRO, PRODOM and SMART.

a) No conserved domains were found with PROSITE, PFAM, INTERPRO or CDD. One possible reason is that, during molecular evolution, the protein fold was rearranged and the corresponding region was lost.

b) PRODOM: one domain was found with PRODOM.


step 5. BLAST: I ran blastp with my translated ORF as the query against the non-redundant protein database. The only parameter I changed was the E-value threshold: the default is 10 and I used 1000.

a) As a result I obtained homologues from alpha-proteobacteria.

b) Many good hits were reported.

c) The query shows strong homology with an alpha-proteobacterium (58% identity and e-value of 3e-48), although no domains were reported.

d) Based on these computational results, I hypothesised that the metagenomic sequence may encode a partial protein, since it shows strong homology.


step 6. Taxonomy Report: In this step I obtained the lineage report from the BLAST analysis and analyzed it to select ingroups and outgroups.


step 7. Multiple alignment: In this step I built a multiple alignment of the ingroup and outgroup sequences with phylogeny.fr. The result was a good alignment between the ingroup homologues and the query sequence.


step 8. Tree: In phylogeny.fr, using the PhyML method with 100 bootstraps, I obtained the phylogenetic tree. A detailed analysis of the tree suggests directional evolution among the bacterial species selected as ingroup and outgroup sequences.
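
The 100 bootstraps refer to resampling alignment columns with replacement and rebuilding a tree on each replicate. Below is a minimal Python sketch of the resampling step only; the tree building done by PhyML is not reproduced. The function name, the fixed seed and the choice of a 27-column fragment (copied from the MUSCLE alignment in the raw results) are assumptions made for illustration.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Return one bootstrap replicate: columns drawn with replacement."""
    n_cols = len(next(iter(alignment.values())))
    if any(len(seq) != n_cols for seq in alignment.values()):
        raise ValueError("all aligned sequences must have the same length")
    # Draw as many column indices as the alignment has columns.
    picks = [rng.randrange(n_cols) for _ in range(n_cols)]
    return {name: "".join(seq[i] for i in picks) for name, seq in alignment.items()}


# Fragment of three aligned sequences taken from the alignment in this page.
FRAGMENT = {
    "GOS_114901": "NFINKSNRIYGYPNFVCDTGGSICEVV",
    "gi|1267268": "YFIDRAQAIYGYPHFICDTGGSICEVV",
    "gi|1462782": "RFIRRATEIYGYQNFVCDSGGSICEVV",
}

rng = random.Random(0)  # fixed seed so the sketch is repeatable
replicates = [bootstrap_alignment(FRAGMENT, rng) for _ in range(100)]
```

Each replicate keeps the sequence names and the alignment width; only the sampled columns differ. The bootstrap support of a branch is then the share of replicate trees that contain it.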



step 9. Biological process: As no conserved domains were reported, the biological process cannot be predicted.


Using the evidence from all the software tools, careful analysis and the phylogenetic tree, I conclude that the sequence belongs to an alpha-proteobacterium, or at least to a bacterium.










ORF finding

PROTOCOL

a) SMS ORFinder / forward strand / frames 1, 2 & 3 / min 60 AA / 'atg codon' initiation / 'universal' genetic code

b) SMS ORFinder / reverse strand / frames 1, 2 & 3 / min 60 AA / 'atg codon' initiation / 'universal' genetic code
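
The protocol above can be approximated in Python. This is a simplified sketch, not the SMS ORFinder code. It assumes that an ORF runs from the first ATG after the previous stop to the next in-frame stop codon (TAA, TAG, TGA in the universal code). It reports 1-based coordinates that include the stop codon, and it ignores ORFs that run off the end of the sequence. The function names and the `min_aa` parameter are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def find_orfs(dna, min_aa=60):
    """Yield (frame, start, end) for ATG..stop ORFs on the given strand.

    frame is 1, 2 or 3; start and end are 1-based and inclusive,
    and end covers the stop codon.
    """
    dna = dna.upper()
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a new ORF at the first ATG
            elif start is not None and codon in STOP_CODONS:
                n_aa = (i - start) // 3  # residues before the stop
                if n_aa >= min_aa:
                    yield frame + 1, start + 1, i + 3
                start = None  # look for the next ATG


# Toy example: ATG AAA AAA AAA TAA in frame 3 of a 19-base string.
toy = "CC" + "ATG" + "AAA" * 3 + "TAA" + "GG"
toy_orfs = list(find_orfs(toy, min_aa=3))
```

For the reverse strand, the same function is applied to `reverse_complement(sequence)`, so the coordinates it reports are counted on the reverse strand.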


RESULTS ANALYSIS

Using SMS ORFinder, I analyzed the metagenomic sequence to find its open reading frames (ORFs).


1) On the direct strand I found:

a) No ORFs were found in reading frame 1.

b) No ORFs were found in reading frame 2.

c) No ORFs were found in reading frame 3.


2) On the indirect strand I found:

a) No ORFs were found in reading frame 1.

b) No ORFs were found in reading frame 2.

c) One ORF was found in reading frame 3 on the reverse strand.

d) The retained coding region extends from base 309 to base 764.

e) It ends with a stop codon.

f) The sequence is coding from the 5' to the 3' end.

g) The ORF is complete at the 3' end, which is why I subtracted the 3 nucleotides of the STOP codon from the end position.

h) As an open reading frame ending with a stop codon, it potentially signals a coding sequence that is translated into a functional polypeptide.


I chose the ORF in reading frame 3 of the indirect strand. It is the longest ORF, but it contains a stop codon at the end of the sequence, so I removed the last three nucleotides to eliminate the stop codon, which moved the end position from 767 to 764. The retained region 309-764 is 456 nucleotides long, so the length of the ORF is now 152 aa.
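
The coordinate bookkeeping above can be checked with a short Python sketch. It assumes 1-based inclusive coordinates on the reverse strand, as in the SMS output, and uses the standard genetic code. The helper names are illustrative, and only a few codons copied from the reported ORF are translated as examples.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # drop an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def span_length(start, end):
    """Length in bases of a 1-based inclusive interval."""
    return end - start + 1


SMS_START, SMS_END = 66, 767   # ORF as reported by SMS, stop codon included
KEPT_START = 309               # start of the retained region
KEPT_END = SMS_END - 3         # stop codon removed

full_codons = span_length(SMS_START, SMS_END) // 3     # codons incl. stop
kept_codons = span_length(KEPT_START, KEPT_END) // 3   # residues retained
skipped_codons = (KEPT_START - SMS_START) // 3         # residues dropped at 5'

first_orf_codons = translate("ATGGGAGAGCATATA")   # bases 66-80
first_kept_codons = translate("AATTTCATTAATAAA")  # bases 309-323
last_orf_codons = translate("AGTATTTAA")          # bases 759-767
```

Both intervals are multiples of three and share the same reading frame, since the distance between the two start positions is also a multiple of three.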


RAW RESULTS
a) direct strand 
 
No ORFs were found in reading frame 1.

No ORFs were found in reading frame 2.

No ORFs were found in reading frame 3.

----------------------------------------------------------------
b) indirect strand

No ORFs were found in reading frame 1.

No ORFs were found in reading frame 2.

>ORF number 1 in reading frame 3 on the reverse strand extends from base 66 to base 767.
ATGGGAGAGCATATAGAGGATAGCTACAAAGAAGACGCAATGAAATCGCCTTATTTGCGA
GAATTACTTCTAGGTGACTCTATTCATATATCGTCAAACATTAGCTTTGATAACCTTGCT
CCGCTTTCCAATTATCTGGGCAAGCCAGGCAATCAGGGAAAAGGCGGGCTCAGTATAGAA
GAATATAAAAAACGCCAGGCACAGCATCATATTGCAGAAATAAAAGCACTGTTAGATACC
CCAAATTTCATTAATAAATCCAATAGAATTTATGGCTATCCTAACTTTGTCTGTGACACA
GGTGGATCAATCTGTGAGGTCGTTAATCCTGACGATCCCAATGATCCAGTTTTAAAGACG
TTGGCAGAAAATACTCTTATGGTGTGGATTGAGGGCTCAAGCCACCATACAGATGAATTG
ATAAAACGGTTTGACGAGAACCCTAAACCTATGTGTTACGCCCCAGATTTTCTAGATCTA
AAATGGAGAGAGTATCTCAAGATAAATAAGTGTTCTGTTGAAAACGTAGACCCCGATGAT
TTTGTGCGCTGGACCTATAGGGAAGCCATGGCGCACAGAGACCCAATTTATAAGTCAATG
GCTTCCTGGGGAATAACAGTACAAGCTGATTTAATCTCTCAAGTAAAAACACCTTCGGAG
TTTAATTCATTAATTGGATCAACAATTTCAAGAAGTATTTAA

>Translation of ORF number 1 in reading frame 3 on the reverse strand.
MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE
EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT
LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD
FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRSI*

Multiple Alignment

PROTOCOL


A) Multiple alignment of both ingroup and outgroup sequences using MUSCLE with default parameters, without curation (CLUSTALW format)

B) Multiple alignment of both ingroup and outgroup sequences after curation (Gblocks result)



RESULTS ANALYSIS


I aligned my query sequence with its best BLAST hits and obtained good alignments.

The Gblocks 0.91b results show that the alignment was assumed to be protein and that 53 positions were retained after curation.
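
One hedged way to put a number on the quality of an alignment is the percent identity between the query and a homologue over the aligned columns. The Python sketch below uses one 27-column fragment copied from the MUSCLE alignment in the raw results. The function name is illustrative, and the choice of denominator (columns where neither sequence has a gap) is an assumption, since other tools normalise differently.

```python
def percent_identity(a, b):
    """Percent identity over columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


# Fragment of the query and of one homologue, as aligned by MUSCLE.
QUERY_FRAGMENT = "NFINKSNRIYGYPNFVCDTGGSICEVV"   # GOS_114901
HIT_FRAGMENT = "YFIDRAQAIYGYPHFICDTGGSICEVV"     # gi|1267268

if __name__ == "__main__":
    print(f"{percent_identity(QUERY_FRAGMENT, HIT_FRAGMENT):.1f}% identity")
```

The value for a short fragment says nothing about the whole protein; applying the function to full alignment rows gives a figure closer to what BLAST reports.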

RAW RESULTS
CLUSTAL FORMAT: MUSCLE (3.7) multiple sequence alignment


gi|2585451      -----------------------------MPPISRITLMGTSGVGKTTLATLLHGAG-WF
GOS_114901      ------------------------------------------------------------
gi|1267268      -------------------MFYNSADEWHNAREKRVLIFGMSGLGKTHVANMLRDTGEWY
gi|1462782      -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDSGDWF
gi|1264624      -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDSGDWF
gi|2216394      -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDNGDWF
gi|7746357      -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDSGDWF
gi|2552634      -------------------MSYKSADDWRKSRHKRVLLLGMSGLGKTHVSNMLRAQGDWF
gi|8906952      -------------------MIYDSSAAWRGAAHKRLLFFGMSGLGKTRLAAMLREGGDWF
gi|1590441      -------------------MLYDTPAAWASAPEKHVLLFGMSGLGKTFVSNRLRESGNWF
gi|1267313      -------------------MLYPTAQAWRDAPNKRVVLFAMSGLGKTHVSNILRQAGDWF
gi|1637316      -------------------MLYSSASDWHAAPRKSVLVFGMSGLGKTHMSNLLRASGSWF
gi|8394234      -------------------MLYPSADAWRAAPHKHVLVFGMSGLGKTHMAHLLRQSGGWF
gi|8395356      -------------------MLYPSADAWRAAPHKHVLVFGMSGLGKTHMAHLLRQSARWF
gi|1499152      -------------------MIYTSAQEWRDAAEKRVLFYGMSGLGKTHLSNMLRDAGDWF
gi|1492038      -------------------MIYTSAQDWREATHKRVLFYGMSGLGKTHVSNTLRDAGNWF
gi|8570278      -------------------MIYTSAQDWREATHKRVLFYGMSGLGKTHLSNMLRDAGDWF
gi|2545124      -------------------MIYNSATDWRDAARKKVLFFGMSGLGKSYVSNTLRDAGDWF
gi|2594166      -------------------MLYTSAQAWRDAPSKRVLFFGMSGLGKTYISNILRHSGQWF
gi|9908121      ----------------------------------------MSGLGKTYISNILRRSGQWF
gi|2544745      -------------------MLYSSAEEWRNAPQKRVLFFGMSGLGKTYISNILRGSGSWF
gi|1637363      -------------------MLYSSAEEWRKAPHKRVLFFGMSGLGKTYISNILRGAGSWF
gi|1637411      -------------------MLYSSAEEWRKAPHKRVLFFGMSGLGKTYISNILRGAGSWF
gi|1267376      -----------MIEEAGRTMLYSSAQEWRDAPQKRVLFFGMSGLGKTYVSNILRGAGSWF
gi|2544661      -------------------MLYSSAQDWREAPNKRVLFFGMSGLGKTYVSNVLRGAGSWF
gi|5669698      MTRPAAPISKGANSKDEGGMIYGSARDWRAASRKKVLFFGMSGLGKTHVSNRLRDNGDWF
gi|2604339      -------------------MIYGSAREWREAARKKVLFFGMSGLGKTYVSNILRESGDWF
gi|1186020      --------------------MKLSAIKFKNSSHKCLTLLGMSGVGKTHLAKLLSYQDKYF
gi|1482441      --------------------MKLSFSEFKNSNHKRLTLLGMSGVGKTHLAKLLSHQDKYF
gi|7431626      -------------------MHF-TPESFRAWPTKRVTLLGMSGVGKTHISGMLRGH-DWF
gi|2417773      -------------------MKF-TVDEFRAWKRKCVTLLGMSGVGKTHLSGLLRRH-DWF
gi|7716616      --------------------MKLSVKEFKAWKNRRITLLGMSGVGKTRLAYMLRKH-HWF
gi|2544350      ----MFRVQYLDSAYQPQPSMKLSVKEFKAWKNRRITLLGMSGVGKTRLAYMLRKH-HWF
gi|2209336      --------------------MRLTRQEFRDWPHKSITLLGMSGVGKTRLAHILRQR-HWF
gi|2248161      --------------------MRLSPEEYRQLDHRRITLLGMSGVGKTRLSNMLRRE-EWF
                                                                            

gi|2585451      HYSGDYRIATRYLNEPISDWLTVLARREPTLAALLRDDAVSVKGKVSIERLHILSAFVGK
GOS_114901      ------------------------------------------------------------
gi|1267268      HYSIDYRIGTRYMGEHILDNIKREAMKEPFIANLLRTDSIYIGSNISFNNLAPLSTYLGK
gi|1462782      HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|1264624      HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|2216394      HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|7746357      HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|2552634      HYSVDYRIGTRYMGELIADNAKHEAMKVPFLREMLMSDSIYIASNITFENLAPLSAYLGK
gi|8906952      HYSIDYRIGTRYMGELIADNAKREAMRVPLLRELLMTDSIYIGSNITFDNLAPLSTYLGK
gi|1590441      HYSVDYRIGTRYMGEHIVDNFKREAMQVPFLRDLLMSDSIYIGSNITLENLSPVSTYLGK
gi|1267313      HYSIDYRIGTRYMGEYIVDNAKRQAMQVPFLRDLLLSDSIYIGSNISFNNLTPVSTYLGK
gi|1637316      HYSIDYRIGTRYMGEYIADNAKLEAMKVPFLRDLLMSDSIYIGSNITFDNLSPVATYLGK
gi|8394234      HYSIDYRIGTRYLGETIADNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVAAWLGK
gi|8395356      HYSIDYRIGTRYLGETIADNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVATWLGK
gi|1499152      HYSIDYRIGTRYMGELIADNAKAHAMQVPFLRDLLLTDSIYIGSNITFDNLAPVSTYLGK
gi|1492038      HYSIDYRIGTRYMGELIADNAKAHAMQVPFLRDLLLSDSIYIGSNITFHNLTPVSTYLGK
gi|8570278      HYSIDYRIGTRHMGELIADNAKAHAMQVPFLRDLLLSDSIYIGSNITFHNLTPVSTYLGK
gi|2545124      HYSIDYRIGTRYMGEYITDNAKAEAMKVPFLRELLLTDSIYIGSNITFQNLTPVSAYLGK
gi|2594166      HYSADYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIDISSNITFENLSAVTAYLGV
gi|9908121      HYSADYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIDISSNITFENLSAVTAYLGV
gi|2544745      HYSIDYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|1637363      HYSIDYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|1637411      HYSIDYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|1267376      HYSIDYRIGTRYMGEYIADNAKAEAMKNPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|2544661      HYSVDYRIGTRYMGEYIADNAKAEAMKVPFLRELLLSDSIFIGSNITFENLTPVAAYLGK
gi|5669698      HYSIDYRIGTRYMGEFITDNAKAEAMKVPFLRDLLMSDSIYIGSNISFENLTPVSTYLGK
gi|2604339      HYSIDYRIGTRYMGEYITDNAKAEAMKVPFLRDLLLTDSIYIGSNISFENLTPVSAYLGK
gi|1186020      HYSGDYRIGAEYLNDKILDNIKNYIRQDKWLKDLLDNESISIQNHITFDNLSSVSAFLGK
gi|1482441      HYSSDYRIGAKYLNDKILGNIKNHVKQDKWLKDLLDKESISIQNHITFDNLSSVSTFLGK
gi|7431626      HFSGDYRIGTRYLDEPILDLIKQQAMQVPFLRDLLRRDWIDIKNNIKIHDLGPVLTFVGK
gi|2417773      HYSGDYRIGTRYLDEPILDLIKSRAMRDPFLRDLLRRDWITIRNVIKINDLGPVLSFIGK
gi|7716616      HYSGDYRVGTRYLDEAILDNIKQKVMGIPFLRDLLYSDSIYIRNNINVDNLKPVSSFLGK
gi|2544350      HYSGDYRVGTRYLDEAILDNIKQKVMGIPFLRDLLYSDSIYIRNNINVDNLKPVSSFLGK
gi|2209336      HYSGDYRIGTRYLDEPILDNIKHQAMQVPFLRELLRSDSIYILNNITVDNLQPVSSFLGK
gi|2248161      HYSGDYRIGTRYLSEPILDNIKAQAMAVPFLRDLLRSDSIQIINNITVDNLHPVASFLGK
                                                                            

gi|2585451      LG---RDGYDARTFIERQRLFNEAERAAMYDVPAFIERAERLYGYKAFINDAGGSICEID
GOS_114901      ---------------------------------NFINKSNRIYGYPNFVCDTGGSICEVV
gi|1267268      PGDPAKGGVPFDEYMRRQALHREAEIASLLDSRYFIDRAQAIYGYPHFICDTGGSICEVV
gi|1462782      PGNPNRGGLDFADYLERQGQHREAEVAALHDTQRFIRRATEIYGYQNFVCDSGGSICEVV
gi|1264624      PGNPNRGGLAFADYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVV
gi|2216394      PGNPNRGGLAFADYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVV
gi|7746357      PGNPNRGGLAFADYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVV
gi|2552634      PGDPELGGLSIDEYRRRQDQHRAAEISALLDTPHFMDRAQAIYGYDHFVCDTGGSICEVV
gi|8906952      PGDPARGGLPFADYVRRQAEHRDAETAALLDTGHFIDRAGSLYGYPHFVCDTGGSICEVV
gi|1590441      PGNPALGGVPIDEYRLRQAQFRRAEIAALQDTAHFIDRAKALYGYPHFVCDSGGSICEWV
gi|1267313      PGDPDRGGLPMHEYVRRQGQFERAERLALLDTEYFIDRAQDLYGYPHFVCDTGGSICEWV
gi|1637316      PGNPAKSGLPFQEYERRQDQFQRAEIAALQDTAHFAKRAEDLYGYPNFICDTGGSICEWV
gi|8394234      PGDAAKGGLPMAEYASRQEAFKQAEIAALRDTGHFARRAQTLYGYPHFICDTGGSICEWV
gi|8395356      PGDAAKGGLPMAEYASRQDAFKQAEIAALRDTGHFARRAQALYGYPHFICDTGGSICEWV
gi|1499152      PGDPAKGGLPIDEYRRRQQQFRTAEIAALMDTAHFIDRAERLYGYPNFVCDTGGSICEWV
gi|1492038      PGNPAKGGLPIEEYRIRQEQFRQAEIHALMDTAYFIRRADALYGYPHFICDTGGSICEWV
gi|8570278      PGNPVKGGLPIEEYRIRQEQFRQAEIHALMDTAYFIRRAGALYGYPHFICDTGGSICEWV
gi|2545124      PGNPERGGLEISEYQLRQEQFRQAEIYALLDTKYFVERARRLYSYPNFICDTGGSICEWV
gi|2594166      PGDARKGGLPIEEYRTRQDQFRRAEIDALLDTRYFIDRSERIYGYPHFICDSGGSICEWV
gi|9908121      PGDAQKGGLPIEEYRTRQDQFRRAEIDALLDTRYFIDRSDRIYGYPHFICDSGGSICEWV
gi|2544745      PGNPDKGGLPMAEYTRRQDQFRTAELNALRDTGYFIDRAERLYDYPNFICDTGGSICEWV
gi|1637363      PGDPAKGGLAMQEYTRRQDQFRTAELNALRDTGYFIDRAARLYDYPNFICDTGGSICEWV
gi|1637411      PGNPAKGGLAMQEYTRRQDQFRTAELNALRDTGYFIDRAARLYDYPNFICDTGGSICEWV
gi|1267376      PGDPAKGGLPIEEYRRRQEQFRLAEIHALLDTEYFIDRADRLYGYSNFICDTGGSICEWV
gi|2544661      PGDVAKGGLPIAEYKRRQEQFRVAEIRALLDTEYFEDRAQQLYGYPNFICDTGGSICEWV
gi|5669698      PGAPGKGGLPIAEYRRRQDQFRHAEIYALLDTEYFIDRAQRLYGYPHFICDTGGSICEWV
gi|2604339      PGNPDLGGLPMPEYRRRQEQFRLAEIHALLDTEYFIDRAQRLYGYPHFICDTGGSICEWV
gi|1186020      AGNPELGGTPIGTFIARQTMHLNAETKAMLDVPQFIQKAKTQ-GFNHFINDAGGSLCEL-
gi|1482441      TGNPELGGTSIDTFIARQIMHLNAEIKTMLDVPQFIKKAKAQ-GFNHFINDAGGSLCEL-
gi|7431626      LGGPEWGGLPLDEFTRRQAAYREAEIAAMRDVPGFIRKGQEIYGYPHFVNDVGGSLCELD
gi|2417773      LGNPEQGGLPLDEFSRRQALYREAEIAAMFDLPEFVRKAQEVYGYAHLVNDVGGSLCELE
gi|7716616      LGNPEQGGLGLTEFKRRQELHRQAEIAAMKDVPVFIRRAQQIYDYPHFVNDAGGSISELD
gi|2544350      LGNPEQGGLGLTEFKRRQELHRQAEIAAMKDVPVFIRRAQQIYDYPHFVNDAGGSISELD
gi|2209336      LGDPERGGLPLTEFKRRQALHHTAEIAAMRDVPEFIGKARSIYGYEHFVNDAGGSVCELD
gi|2248161      LGNPELGGLPLTEFKRRQNLHHEAEVQAMLDVPDFIHKSQTLFGYPHFVNDAGGSVCELD
                                                  *  ..    .:  :: * ***:.*  

gi|2585451      D-----DALMQTLAAHTLFVYIDTDEALYAE-LEARAMPTPNPSATTPASCKK-------
GOS_114901      NPDDPNDPVLKTLAENTLMVWIEGSSHHTDE-LIKRFDENPKPMCYAPDFLDLKWREYLK
gi|1267268      EPNNPDDPVLSSLSQDNLLVWIKGDDSHIDR-LAARFDRAPKPMYYSPAFLTEKWSQYLS
gi|1462782      DPDNPNDPVMSDLAGNLLMVWIKGSEAHTAE-LVRRFDKAPKPMYYQPAFLRAAWTDYRV
gi|1264624      DPDNPNDPVMSDLAGNLLMVWIKGSDAHTAE-LVRRFDRAPKPMYYQPGFLHAAWTDYRV
gi|2216394      DPDNPNDPVMSDLAGNLLMVWIKGSEAHTAE-LVRRFDRAPKPMYYQPGFLHAAWTDYRV
gi|7746357      DPDNPNDPVMSDLAGNLLMVWIKGSDAHTAE-LVRRFDRAPKPMYYQPGFLHAAWTDYRV
gi|2552634      DPSNKNDEVLNTLSNDLLIVWIEGSDAHAAE-LARRFDLAPKPMYYQPEFLEAAWAEYLN
gi|8906952      DPDDPADPILTHLSERLLMVWIRGSEAHTDA-LVRRFDRAPKPMYYAPAFLEACWSAYLA
gi|1590441      DAEDPEDPLLRDLSAVTLLVWIKGDAAHTEE-LIRRFDRDPKPMAYQPEFLDRVWAEYLA
gi|1267313      DPQDAEDPILTALSSVALMVWVRGSEAHTED-LIRRFDRAPKPMAYRPDFLSSAWTRYLD
gi|1637316      DPEDDADPLMTELANHCLMIWIKGDEAHTQE-LIRRFDRAPKPMAYQPAFLTRVWREYLS
gi|8394234      DPDDDNDPLMQTLSAECLPVWIKGDHAHTQE-LIRRFDKAPKPMAYQPEFLLRVWQEYLR
gi|8395356      DPDDDNDPLMQTLSAECLPVWIKGDDAHTQE-LIRRFDKAPKPMAYQPEFLLRVWQEYLS
gi|1499152      DGDDPADPLLTELSRHCLLIYIEGSEAHTQS-LIRRFDRAPKPMAYQPEFLDQAWVQYLN
gi|1492038      DGDDPEDTILKELSAHCLLIYIEGSEAHTAE-LIRRFDRAPKPMAYQPAFLDAAWAEYLK
gi|8570278      DGNNHDDPILTELSAQCLLIYIEGSEAHTAE-LIRRFDRAPKPMAYQPAFLDAAWAEYLR
gi|2545124      EPEDPNDQILTELSSQTLMIWIKGDDAHTEE-LIRRFDKAPKPMSYQAAFLQRVWSEYLA
gi|2594166      DADDPNDKLLTELSAHTLLVWIKGTEDHTQE-LIRRFDRAPKPMAYQPAFLTRVWEEYLS
gi|9908121      DANDPNDPLLSELSAHTLLVWIKGTEDHTQE-LIRRFDRAPKPMAYQPAFLTRVWEEYLN
gi|2544745      EADDPSDPLLTQLSQQTLMVWIKGDEAHTQE-LIRRFDRAPKPMAYQPAFLARVWQEYLK
gi|1637363      DANDPGDPLLTELAKNTLMVWIKGDEAHTQE-LIRRFDRAPKPMAYEPAFLTRVWQEYLK
gi|1637411      DANDPNDPLLTELSQHTLMVWIKGDEAHTQE-LIRRFDRAPKPMAYEPAFLARVWQEYLK
gi|1267376      DANDPNDPVLSELSKHCLMVWIKGDDAHTEE-LVRRFDRAPKPMSYQPEFLGRVWEEYLK
gi|2544661      DVDDPKDQIMSELSKHTLMVWIKGDEAHTEE-LVRRFDRAPKPMSYQPEFLDRVWQEYLK
gi|5669698      DPEDPRDQVLSELSKHTLMIWIEGSEAHTAE-LIRRFDKAPKPMSYQPEFLSRVWEAYLA
gi|2604339      DPDDPADRVLSELSRHTLMIWIKGDDDHTAE-LIRRFDKAPKPMSYQAEFLTRVWQDYLD
gi|1186020      ----DNEQVYQTLANNTVILYIRASKTNETA-LVECAQTHPKPLYYQANFLKQQLSIYLQ
gi|1482441      ----DNEQVYQTLSDNTVILYIRASKTNEIA-LIKRAQTHPKPLYYQANFLKHELDIYLQ
gi|7431626      EPG-----VIELLAEHTLILYIQTTSREEEETLIRRAQSDPKPLYFRPAFLEENLPRYLE
gi|2417773      EPR-----VIDMLARHSLILYIRVPEADEIK-LIERAQADPKPLYYRPEFLQTAVKDYLE
gi|7716616      DPE-----VLQTLDQHTLILYIKATQRDERK-LIQRAEEEPKPLYYRESFLDEQLAIYRE
gi|2544350      DPE-----VLQTLDQHTLILYIKATQRDERK-LIQRAEEEPKPLYYRESFLDEQLAIYRE
gi|2209336      DPE-----TLELLAEHTLLIYIQATRQDEEE-LIRRAESAPKPLYYREAFLDEQLREYKQ
gi|2248161      SPG-----VLETLAEHTLILYIKATDEDERQ-LIERAEHDPKPLYYREAFLDQQLAEYMR
                            *    : :::          *       *:*                 

gi|2585451      ------------------------------------------------------------
GOS_114901      INKCS-VENVDPDDFVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLI
gi|1267268      LNGVA-EDKVDPDDFVRWTYRQAMKHREPIYAAMAEKWGVTVMADDVANVSNATDFDRLI
gi|1462782      EKGLT-EEQVDPDDFIRWTYARALAHRQPRYAAMAR-WGITVTAEEVAEAGNAPAFVDLI
gi|1264624      EKGLT-EEQVDPDDFIRWTYARALAHRQPRYEAMAR-WGITVTAEEVAEARDAQGFVDLI
gi|2216394      EKGLT-EEQVDPDDFIRWTYARALAHRQPRYEAMAR-WGITVTAEEVAEARDAQGFVDLI
gi|7746357      EKGLT-EEQVDPDDFIRWTYARALAHRQPRYEAMAR-WGITVTAEEVAEARDAQGFVDLI
gi|2552634      NNKVS-EGDVNPDDFVRWTYSRALAHRAPLYREIAQ-WGVAVQAHDIAQVKSAGAFEALI
gi|8906952      ETGTA-PGAVDPDAFIRWTYARAMAHREPLYAGMAR-WGVTLEASDVAEVRDEADITALV
gi|1590441      QNRMA-PEAVDPDAFIRWTYAQALAHRQPRYAAMAQNWGVTVEADDIARMRDAADFEALI
gi|1267313      ATGTS-EEGIDPDAFIRWTYAQALAHRQPLYEAMASNWGITVDASDMSGVRTAADFEDII
gi|1637316      EKNCT-ETEVDPDDFIRFTYAKALAHRQPRYQAMSR-WGITVTASEVSALKTPDDFYALV
gi|8394234      ENNCQ-EDDVDPDAFIRWTYAQALAHRQPRYEAMSR-WGVTVTADQVAGLKSEDDFNDLI
gi|8395356      ENNCQ-EDDVDPDAFIRWTYAQALAHRQPRYEAMSR-WGVTVTADQVAGLKSEDDFNDLI
gi|1499152      ENKCE-ADEVDPGAFIRWTYARALAHRKPRYEAMAK-WGITVTADEVAETRDADAFDALV
gi|1492038      ENNCE-EDEVDPDSFIRWTYARALAHRQPRYEAMVK-WGVSVTANEVATVKTAQEFEALI
gi|8570278      ENKCE-EGDVNPDSFIRWTYARALAHRQPRYEAMVK-WGVSVSATEVAGVKTAQDFEALI
gi|2545124      LNTCS-GDEVDPDAFIRWTYARALAHRQPRYEAMARNWGVTIAADDVAKVRDSADFNELI
gi|2594166      ENNIS-DEDVDPHSFLRWTYAQALAHRQPRYEAMAKNWGVTVTADQVAQVRSTLDFEELI
gi|9908121      ENNIT-DADVDPHAFLRWTYAQALAHRQPRYEAMAAHWGVTVTADQVAHVQNTSDFEELI
gi|2544745      ENALS-EEQVDPDTFIRWTYAQALAHRQPRYEAMAKNWGVTVTADQIASVRSEADFTDVI
gi|1637363      ENTLS-EADVDPDSFIRWTYAQALAHRQPRYAAMAEKWGVTVTADQISTIRTEADFTDVI
gi|1637411      ENTLS-EADVDPDSFIRWTYAQALAHRQPRYAAMAAHWGVTVTADQISGIRTEADFKDVI
gi|1267376      EMSIS-EGDADPDAFIRWTYAQALAHRQPRYESMARNWGVTVTADQISAVQSEADFNELI
gi|2544661      ENGIA-EGDVDPDAFIRWTYAQALAHRQPRYQAMADNWGITVTADQISTVKDEAGFVDLI
gi|5669698      ENDCT-PDEVDPDAFIRWTYAQALAHRQPRYRAMARNWGITVTAEQIGAIRDAADFDALI
gi|2604339      RHGCA-EADVNPDDFIRWTYAQALAHRQPRYRAMAENWGVTVTADQIGQVRDAGDFDELI
gi|1186020      KNQLIYVAQIDPNEFVRWVFPRLLEHRKPKYEAIAKKYGYTIDSEGLYRCKNADEVFELI
gi|1482441      ENQLIYIAQINPNKFVRWVFPRLLEHRKPKYEAIAKKYGYIIDNEDLYQCRSANEIFELI
gi|7431626      EKGLAYVAQIEPNDFARWIFPRLFHSRIPRYEAIAGPHGYTVTSEEAAQVRDERDFLALL
gi|2417773      LRQLDYVAQIEPDDFTRWVFPRLFHSRVPRYEAIAQPHGYTVSSEDVAQVRDEADFLALV
gi|7716616      KHDLPYVALIEPDEFVRWIFPRLFYSRLPRYQAIADEYGYTVSTDQLAQVNDETGFLELL
gi|2544350      KHDLPYVALIEPDEFVRWIFPRLFYSRLPRYQAIADEYGYTVSTDQLAQVNDETGFLELL
gi|2209336      AHDIEYVAQIDPDDFVRWMFPRLFRSRIPRYEAIARDYGYTITTEELGRVKSEQDFLALI
gi|2248161      DEGLDYVAQIDPDAFVRWVFSRLFRARLPRYEAIAEAHGYTISTTELAQVRNAQDFDDLV
                                                                            

gi|2585451      ------------
GOS_114901      GSTISRSI----
gi|1267268      ADALGKQAN---
gi|1462782      ARTLDRARTEA-
gi|1264624      ARTLDRTRTHA-
gi|2216394      ARTLDQARTHA-
gi|7746357      ARTLDQARTHA-
gi|2552634      ASTLEMRG----
gi|8906952      ADALDRRAS---
gi|1590441      ADALGNRSSSA-
gi|1267313      AKALPD------
gi|1637316      ASYL--------
gi|8394234      AGTLSA------
gi|8395356      AGTLSA------
gi|1499152      ATALERST----
gi|1492038      ATALERRAATPI
gi|8570278      ATALERRAATPI
gi|2545124      ERTLETRPNPT-
gi|2594166      ASALEARSEQT-
gi|9908121      ASALETRAEQT-
gi|2544745      ANALELRAQAF-
gi|1637363      AGALEARG----
gi|1637411      AGALEARG----
gi|1267376      ARTLEAGG----
gi|2544661      ASALEARS----
gi|5669698      ESSLENRPD---
gi|2604339      ERTLEARRAAS-
gi|1186020      YGAMN-------
gi|1482441      YRAMN-------
gi|7431626      EAAIARKGS---
gi|2417773      ESAIARSA----
gi|7716616      ETVVAKQP----
gi|2544350      ETVVAKQP----
gi|2209336      ETALER------
gi|2248161      CMALEREARL--
                            
=====================================================================                                             
------------------------------------------------------------------------
==========================================================================
Gblocks 0.91b Results
Processed file: input.fasta
Number of sequences: 35
Alignment assumed to be: Protein
New number of positions: 53 (selected positions are underlined in blue) 

                         10        20        30        40        50        60
                 =========+=========+=========+=========+=========+=========+
gi|258545142|re  -----------------------------MPPISRITLMGTSGVGKTTLATLLHGAG-WF
GOS_1149010_Tra  ------------------------------------------------------------
gi|126726873|re  -------------------MFYNSADEWHNAREKRVLIFGMSGLGKTHVANMLRDTGEWY
gi|146278201|re  -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDSGDWF
gi|126462407|re  -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDSGDWF
gi|221639416|re  -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDNGDWF
gi|77463575|ref  -------------------MIYRTGAEWRAAQAKRVLLFGMSGLGKTHVSTLLRDSGDWF
gi|255263460|re  -------------------MSYKSADDWRKSRHKRVLLLGMSGLGKTHVSNMLRAQGDWF
gi|89069526|ref  -------------------MIYDSSAAWRGAAHKRLLFFGMSGLGKTRLAAMLREGGDWF
gi|159044141|re  -------------------MLYDTPAAWASAPEKHVLLFGMSGLGKTFVSNRLRESGNWF
gi|126731321|re  -------------------MLYPTAQAWRDAPNKRVVLFAMSGLGKTHVSNILRQAGDWF
gi|163731600|re  -------------------MLYSSASDWHAAPRKSVLVFGMSGLGKTHMSNLLRASGSWF
gi|83942343|ref  -------------------MLYPSADAWRAAPHKHVLVFGMSGLGKTHMAHLLRQSGGWF
gi|83953562|ref  -------------------MLYPSADAWRAAPHKHVLVFGMSGLGKTHMAHLLRQSARWF
gi|149915282|re  -------------------MIYTSAQEWRDAAEKRVLFYGMSGLGKTHLSNMLRDAGDWF
gi|149203858|re  -------------------MIYTSAQDWREATHKRVLFYGMSGLGKTHVSNTLRDAGNWF
gi|85702786|ref  -------------------MIYTSAQDWREATHKRVLFYGMSGLGKTHLSNMLRDAGDWF
gi|254512450|re  -------------------MIYNSATDWRDAARKKVLFFGMSGLGKSYVSNTLRDAGDWF
gi|259416668|re  -------------------MLYTSAQAWRDAPSKRVLFFGMSGLGKTYISNILRHSGQWF
gi|99081219|ref  ----------------------------------------MSGLGKTYISNILRRSGQWF
gi|254474535|re  -------------------MLYSSAEEWRNAPQKRVLFFGMSGLGKTYISNILRGSGSWF
gi|163736332|re  -------------------MLYSSAEEWRKAPHKRVLFFGMSGLGKTYISNILRGAGSWF
gi|163741132|re  -------------------MLYSSAEEWRKAPHKRVLFFGMSGLGKTYISNILRGAGSWF
gi|126737663|re  -----------MIEEAGRTMLYSSAQEWRDAPQKRVLFFGMSGLGKTYVSNILRGAGSWF
gi|254466131|re  -------------------MLYSSAQDWREAPNKRVLFFGMSGLGKTYVSNVLRGAGSWF
gi|56696981|ref  MTRPAAPISKGANSKDEGGMIYGSARDWRAASRKKVLFFGMSGLGKTHVSNRLRDNGDWF
gi|260433938|re  -------------------MIYGSAREWREAARKKVLFFGMSGLGKTYVSNILRESGDWF
gi|118602078|re  --------------------MKLSAIKFKNSSHKCLTLLGMSGVGKTHLAKLLSYQDKYF
gi|148244187|re  --------------------MKLSFSEFKNSNHKRLTLLGMSGVGKTHLAKLLSHQDKYF
gi|74316263|ref  -------------------MHF-TPESFRAWPTKRVTLLGMSGVGKTHISGMLRGH-DWF
gi|241777342|re  -------------------MKF-TVDEFRAWKRKCVTLLGMSGVGKTHLSGLLRRH-DWF
gi|77166160|ref  --------------------MKLSVKEFKAWKNRRITLLGMSGVGKTRLAYMLRKH-HWF
gi|254435004|re  ----MFRVQYLDSAYQPQPSMKLSVKEFKAWKNRRITLLGMSGVGKTRLAYMLRKH-HWF
gi|220933681|re  --------------------MRLTRQEFRDWPHKSITLLGMSGVGKTRLAHILRQR-HWF
gi|224816189|re  --------------------MRLSPEEYRQLDHRRITLLGMSGVGKTRLSNMLRRE-EWF
                                                                             


                         70        80        90       100       110       120
                 =========+=========+=========+=========+=========+=========+
gi|258545142|re  HYSGDYRIATRYLNEPISDWLTVLARREPTLAALLRDDAVSVKGKVSIERLHILSAFVGK
GOS_1149010_Tra  ------------------------------------------------------------
gi|126726873|re  HYSIDYRIGTRYMGEHILDNIKREAMKEPFIANLLRTDSIYIGSNISFNNLAPLSTYLGK
gi|146278201|re  HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|126462407|re  HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|221639416|re  HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|77463575|ref  HYSVDYRIGTRYMGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGK
gi|255263460|re  HYSVDYRIGTRYMGELIADNAKHEAMKVPFLREMLMSDSIYIASNITFENLAPLSAYLGK
gi|89069526|ref  HYSIDYRIGTRYMGELIADNAKREAMRVPLLRELLMTDSIYIGSNITFDNLAPLSTYLGK
gi|159044141|re  HYSVDYRIGTRYMGEHIVDNFKREAMQVPFLRDLLMSDSIYIGSNITLENLSPVSTYLGK
gi|126731321|re  HYSIDYRIGTRYMGEYIVDNAKRQAMQVPFLRDLLLSDSIYIGSNISFNNLTPVSTYLGK
gi|163731600|re  HYSIDYRIGTRYMGEYIADNAKLEAMKVPFLRDLLMSDSIYIGSNITFDNLSPVATYLGK
gi|83942343|ref  HYSIDYRIGTRYLGETIADNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVAAWLGK
gi|83953562|ref  HYSIDYRIGTRYLGETIADNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVATWLGK
gi|149915282|re  HYSIDYRIGTRYMGELIADNAKAHAMQVPFLRDLLLTDSIYIGSNITFDNLAPVSTYLGK
gi|149203858|re  HYSIDYRIGTRYMGELIADNAKAHAMQVPFLRDLLLSDSIYIGSNITFHNLTPVSTYLGK
gi|85702786|ref  HYSIDYRIGTRHMGELIADNAKAHAMQVPFLRDLLLSDSIYIGSNITFHNLTPVSTYLGK
gi|254512450|re  HYSIDYRIGTRYMGEYITDNAKAEAMKVPFLRELLLTDSIYIGSNITFQNLTPVSAYLGK
gi|259416668|re  HYSADYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIDISSNITFENLSAVTAYLGV
gi|99081219|ref  HYSADYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIDISSNITFENLSAVTAYLGV
gi|254474535|re  HYSIDYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|163736332|re  HYSIDYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|163741132|re  HYSIDYRIGTRYMGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|126737663|re  HYSIDYRIGTRYMGEYIADNAKAEAMKNPFLRDLLLSDSIYIGSNISFENLTPVASYLGK
gi|254466131|re  HYSVDYRIGTRYMGEYIADNAKAEAMKVPFLRELLLSDSIFIGSNITFENLTPVAAYLGK
gi|56696981|ref  HYSIDYRIGTRYMGEFITDNAKAEAMKVPFLRDLLMSDSIYIGSNISFENLTPVSTYLGK
gi|260433938|re  HYSIDYRIGTRYMGEYITDNAKAEAMKVPFLRDLLLTDSIYIGSNISFENLTPVSAYLGK
gi|118602078|re  HYSGDYRIGAEYLNDKILDNIKNYIRQDKWLKDLLDNESISIQNHITFDNLSSVSAFLGK
gi|148244187|re  HYSSDYRIGAKYLNDKILGNIKNHVKQDKWLKDLLDKESISIQNHITFDNLSSVSTFLGK
gi|74316263|ref  HFSGDYRIGTRYLDEPILDLIKQQAMQVPFLRDLLRRDWIDIKNNIKIHDLGPVLTFVGK
gi|241777342|re  HYSGDYRIGTRYLDEPILDLIKSRAMRDPFLRDLLRRDWITIRNVIKINDLGPVLSFIGK
gi|77166160|ref  HYSGDYRVGTRYLDEAILDNIKQKVMGIPFLRDLLYSDSIYIRNNINVDNLKPVSSFLGK
gi|254435004|re  HYSGDYRVGTRYLDEAILDNIKQKVMGIPFLRDLLYSDSIYIRNNINVDNLKPVSSFLGK
gi|220933681|re  HYSGDYRIGTRYLDEPILDNIKHQAMQVPFLRELLRSDSIYILNNITVDNLQPVSSFLGK
gi|224816189|re  HYSGDYRIGTRYLSEPILDNIKAQAMAVPFLRDLLRSDSIQIINNITVDNLHPVASFLGK
                                                                             


                        130       140       150       160       170       180
                 =========+=========+=========+=========+=========+=========+
gi|258545142|re  LG---RDGYDARTFIERQRLFNEAERAAMYDVPAFIERAERLYGYKAFINDAGGSICEID
GOS_1149010_Tra  ---------------------------------NFINKSNRIYGYPNFVCDTGGSICEVV
gi|126726873|re  PGDPAKGGVPFDEYMRRQALHREAEIASLLDSRYFIDRAQAIYGYPHFICDTGGSICEVV
gi|146278201|re  PGNPNRGGLDFADYLERQGQHREAEVAALHDTQRFIRRATEIYGYQNFVCDSGGSICEVV
gi|126462407|re  PGNPNRGGLAFADYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVV
gi|221639416|re  PGNPNRGGLAFADYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVV
gi|77463575|ref  PGNPNRGGLAFADYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVV
gi|255263460|re  PGDPELGGLSIDEYRRRQDQHRAAEISALLDTPHFMDRAQAIYGYDHFVCDTGGSICEVV
gi|89069526|ref  PGDPARGGLPFADYVRRQAEHRDAETAALLDTGHFIDRAGSLYGYPHFVCDTGGSICEVV
gi|159044141|re  PGNPALGGVPIDEYRLRQAQFRRAEIAALQDTAHFIDRAKALYGYPHFVCDSGGSICEWV
gi|126731321|re  PGDPDRGGLPMHEYVRRQGQFERAERLALLDTEYFIDRAQDLYGYPHFVCDTGGSICEWV
gi|163731600|re  PGNPAKSGLPFQEYERRQDQFQRAEIAALQDTAHFAKRAEDLYGYPNFICDTGGSICEWV
gi|83942343|ref  PGDAAKGGLPMAEYASRQEAFKQAEIAALRDTGHFARRAQTLYGYPHFICDTGGSICEWV
gi|83953562|ref  PGDAAKGGLPMAEYASRQDAFKQAEIAALRDTGHFARRAQALYGYPHFICDTGGSICEWV
gi|149915282|re  PGDPAKGGLPIDEYRRRQQQFRTAEIAALMDTAHFIDRAERLYGYPNFVCDTGGSICEWV
gi|149203858|re  PGNPAKGGLPIEEYRIRQEQFRQAEIHALMDTAYFIRRADALYGYPHFICDTGGSICEWV
gi|85702786|ref  PGNPVKGGLPIEEYRIRQEQFRQAEIHALMDTAYFIRRAGALYGYPHFICDTGGSICEWV
gi|254512450|re  PGNPERGGLEISEYQLRQEQFRQAEIYALLDTKYFVERARRLYSYPNFICDTGGSICEWV
gi|259416668|re  PGDARKGGLPIEEYRTRQDQFRRAEIDALLDTRYFIDRSERIYGYPHFICDSGGSICEWV
gi|99081219|ref  PGDAQKGGLPIEEYRTRQDQFRRAEIDALLDTRYFIDRSDRIYGYPHFICDSGGSICEWV
gi|254474535|re  PGNPDKGGLPMAEYTRRQDQFRTAELNALRDTGYFIDRAERLYDYPNFICDTGGSICEWV
gi|163736332|re  PGDPAKGGLAMQEYTRRQDQFRTAELNALRDTGYFIDRAARLYDYPNFICDTGGSICEWV
gi|163741132|re  PGNPAKGGLAMQEYTRRQDQFRTAELNALRDTGYFIDRAARLYDYPNFICDTGGSICEWV
gi|126737663|re  PGDPAKGGLPIEEYRRRQEQFRLAEIHALLDTEYFIDRADRLYGYSNFICDTGGSICEWV
gi|254466131|re  PGDVAKGGLPIAEYKRRQEQFRVAEIRALLDTEYFEDRAQQLYGYPNFICDTGGSICEWV
gi|56696981|ref  PGAPGKGGLPIAEYRRRQDQFRHAEIYALLDTEYFIDRAQRLYGYPHFICDTGGSICEWV
gi|260433938|re  PGNPDLGGLPMPEYRRRQEQFRLAEIHALLDTEYFIDRAQRLYGYPHFICDTGGSICEWV
gi|118602078|re  AGNPELGGTPIGTFIARQTMHLNAETKAMLDVPQFIQKAKTQ-GFNHFINDAGGSLCEL-
gi|148244187|re  TGNPELGGTSIDTFIARQIMHLNAEIKTMLDVPQFIKKAKAQ-GFNHFINDAGGSLCEL-
gi|74316263|ref  LGGPEWGGLPLDEFTRRQAAYREAEIAAMRDVPGFIRKGQEIYGYPHFVNDVGGSLCELD
gi|241777342|re  LGNPEQGGLPLDEFSRRQALYREAEIAAMFDLPEFVRKAQEVYGYAHLVNDVGGSLCELE
gi|77166160|ref  LGNPEQGGLGLTEFKRRQELHRQAEIAAMKDVPVFIRRAQQIYDYPHFVNDAGGSISELD
gi|254435004|re  LGNPEQGGLGLTEFKRRQELHRQAEIAAMKDVPVFIRRAQQIYDYPHFVNDAGGSISELD
gi|220933681|re  LGDPERGGLPLTEFKRRQALHHTAEIAAMRDVPEFIGKARSIYGYEHFVNDAGGSVCELD
gi|224816189|re  LGNPELGGLPLTEFKRRQNLHHEAEVQAMLDVPDFIHKSQTLFGYPHFVNDAGGSVCELD
                                                            ###############  


                        190       200       210       220       230       240
                 =========+=========+=========+=========+=========+=========+
gi|258545142|re  D-----DALMQTLAAHTLFVYIDTDEALYAE-LEARAMPTPNPSATTPASCKK-------
GOS_1149010_Tra  NPDDPNDPVLKTLAENTLMVWIEGSSHHTDE-LIKRFDENPKPMCYAPDFLDLKWREYLK
gi|126726873|re  EPNNPDDPVLSSLSQDNLLVWIKGDDSHIDR-LAARFDRAPKPMYYSPAFLTEKWSQYLS
gi|146278201|re  DPDNPNDPVMSDLAGNLLMVWIKGSEAHTAE-LVRRFDKAPKPMYYQPAFLRAAWTDYRV
gi|126462407|re  DPDNPNDPVMSDLAGNLLMVWIKGSDAHTAE-LVRRFDRAPKPMYYQPGFLHAAWTDYRV
gi|221639416|re  DPDNPNDPVMSDLAGNLLMVWIKGSEAHTAE-LVRRFDRAPKPMYYQPGFLHAAWTDYRV
gi|77463575|ref  DPDNPNDPVMSDLAGNLLMVWIKGSDAHTAE-LVRRFDRAPKPMYYQPGFLHAAWTDYRV
gi|255263460|re  DPSNKNDEVLNTLSNDLLIVWIEGSDAHAAE-LARRFDLAPKPMYYQPEFLEAAWAEYLN
gi|89069526|ref  DPDDPADPILTHLSERLLMVWIRGSEAHTDA-LVRRFDRAPKPMYYAPAFLEACWSAYLA
gi|159044141|re  DAEDPEDPLLRDLSAVTLLVWIKGDAAHTEE-LIRRFDRDPKPMAYQPEFLDRVWAEYLA
gi|126731321|re  DPQDAEDPILTALSSVALMVWVRGSEAHTED-LIRRFDRAPKPMAYRPDFLSSAWTRYLD
gi|163731600|re  DPEDDADPLMTELANHCLMIWIKGDEAHTQE-LIRRFDRAPKPMAYQPAFLTRVWREYLS
gi|83942343|ref  DPDDDNDPLMQTLSAECLPVWIKGDHAHTQE-LIRRFDKAPKPMAYQPEFLLRVWQEYLR
gi|83953562|ref  DPDDDNDPLMQTLSAECLPVWIKGDDAHTQE-LIRRFDKAPKPMAYQPEFLLRVWQEYLS
gi|149915282|re  DGDDPADPLLTELSRHCLLIYIEGSEAHTQS-LIRRFDRAPKPMAYQPEFLDQAWVQYLN
gi|149203858|re  DGDDPEDTILKELSAHCLLIYIEGSEAHTAE-LIRRFDRAPKPMAYQPAFLDAAWAEYLK
gi|85702786|ref  DGNNHDDPILTELSAQCLLIYIEGSEAHTAE-LIRRFDRAPKPMAYQPAFLDAAWAEYLR
gi|254512450|re  EPEDPNDQILTELSSQTLMIWIKGDDAHTEE-LIRRFDKAPKPMSYQAAFLQRVWSEYLA
gi|259416668|re  DADDPNDKLLTELSAHTLLVWIKGTEDHTQE-LIRRFDRAPKPMAYQPAFLTRVWEEYLS
gi|99081219|ref  DANDPNDPLLSELSAHTLLVWIKGTEDHTQE-LIRRFDRAPKPMAYQPAFLTRVWEEYLN
gi|254474535|re  EADDPSDPLLTQLSQQTLMVWIKGDEAHTQE-LIRRFDRAPKPMAYQPAFLARVWQEYLK
gi|163736332|re  DANDPGDPLLTELAKNTLMVWIKGDEAHTQE-LIRRFDRAPKPMAYEPAFLTRVWQEYLK
gi|163741132|re  DANDPNDPLLTELSQHTLMVWIKGDEAHTQE-LIRRFDRAPKPMAYEPAFLARVWQEYLK
gi|126737663|re  DANDPNDPVLSELSKHCLMVWIKGDDAHTEE-LVRRFDRAPKPMSYQPEFLGRVWEEYLK
gi|254466131|re  DVDDPKDQIMSELSKHTLMVWIKGDEAHTEE-LVRRFDRAPKPMSYQPEFLDRVWQEYLK
gi|56696981|ref  DPEDPRDQVLSELSKHTLMIWIEGSEAHTAE-LIRRFDKAPKPMSYQPEFLSRVWEAYLA
gi|260433938|re  DPDDPADRVLSELSRHTLMIWIKGDDDHTAE-LIRRFDKAPKPMSYQAEFLTRVWQDYLD
gi|118602078|re  ----DNEQVYQTLANNTVILYIRASKTNETA-LVECAQTHPKPLYYQANFLKQQLSIYLQ
gi|148244187|re  ----DNEQVYQTLSDNTVILYIRASKTNEIA-LIKRAQTHPKPLYYQANFLKHELDIYLQ
gi|74316263|ref  EPG-----VIELLAEHTLILYIQTTSREEEETLIRRAQSDPKPLYFRPAFLEENLPRYLE
gi|241777342|re  EPR-----VIDMLARHSLILYIRVPEADEIK-LIERAQADPKPLYYRPEFLQTAVKDYLE
gi|77166160|ref  DPE-----VLQTLDQHTLILYIKATQRDERK-LIQRAEEEPKPLYYRESFLDEQLAIYRE
gi|254435004|re  DPE-----VLQTLDQHTLILYIKATQRDERK-LIQRAEEEPKPLYYRESFLDEQLAIYRE
gi|220933681|re  DPE-----TLELLAEHTLLIYIQATRQDEEE-LIRRAESAPKPLYYREAFLDEQLREYKQ
gi|224816189|re  SPG-----VLETLAEHTLILYIKATDEDERQ-LIERAEHDPKPLYYREAFLDQQLAEYMR
                             ################### ###################         


                        250       260       270       280       290       300
                 =========+=========+=========+=========+=========+=========+
gi|258545142|re  ------------------------------------------------------------
GOS_1149010_Tra  INKCS-VENVDPDDFVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLI
gi|126726873|re  LNGVA-EDKVDPDDFVRWTYRQAMKHREPIYAAMAEKWGVTVMADDVANVSNATDFDRLI
gi|146278201|re  EKGLT-EEQVDPDDFIRWTYARALAHRQPRYAAMAR-WGITVTAEEVAEAGNAPAFVDLI
gi|126462407|re  EKGLT-EEQVDPDDFIRWTYARALAHRQPRYEAMAR-WGITVTAEEVAEARDAQGFVDLI
gi|221639416|re  EKGLT-EEQVDPDDFIRWTYARALAHRQPRYEAMAR-WGITVTAEEVAEARDAQGFVDLI
gi|77463575|ref  EKGLT-EEQVDPDDFIRWTYARALAHRQPRYEAMAR-WGITVTAEEVAEARDAQGFVDLI
gi|255263460|re  NNKVS-EGDVNPDDFVRWTYSRALAHRAPLYREIAQ-WGVAVQAHDIAQVKSAGAFEALI
gi|89069526|ref  ETGTA-PGAVDPDAFIRWTYARAMAHREPLYAGMAR-WGVTLEASDVAEVRDEADITALV
gi|159044141|re  QNRMA-PEAVDPDAFIRWTYAQALAHRQPRYAAMAQNWGVTVEADDIARMRDAADFEALI
gi|126731321|re  ATGTS-EEGIDPDAFIRWTYAQALAHRQPLYEAMASNWGITVDASDMSGVRTAADFEDII
gi|163731600|re  EKNCT-ETEVDPDDFIRFTYAKALAHRQPRYQAMSR-WGITVTASEVSALKTPDDFYALV
gi|83942343|ref  ENNCQ-EDDVDPDAFIRWTYAQALAHRQPRYEAMSR-WGVTVTADQVAGLKSEDDFNDLI
gi|83953562|ref  ENNCQ-EDDVDPDAFIRWTYAQALAHRQPRYEAMSR-WGVTVTADQVAGLKSEDDFNDLI
gi|149915282|re  ENKCE-ADEVDPGAFIRWTYARALAHRKPRYEAMAK-WGITVTADEVAETRDADAFDALV
gi|149203858|re  ENNCE-EDEVDPDSFIRWTYARALAHRQPRYEAMVK-WGVSVTANEVATVKTAQEFEALI
gi|85702786|ref  ENKCE-EGDVNPDSFIRWTYARALAHRQPRYEAMVK-WGVSVSATEVAGVKTAQDFEALI
gi|254512450|re  LNTCS-GDEVDPDAFIRWTYARALAHRQPRYEAMARNWGVTIAADDVAKVRDSADFNELI
gi|259416668|re  ENNIS-DEDVDPHSFLRWTYAQALAHRQPRYEAMAKNWGVTVTADQVAQVRSTLDFEELI
gi|99081219|ref  ENNIT-DADVDPHAFLRWTYAQALAHRQPRYEAMAAHWGVTVTADQVAHVQNTSDFEELI
gi|254474535|re  ENALS-EEQVDPDTFIRWTYAQALAHRQPRYEAMAKNWGVTVTADQIASVRSEADFTDVI
gi|163736332|re  ENTLS-EADVDPDSFIRWTYAQALAHRQPRYAAMAEKWGVTVTADQISTIRTEADFTDVI
gi|163741132|re  ENTLS-EADVDPDSFIRWTYAQALAHRQPRYAAMAAHWGVTVTADQISGIRTEADFKDVI
gi|126737663|re  EMSIS-EGDADPDAFIRWTYAQALAHRQPRYESMARNWGVTVTADQISAVQSEADFNELI
gi|254466131|re  ENGIA-EGDVDPDAFIRWTYAQALAHRQPRYQAMADNWGITVTADQISTVKDEAGFVDLI
gi|56696981|ref  ENDCT-PDEVDPDAFIRWTYAQALAHRQPRYRAMARNWGITVTAEQIGAIRDAADFDALI
gi|260433938|re  RHGCA-EADVNPDDFIRWTYAQALAHRQPRYRAMAENWGVTVTADQIGQVRDAGDFDELI
gi|118602078|re  KNQLIYVAQIDPNEFVRWVFPRLLEHRKPKYEAIAKKYGYTIDSEGLYRCKNADEVFELI
gi|148244187|re  ENQLIYIAQINPNKFVRWVFPRLLEHRKPKYEAIAKKYGYIIDNEDLYQCRSANEIFELI
gi|74316263|ref  EKGLAYVAQIEPNDFARWIFPRLFHSRIPRYEAIAGPHGYTVTSEEAAQVRDERDFLALL
gi|241777342|re  LRQLDYVAQIEPDDFTRWVFPRLFHSRVPRYEAIAQPHGYTVSSEDVAQVRDEADFLALV
gi|77166160|ref  KHDLPYVALIEPDEFVRWIFPRLFYSRLPRYQAIADEYGYTVSTDQLAQVNDETGFLELL
gi|254435004|re  KHDLPYVALIEPDEFVRWIFPRLFYSRLPRYQAIADEYGYTVSTDQLAQVNDETGFLELL
gi|220933681|re  AHDIEYVAQIDPDDFVRWMFPRLFRSRIPRYEAIARDYGYTITTEELGRVKSEQDFLALI
gi|224816189|re  DEGLDYVAQIDPDAFVRWVFSRLFRARLPRYEAIAEAHGYTISTTELAQVRNAQDFDDLV
                                                                             


                        310
                 =========+==
gi|258545142|re  ------------
GOS_1149010_Tra  GSTISRSI----
gi|126726873|re  ADALGKQAN---
gi|146278201|re  ARTLDRARTEA-
gi|126462407|re  ARTLDRTRTHA-
gi|221639416|re  ARTLDQARTHA-
gi|77463575|ref  ARTLDQARTHA-
gi|255263460|re  ASTLEMRG----
gi|89069526|ref  ADALDRRAS---
gi|159044141|re  ADALGNRSSSA-
gi|126731321|re  AKALPD------
gi|163731600|re  ASYL--------
gi|83942343|ref  AGTLSA------
gi|83953562|ref  AGTLSA------
gi|149915282|re  ATALERST----
gi|149203858|re  ATALERRAATPI
gi|85702786|ref  ATALERRAATPI
gi|254512450|re  ERTLETRPNPT-
gi|259416668|re  ASALEARSEQT-
gi|99081219|ref  ASALETRAEQT-
gi|254474535|re  ANALELRAQAF-
gi|163736332|re  AGALEARG----
gi|163741132|re  AGALEARG----
gi|126737663|re  ARTLEAGG----
gi|254466131|re  ASALEARS----
gi|56696981|ref  ESSLENRPD---
gi|260433938|re  ERTLEARRAAS-
gi|118602078|re  YGAMN-------
gi|148244187|re  YRAMN-------
gi|74316263|ref  EAAIARKGS---
gi|241777342|re  ESAIARSA----
gi|77166160|ref  ETVVAKQP----
gi|254435004|re  ETVVAKQP----
gi|220933681|re  ETALER------
gi|224816189|re  CMALEREARL--
                             





Parameters used
Minimum Number Of Sequences For A Conserved Position: 18
Minimum Number Of Sequences For A Flanking Position: 30
Maximum Number Of Contiguous Nonconserved Positions: 8
Minimum Length Of A Block: 10
Allowed Gap Positions: None
Use Similarity Matrices: Yes

Flank positions of the 3 selected block(s)
Flanks: [164  178]  [193  211]  [213  231]  

New number of positions in input.fasta-gb:  53  (16% of the original 312 positions)
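
The position count reported above can be checked directly from the flanks. The following is a minimal Python sketch, not part of Gblocks: it assumes the flank coordinates are inclusive alignment columns and that the reported percentage is truncated rather than rounded. The function name `block_lengths` is only illustrative.

```python
# Flanks of the selected blocks, copied from the Gblocks report above.
flanks = [(164, 178), (193, 211), (213, 231)]
total_positions = 312  # original alignment length given in the report

def block_lengths(flanks):
    """Length of each block, assuming inclusive start and end columns."""
    return [end - start + 1 for start, end in flanks]

lengths = block_lengths(flanks)
kept = sum(lengths)
# Integer division truncates; that Gblocks truncates is an assumption.
percent = kept * 100 // total_positions

print(lengths, kept, percent)
```

Under these assumptions the three blocks add up to the 53 positions and 16% stated in the report.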

Protein Domains

PROTOCOL

PROSITE / CDD / PFAM / INTERPRO / PRODOM / SMART: default parameters


RESULTS ANALYSIS

a) No conserved domains were found with PROSITE, PFAM, INTERPRO or CDD. One possible reason is that the protein fold was rearranged during molecular evolution and the corresponding region was lost.

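b) PRODOM: one domain was found (PDA1P8F7).

c) PRODOM: the match is significant (E-value 5e-48, 55% identity), but the domain belongs to a small family of 7 members with no commentary, so it gives no functional information.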

d) SMART: only domains scoring below the significance cutoffs were found.

e) SMART: the domains found have low scores. They may be regulatory domains, which tend to be shorter and less well conserved, whereas enzymes are mostly longer and have better amino acid conservation, particularly in active-site regions.


For this ORF itself, 'no hits were reported'. Because the ORF obtained is a partial protein and its similarity to its homolog was good, I took the homolog for further domain analysis.


RAW RESULTS

a) WITH PRODOM
database: multiple alignments 
Program: ncbi-blastp 
Matrix: BLOSUM62 
Expect: 0.01 
Filter: seg 


Align subsequence with ProDom domains, using Multalin 
Domain ID  BEGIN  END   
PDA1P8F7   2     148    

HSP Results 
Warning: Original output has been filtered to yield non-redundant similarities

blastp 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query: unkwown
	(152 letters)

Database: ProDom 2006.1 (November 6th, 2008) multiple alignments
	8,791,492 sequences; 675,451,875 total letters



ProDom domains producing High-scoring Segment Pairs:

  Position  ProDom domain                                      Score E value

     2-148  #PDA1P8F7                                            484 5e-48


>PDA1P8F7 (Closest domain: Q5LRL2_SILPO 24-306)
Number of domains in family: 7
Commentary (automatic):
 
Length = 283
Score = 484  (191.0 bits), Expect = 5e-48
Identities = 82/148 (55%), Positives = 114/148 (77%), Gaps = 1/148 (0%)

 Query:      2 FINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKTLAENTLMVWIEGSSHHTDELIK 61
               FI+++ R+YGYP+F+CDTGGSICE V+P+DP D VL  L+++TLM+WIEGS  HT ELI+
 Sbjct:    155 FIDRAQRLYGYPHFICDTGGSICEWVDPEDPRDQVLSELSKHTLMIWIEGSEAHTAELIR 214

 Query:     62 RFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDDFVRWTYREAMAHRDPIYKSMA- 120
               RFD+ PKPM Y P+FL   W  YL  N C+ + VDPD F+RWTY +A+AHR P Y++MA 
 Sbjct:    215 RFDKAPKPMSYQPEFLSRVWEAYLAENDCTPDEVDPDAFIRWTYAQALAHRQPRYRAMAR 274

 Query:    121 SWGITVQADLISQVKTPSEFNSLIGSTI 148
               +WGITV A+ I  ++  ++F++LI S++
 Sbjct:    275 NWGITVTAEQIGAIRDAADFDALIESSL 302
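
The percentages in the HSP summary can be recomputed from the fractions as a sanity check. The following is a small Python sketch rather than anything produced by ProDom or BLAST: the helper name `parse_hsp_stats` is hypothetical, and truncating the percentage is an assumption that happens to reproduce the printed values.

```python
import re

# Summary line copied from the HSP above.
stats_line = "Identities = 82/148 (55%), Positives = 114/148 (77%), Gaps = 1/148 (0%)"

def parse_hsp_stats(line):
    """Return {name: (count, length, percent)} recomputed from each fraction."""
    stats = {}
    for name, count, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, length = int(count), int(length)
        stats[name] = (count, length, count * 100 // length)  # truncated percent
    return stats

stats = parse_hsp_stats(stats_line)
for name, (count, length, pct) in stats.items():
    print(f"{name}: {count}/{length} = {pct}%")

# The alignment length is the aligned query span (residues 2 to 148) plus gap columns.
query_span = 148 - 2 + 1
alignment_length = query_span + stats["Gaps"][0]
print(query_span, alignment_length)
```

The aligned query span (147 residues) plus the single gap column gives the alignment length of 148 used as the denominator in the summary line.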
======================================================================================
--------------------------------------------------------------------------------------
b) WITH SMART
No domains, repeats, motifs or features could be predicted with confidence. Domains with scores less significant than established cutoffs. Features are also not shown when two or more occupy the same piece of sequence;

Name    Begin  End E-value  Reason 
DysFC   150     176 3.42e+03 threshold

Phylogeny

PROTOCOL

a) Phylogeny.fr / BioNJ method /

b) Phylogeny.fr / PhyML method /


1. Both the BioNJ and PhyML methods show that the query sequence lies at a close distance to the alpha-proteobacterial species.


2. These species are classified as alpha-proteobacteria, so this sequence can be placed in the same class.


RESULTS ANALYSIS

According to the BLAST results, I defined an ingroup and an outgroup as follows:

InGroup: Alpha-Proteobacteria

> ref|ZP_00954804.1| [Sulfitobacter sp. EE-36] [a-proteobacteria] 191 2e-47,

> ref|ZP_05087921.1| [Ruegeria sp. R11] [a-proteobacteria] 191 2e-47

> ref|YP_167343.1| [Ruegeria pomeroyi DSS-3] [a-proteobacteria] 191 2e-47,

> ref|ZP_02143751.1| [Phaeobacter gallaeciensis BS107] [a-proteobacteria] 191 3e-47,

> ref|ZP_00962283.1| [Sulfitobacter sp. NAS-14.1] [a-proteobacteria] 190 3e-47

> ref|ZP_02148524.1| [Phaeobacter gallaeciensis 2.10] [a-proteobacteria] 190 4e-47,

> ref|ZP_01156870.1| [Oceanicola granulosus HTCC2516] [a-proteobacteria] 190 4e-47,

> ref|ZP_05342802.1| [Thalassiobium sp. R2A62] [a-proteobacteria] 190 4e-47,

> ref|ZP_01742712.1| [Rhodobacterales bacterium HTCC2150] [a-proteobacteria] 190 4e-47,

> ref|ZP_01753393.1| [Roseobacter sp. SK209-2-6] [a-proteobacteria] 189 e-47

> ref|YP_001043521.1| [Rhodobacter sphaeroides ATCC 17029] [a-proteobacteria] 189 6e-47,

> ref|ZP_05740588.1| [Silicibacter sp. TrichCH4B] [a-proteobacteria] 188 1e-46,

> ref|ZP_05124517.1| [Rhodobacteraceae bacterium KLH11] [a-proteobacteria] 188 1e-46,

> ref|YP_002525678.1| [Rhodobacter sphaeroides KD131] [a-proteobacteria] 188 2e-46,

> ref|YP_001168360.1| [Rhodobacter sphaeroides ATCC 17025] [a-proteobacteria] 188 2e-46

> ref|YP_353079.1| [Rhodobacter sphaeroides 2.4.1] [a-proteobacteria] 188 2e-46

> ref|ZP_05787909.1| [Silicibacter lacuscaerulensis ITI-1157] [a-proteobacteria] 187 2e-46

> ref|ZP_05079542.1| [Rhodobacterales bacterium Y4I] [a-proteobacteria] 187 3e-46

> ref|YP_001532935.1| [Dinoroseobacter shibae DFL 12] [a-proteobacteria] 187 3e-46,

> ref|YP_613373.1| [Ruegeria sp. TM1040] [a-proteobacteria] 186 6e-46,

> ref|ZP_02139047.1| [Roseobacter litoralis Och 149] [a-proteobacteria] 185 1e-45,

> ref|ZP_01033890.1| [Roseovarius sp. 217] [a-proteobacteria] 183 6e-45,

> ref|ZP_01747128.1| [Sagittula stellata E-37] [a-proteobacteria] 182 1e-44,


_

OutGroup: All other Proteobacteria: beta, delta, epsilon, & gamma

_

> ref|YP_002512580.1| [Thioalkalivibrio sp. HL-EbGR7] [g-proteobacteria] 119 1e-25,

> ref|ZP_03689324.1| [Thioalkalivibrio sp. K90mix] [g-proteobacteria] 113 5e-24,

> ref|YP_314003.1| [Thiobacillus denitrificans ATCC 25259] [b-proteobacteria] 105 1e-21,

> ref|ZP_04774611.1| [Allochromatium vinosum DSM 180] [g-proteobacteria] 104 4e-21,

> ref|YP_344685.1| [Nitrosococcus oceani ATCC 19707] [g-proteobacteria] 102 1e-20,

> ref|ZP_05048511.1| [Nitrosococcus oceani AFC27] [g-proteobacteria] 102 1e-20

> ref|YP_903293.1| [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] [g-proteobacteria] 90.1 7e-17,

> ref|YP_001218881.1| [Candidatus Vesicomyosocius okutanii HA] [g-proteobacteria] 88.6 2e-16


The tree obtained showed an anomalous classification of in-groups and out-groups.


1) The metagenomic sequence probably also comes from an alpha-proteobacterium, since in the tree it is closely related to alpha-proteobacterial species.

2) Both trees, constructed with PhyML and BioNJ, indicate a relationship of the ORF to the alpha-proteobacteria.

RAW RESULTS
a) phyML METHOD:
************************************************************************************
result of TreeDyn in text format:
************************************************************************************

                                                                                                                                                                                ----0.2---
 
                                  +----Oceanicola_granulosus_HTCC2516_gi_89069526
                                  |
                                  |      +-Rhodobacter_sphaeroides_ATCC_17025_gi_146278201
                                  |      |
                                +-+ +----++Rhodobacter_sphaeroides_KD131_gi_221639416
                                | | |    ||
                                | | |    +++Rhodobacter_sphaeroides_ATCC_17029_gi_126462407
                                | +-+     ++
                                |   |      +Rhodobacter_sphaeroides_2.4.1_gi_77463575
                                |   |
                                |   +-----------Thalassiobium_sp._R2A62_gi_255263460
                                |
                                |
                                |+---------Sagittula_stellata_E-37_gi_126731321
                                |+
                                |+--------------GOS_1149010_Translation_309-764_indirect_strand
           +--------------------+
           |                    | +-------------Rhodobacterales_bacterium_HTCC2150_gi_126726873
           |                    | |
           |                    | |    +---Roseobacter_sp._SK209-2-6_gi_126737663
           |                    | |   ++
           |                    | |   |+Rhodobacterales_bacterium_Y4I_gi_254466131
           |                    | | +-+
           |                    +-+ | | +--Silicibacter_pomeroyi_DSS-3_gi_56696981
           |                      | | | |
           |                      | | +-+ +------Rhodobacteraceae_bacterium_KLH11_gi_254512450
           |                      | |   +-+
           |                      | |     +--Silicibacter_lacuscaerulensis_ITI-1157_gi_260433938
           |                      | |
           |                      | | +----Roseobacter_litoralis_Och_149_gi_163731600
           |                      +-+ |
           |                        |+++-Ruegeria_sp._R11_gi_254474535
           |                        ||||
           |                        ||+++---Phaeobacter_gallaeciensis_BS107_gi_163736332
           |                        || ++
           |                        ||  +Phaeobacter_gallaeciensis_2.10_gi_163741132
           |                        ||
           |                        ++
           |                         |   +------Dinoroseobacter_shibae_DFL_12_gi_159044141
 +---------+                         | +-+
 |         |                         | | |   +-Sulfitobacter_sp._EE-36_gi_83942343
 |         |                         | | +---+
 |         |                         | |     +Sulfitobacter_sp._NAS-14.1_gi_83953562
 |         |                         | |
 |         |                         +-+    +-----Roseobacter_sp._AzwK-3b_gi_149915282
 |         |                           |+---+
 |         |                           ||   |+Roseovarius_sp._TM1035_gi_149203858
 |         |                           ||   ++
 |         |                           ++    +-Roseovarius_sp._217_gi_85702786
 |         |                            |
 |         |                            | +Silicibacter_sp._TrichCH4B_gi_259416668
 |         |                            +-+
 |         |                              +Silicibacter_sp._TM1040_gi_99081219
 |         |
 |         |                               +----Candidatus_Ruthia_magnifica_str._Cm_{Calyptogena_magnifica}_gi_118602078
 |         |           +-------------------+
 |         |           |                   +-Candidatus_Vesicomyosocius_okutanii_HA_gi_148244187
 |         |    +------+
 |         |    |      +------------Allochromatium_vinosum_DSM_180_gi_241777342
 |         |  +-+
 |         |  | |
 |         |  | +-----Thiobacillus_denitrificans_ATCC_25259_gi_74316263
 |         |  |
 |         +--+                +Nitrosococcus_oceani_ATCC_19707_gi_77166160
 |            |     +----------+
 |            | +---+          +Nitrosococcus_oceani_AFC27_gi_254435004
 |            | |   |
 |            +-+   +----Thioalkalivibrio_sp._K90mix_gi_224816189
 |              |
 |              +-----Thioalkalivibrio_sp._HL-EbGR7_gi_220933681
 |
 +------------------------------------------------Cardiobacterium_hominis_ATCC_15826_gi_258545142

-----------------------------------------------------------------------------------------------------------------
b) BioNJ METHOD:
************************************************************************************
result of TreeDyn in text format:
************************************************************************************
                                                                                                                                                                                        ------0.2-----
 
                          +----Roseobacter_litoralis_Och_149_gi_163731600
                          |
                       +--+ +-Ruegeria_sp._R11_gi_254474535
                       |  | |
                       |  +-+---Phaeobacter_gallaeciensis_BS107_gi_163736332
                       |    |
                       |    +--Phaeobacter_gallaeciensis_2.10_gi_163741132
                       |
                       |     +---Roseobacter_sp._SK209-2-6_gi_126737663
                       | +---+
                       | |   +Rhodobacterales_bacterium_Y4I_gi_254466131
                       | |
                       |-+
                       | |+----Silicibacter_pomeroyi_DSS-3_gi_56696981
                       | ++
                       |  | +-----Rhodobacteraceae_bacterium_KLH11_gi_254512450
                    +--+  +-+
                    |  |    +-----Silicibacter_lacuscaerulensis_ITI-1157_gi_260433938
                    |  |
                    |  |    +------Roseobacter_sp._AzwK-3b_gi_149915282
                    |  |----+
                    |  |    | +-Roseovarius_sp._TM1035_gi_149203858
                    |  |    +-+
                    |  |      +-Roseovarius_sp._217_gi_85702786
                    |  |
                    |  |        +-Sulfitobacter_sp._EE-36_gi_83942343
                   ++  | +------+
                   ||  | |      +Sulfitobacter_sp._NAS-14.1_gi_83953562
                   ||  +-+
                   ||    |+-------Dinoroseobacter_shibae_DFL_12_gi_159044141
                   ||    ||
                   ||    ++    +Silicibacter_sp._TrichCH4B_gi_259416668
                   ||     +----+
                   ||          +Silicibacter_sp._TM1040_gi_99081219
                   ||
                   |+------------------Rhodobacterales_bacterium_HTCC2150_gi_126726873
                   |
                   |
                   |  +--------Oceanicola_granulosus_HTCC2516_gi_89069526
                 +-+--+
                 | |  +--------Sagittula_stellata_E-37_gi_126731321
                 | |
                 | |        +-Rhodobacter_sphaeroides_ATCC_17025_gi_146278201
                 | |        |
                 | |  +-----++Rhodobacter_sphaeroides_KD131_gi_221639416
                 | |  |     ++
 +---------------+ |  |      |+Rhodobacter_sphaeroides_ATCC_17029_gi_126462407
 |               | +--+      ++
 |               |    |       +Rhodobacter_sphaeroides_2.4.1_gi_77463575
 |               |    |
 |               |    +------------Thalassiobium_sp._R2A62_gi_255263460
 |               |
 |               +-----------------GOS_1149010_Translation_309-764_indirect_strand
 |
 |----------------------------------------------Cardiobacterium_hominis_ATCC_15826_gi_258545142
 |
 |                                      +-------Candidatus_Ruthia_magnifica_str._Cm_{Calyptogena_magnifica}_gi_118602078
 |                   +------------------+
 |                   |                  +-Candidatus_Vesicomyosocius_okutanii_HA_gi_148244187
 |             +-----+
 |             |     |
 |             |     +------------------Allochromatium_vinosum_DSM_180_gi_241777342
 +-------------+
               |   +-------Thioalkalivibrio_sp._HL-EbGR7_gi_220933681
               |   |
               +---+-------------Thiobacillus_denitrificans_ATCC_25259_gi_74316263
                   |
                   |                   +Nitrosococcus_oceani_ATCC_19707_gi_77166160
                   |     +-------------+
                   +-----+             +Nitrosococcus_oceani_AFC27_gi_254435004
                         |
                         +-----Thioalkalivibrio_sp._K90mix_gi_224816189

Taxonomy report

PROTOCOL

BLASTp versus NR, NCBI default parameters apart from "Number of descriptions: 1000"


RESULTS ANALYSIS


According to the BLAST results, I defined an ingroup and an outgroup as follows:

InGroup: Alpha-Proteobacteria

> ref|ZP_00954804.1| [Sulfitobacter sp. EE-36] [a-proteobacteria] 191 2e-47,

> ref|ZP_05087921.1| [Ruegeria sp. R11] [a-proteobacteria] 191 2e-47

> ref|YP_167343.1| [Ruegeria pomeroyi DSS-3] [a-proteobacteria] 191 2e-47,

> ref|ZP_02143751.1| [Phaeobacter gallaeciensis BS107] [a-proteobacteria] 191 3e-47,

> ref|ZP_00962283.1| [Sulfitobacter sp. NAS-14.1] [a-proteobacteria] 190 3e-47

> ref|ZP_02148524.1| [Phaeobacter gallaeciensis 2.10] [a-proteobacteria] 190 4e-47,

> ref|ZP_01156870.1| [Oceanicola granulosus HTCC2516] [a-proteobacteria] 190 4e-47,

> ref|ZP_05342802.1| [Thalassiobium sp. R2A62] [a-proteobacteria] 190 4e-47,

> ref|ZP_01742712.1| [Rhodobacterales bacterium HTCC2150] [a-proteobacteria] 190 4e-47,

> ref|ZP_01753393.1| [Roseobacter sp. SK209-2-6] [a-proteobacteria] 189 e-47

> ref|YP_001043521.1| [Rhodobacter sphaeroides ATCC 17029] [a-proteobacteria] 189 6e-47,

> ref|ZP_05740588.1| [Silicibacter sp. TrichCH4B] [a-proteobacteria] 188 1e-46,

> ref|ZP_05124517.1| [Rhodobacteraceae bacterium KLH11] [a-proteobacteria] 188 1e-46,

> ref|YP_002525678.1| [Rhodobacter sphaeroides KD131] [a-proteobacteria] 188 2e-46,

> ref|YP_001168360.1| [Rhodobacter sphaeroides ATCC 17025] [a-proteobacteria] 188 2e-46

> ref|YP_353079.1| [Rhodobacter sphaeroides 2.4.1] [a-proteobacteria] 188 2e-46

> ref|ZP_05787909.1| [Silicibacter lacuscaerulensis ITI-1157] [a-proteobacteria] 187 2e-46

> ref|ZP_05079542.1| [Rhodobacterales bacterium Y4I] [a-proteobacteria] 187 3e-46

> ref|YP_001532935.1| [Dinoroseobacter shibae DFL 12] [a-proteobacteria] 187 3e-46,

> ref|YP_613373.1| [Ruegeria sp. TM1040] [a-proteobacteria] 186 6e-46,

> ref|ZP_02139047.1| [Roseobacter litoralis Och 149] [a-proteobacteria] 185 1e-45,

> ref|ZP_01033890.1| [Roseovarius sp. 217] [a-proteobacteria] 183 6e-45,

> ref|ZP_01747128.1| [Sagittula stellata E-37] [a-proteobacteria] 182 1e-44,


_

OutGroup: All other Proteobacteria: beta, delta, epsilon, & gamma

_

> ref|YP_002512580.1| [Thioalkalivibrio sp. HL-EbGR7] [g-proteobacteria] 119 1e-25,

> ref|ZP_03689324.1| [Thioalkalivibrio sp. K90mix] [g-proteobacteria] 113 5e-24,

> ref|YP_314003.1| [Thiobacillus denitrificans ATCC 25259] [b-proteobacteria] 105 1e-21,

> ref|ZP_04774611.1| [Allochromatium vinosum DSM 180] [g-proteobacteria] 104 4e-21,

> ref|YP_344685.1| [Nitrosococcus oceani ATCC 19707] [g-proteobacteria] 102 1e-20,

> ref|ZP_05048511.1| [Nitrosococcus oceani AFC27] [g-proteobacteria] 102 1e-20

> ref|YP_903293.1| [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] [g-proteobacteria] 90.1 7e-17,

> ref|YP_001218881.1| [Candidatus Vesicomyosocius okutanii HA] [g-proteobacteria] 88.6 2e-16


a) Ingroup: the alpha-proteobacteria are selected as the ingroup because the closest homologs of the translated ORF sequence are alpha-proteobacterial.


b) Outgroup: the beta- and gamma-proteobacteria are selected as the outgroup because their homologs are more distant from the translated ORF sequence (lower scores and higher E-values).
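
The ingroup/outgroup split described above can be reproduced from the hit lines themselves. The following is a minimal Python sketch, assuming the line layout used in the lists above (accession, organism, taxon tag, score, E-value). The regular expression and the function name `group_hits` are illustrative, and only five of the hits are copied in to keep the example short.

```python
import re
from collections import defaultdict

# A few hit lines copied from the lists above (trailing commas as in the source).
hits_text = """\
> ref|ZP_00954804.1| [Sulfitobacter sp. EE-36] [a-proteobacteria] 191 2e-47,
> ref|YP_167343.1| [Ruegeria pomeroyi DSS-3] [a-proteobacteria] 191 2e-47,
> ref|YP_002512580.1| [Thioalkalivibrio sp. HL-EbGR7] [g-proteobacteria] 119 1e-25,
> ref|YP_314003.1| [Thiobacillus denitrificans ATCC 25259] [b-proteobacteria] 105 1e-21,
> ref|YP_001218881.1| [Candidatus Vesicomyosocius okutanii HA] [g-proteobacteria] 88.6 2e-16
"""

# Assumed layout: "> ref|ACC| [organism] [taxon] score evalue" with an optional comma.
HIT = re.compile(
    r"> ref\|(?P<acc>[^|]+)\| \[(?P<organism>.+)\] \[(?P<taxon>[^\]]+)\] "
    r"(?P<score>[\d.]+) (?P<evalue>[\de.+-]+),?\s*$"
)

def group_hits(text, ingroup_taxon="a-proteobacteria"):
    """Split hits into ingroup and outgroup according to the taxon tag."""
    groups = defaultdict(list)
    for line in text.splitlines():
        m = HIT.match(line.strip())
        if not m:
            continue  # skip blank or malformed lines
        label = "ingroup" if m["taxon"] == ingroup_taxon else "outgroup"
        groups[label].append((m["organism"], float(m["score"]), float(m["evalue"])))
    return groups

groups = group_hits(hits_text)
for label in ("ingroup", "outgroup"):
    scores = [score for _, score, _ in groups[label]]
    print(label, len(scores), min(scores), max(scores))
```

For the hits included here, every ingroup score is higher than every outgroup score, which is the criterion stated in a) and b).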



RAW RESULTS
Lineage Report

Bacteria            [bacteria]
. Proteobacteria      [proteobacteria]
. . Alphaproteobacteria [a-proteobacteria]
. . . Rhodobacterales     [a-proteobacteria]
. . . . Rhodobacteraceae    [a-proteobacteria]
. . . . . Roseobacter         [a-proteobacteria]
. . . . . . Roseobacter sp. AzwK-3b -----------------------------  194 2 hits [a-proteobacteria]    hypothetical protein RAZWK3B_15633 [Roseobacter sp. AzwK-3b
. . . . . . Roseobacter sp. SK209-2-6 ...........................  189 2 hits [a-proteobacteria]    hypothetical protein RSK20926_18517 [Roseobacter sp. SK209-
. . . . . . Roseobacter litoralis Och 149 .......................  185 2 hits [a-proteobacteria]    hypothetical protein RLO149_19889 [Roseobacter litoralis Oc
. . . . . . Roseobacter denitrificans OCh 114 ...................  180 2 hits [a-proteobacteria]    hypothetical protein RD1_2783 [Roseobacter denitrificans OC
. . . . . . Roseobacter sp. GAI101 ..............................  179 2 hits [a-proteobacteria]    conserved hypothetical protein [Roseobacter sp. GAI101] >gi
. . . . . . Roseobacter sp. MED193 ..............................  178 2 hits [a-proteobacteria]    hypothetical protein MED193_04716 [Roseobacter sp. MED193] 
. . . . . . Roseobacter sp. CCS2 ................................  174 2 hits [a-proteobacteria]    hypothetical protein RCCS2_00167 [Roseobacter sp. CCS2] >gi
. . . . . Roseovarius sp. TM1035 --------------------------------  192 2 hits [a-proteobacteria]    hypothetical protein RTM1035_07318 [Roseovarius sp. TM1035]
. . . . . Sulfitobacter sp. EE-36 ...............................  191 2 hits [a-proteobacteria]    hypothetical protein EE36_14922 [Sulfitobacter sp. EE-36] >
. . . . . Ruegeria sp. R11 ......................................  191 2 hits [a-proteobacteria]    conserved hypothetical protein [Ruegeria sp. R11] >gi|21402
. . . . . Ruegeria pomeroyi DSS-3 ...............................  191 2 hits [a-proteobacteria]    hypothetical protein SPO2113 [Ruegeria pomeroyi DSS-3] >gi|
. . . . . Phaeobacter gallaeciensis BS107 .......................  191 2 hits [a-proteobacteria]    hypothetical protein RGBS107_14411 [Phaeobacter gallaeciens
. . . . . Sulfitobacter sp. NAS-14.1 ............................  190 2 hits [a-proteobacteria]    hypothetical protein NAS141_05043 [Sulfitobacter sp. NAS-14
. . . . . Phaeobacter gallaeciensis 2.10 ........................  190 2 hits [a-proteobacteria]    hypothetical protein RG210_16770 [Phaeobacter gallaeciensis
. . . . . Oceanicola granulosus HTCC2516 ........................  190 2 hits [a-proteobacteria]    hypothetical protein OG2516_03083 [Oceanicola granulosus HT
. . . . . Thalassiobium sp. R2A62 ...............................  190 2 hits [a-proteobacteria]    conserved hypothetical protein [Thalassiobium sp. R2A62] >g
. . . . . Rhodobacter sphaeroides ATCC 17029 ....................  189 2 hits [a-proteobacteria]    hypothetical protein Rsph17029_1639 [Rhodobacter sphaeroide
. . . . . Silicibacter sp. TrichCH4B ............................  188 2 hits [a-proteobacteria]    conserved hypothetical protein [Silicibacter sp. TrichCH4B]
. . . . . Rhodobacteraceae bacterium KLH11 ......................  188 2 hits [a-proteobacteria]    conserved hypothetical protein [Rhodobacteraceae bacterium 
. . . . . Rhodobacter sphaeroides KD131 .........................  188 2 hits [a-proteobacteria]    hypothetical protein RSKD131_1317 [Rhodobacter sphaeroides 
. . . . . Rhodobacter sphaeroides ATCC 17025 ....................  188 2 hits [a-proteobacteria]    hypothetical protein Rsph17025_2165 [Rhodobacter sphaeroide
. . . . . Rhodobacter sphaeroides 2.4.1 .........................  188 2 hits [a-proteobacteria]    hypothetical protein RSP_0009 [Rhodobacter sphaeroides 2.4.
. . . . . Silicibacter lacuscaerulensis ITI-1157 ................  187 2 hits [a-proteobacteria]    conserved hypothetical protein [Silicibacter lacuscaerulens
. . . . . Dinoroseobacter shibae DFL 12 .........................  187 2 hits [a-proteobacteria]    hypothetical protein Dshi_1592 [Dinoroseobacter shibae DFL 
. . . . . Ruegeria sp. TM1040 ...................................  186 2 hits [a-proteobacteria]    hypothetical protein TM1040_1378 [Ruegeria sp. TM1040] >gi|
. . . . . Roseovarius sp. 217 ...................................  183 2 hits [a-proteobacteria]    hypothetical protein ROS217_18632 [Roseovarius sp. 217] >gi
. . . . . Sagittula stellata E-37 ...............................  182 2 hits [a-proteobacteria]    hypothetical protein SSE37_06629 [Sagittula stellata E-37] 
. . . . . Oceanibulbus indolifex HEL-45 .........................  179 2 hits [a-proteobacteria]    hypothetical protein OIHEL45_12425 [Oceanibulbus indolifex 
. . . . . Rhodobacter sp. SW2 ...................................  178 2 hits [a-proteobacteria]    conserved hypothetical protein [Rhodobacter sp. SW2] >gi|25
. . . . . Roseovarius nubinhibens ISM ...........................  178 2 hits [a-proteobacteria]    hypothetical protein ISM_16270 [Roseovarius nubinhibens ISM
. . . . . Roseovarius sp. HTCC2601 ..............................  175 2 hits [a-proteobacteria]    hypothetical protein R2601_25616 [Roseovarius sp. HTCC2601]
. . . . . Octadecabacter antarcticus 238 ........................  174 2 hits [a-proteobacteria]    conserved hypothetical protein [Octadecabacter antarcticus 
. . . . . Citreicella sp. SE45 ..................................  173 2 hits [a-proteobacteria]    conserved hypothetical protein [Citreicella sp. SE45] >gi|2
. . . . . Octadecabacter antarcticus 307 ........................  172 2 hits [a-proteobacteria]    hypothetical protein OA307_3981 [Octadecabacter antarcticus
. . . . . Paracoccus denitrificans PD1222 .......................  169 2 hits [a-proteobacteria]    hypothetical protein Pden_4498 [Paracoccus denitrificans PD
. . . . . Oceanicola batsensis HTCC2597 .........................  168 2 hits [a-proteobacteria]    hypothetical protein OB2597_01917 [Oceanicola batsensis HTC
. . . . . Loktanella vestfoldensis SKA53 ........................  159 2 hits [a-proteobacteria]    hypothetical protein SKA53_04523 [Loktanella vestfoldensis 
. . . . . Jannaschia sp. CCS1 ...................................  150 2 hits [a-proteobacteria]    hypothetical protein Jann_2349 [Jannaschia sp. CCS1] >gi|88
. . . . Rhodobacterales bacterium HTCC2150 ----------------------  190 2 hits [a-proteobacteria]    hypothetical protein RB2150_03134 [Rhodobacterales bacteriu
. . . . Rhodobacterales bacterium Y4I ...........................  187 2 hits [a-proteobacteria]    conserved hypothetical protein [Rhodobacterales bacterium Y
. . . . Rhodobacterales bacterium HTCC2654 ......................  178 2 hits [a-proteobacteria]    hypothetical protein RB2654_03529 [Rhodobacterales bacteriu
. . . . Rhodobacterales bacterium HTCC2083 ......................  170 2 hits [a-proteobacteria]    conserved hypothetical protein [Rhodobacterales bacterium H
. . . . Rhodobacterales bacterium HTCC2255 ......................  142 2 hits [a-proteobacteria]    hypothetical protein OM2255_02942 [alpha proteobacterium HT
. . . Gluconobacter oxydans 621H --------------------------------   35 2 hits [a-proteobacteria]    hypothetical protein GOX0966 [Gluconobacter oxydans 621H] >
. . . Rhizobium sp. NGR234 ......................................   34 2 hits [a-proteobacteria]    putative transcriptional regulator, LysR family [Rhizobium 
. . Thioalkalivibrio sp. HL-EbGR7 -------------------------------  119 2 hits [g-proteobacteria]    hypothetical protein Tgr7_0496 [Thioalkalivibrio sp. HL-EbG
. . Thioalkalivibrio sp. K90mix .................................  113 2 hits [g-proteobacteria]    conserved hypothetical protein [Thioalkalivibrio sp. K90mix
. . Thiobacillus denitrificans ATCC 25259 .......................  105 2 hits [b-proteobacteria]    hypothetical protein Tbd_0245 [Thiobacillus denitrificans A
. . Allochromatium vinosum DSM 180 ..............................  104 2 hits [g-proteobacteria]    conserved hypothetical protein [Allochromatium vinosum DSM 
. . Nitrosococcus oceani ATCC 19707 .............................  102 2 hits [g-proteobacteria]    hypothetical protein Noc_2702 [Nitrosococcus oceani ATCC 19
. . Nitrosococcus oceani AFC27 ..................................  102 2 hits [g-proteobacteria]    hypothetical protein NOC27_2067 [Nitrosococcus oceani AFC27
. . Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica) .   90 2 hits [g-proteobacteria]    hypothetical protein Rmag_0019 [Candidatus Ruthia magnifica
. . Candidatus Vesicomyosocius okutanii HA ......................   88 2 hits [g-proteobacteria]    hypothetical protein COSY_0018 [Candidatus Vesicomyosocius 
. . Cardiobacterium hominis ATCC 15826 ..........................   56 4 hits [g-proteobacteria]    conserved hypothetical protein [Cardiobacterium hominis ATC
. Gemmatimonas aurantiaca T-27 ----------------------------------  135 2 hits [bacteria]            hypothetical protein GAU_1223 [Gemmatimonas aurantiaca T-27
. Lentisphaera araneosa HTCC2155 ................................   36 2 hits [bacteria]            hypothetical protein LNTAR_10216 [Lentisphaera araneosa HTC
. Streptomyces griseoflavus Tu4000 ..............................   35 1 hit  [high GC Gram+]       solute-binding protein (secreted protein) [Streptomyces gri
. Acaryochloris marina MBIC11017 ................................   34 3 hits [cyanobacteria]       hydroxyacylglutathione hydrolase, putative [Acaryochloris m
. Frankia sp. EuI1c .............................................   33 2 hits [high GC Gram+]       von Willebrand factor type A [Frankia sp. EuI1c] >gi|270731
. Bacteroides sp. D1 ............................................   33 2 hits [CFB group bacteria]  conserved hypothetical protein [Bacteroides sp. D1] >gi|262
. Bacteroides sp. 2_1_22 ........................................   33 2 hits [CFB group bacteria]  conserved hypothetical protein [Bacteroides sp. D1] >gi|262
. Bacteroides ovatus ATCC 8483 ..................................   33 2 hits [CFB group bacteria]  hypothetical protein BACOVA_05282 [Bacteroides ovatus ATCC 
. Bacteroides caccae ATCC 43185 .................................   33 2 hits [CFB group bacteria]  hypothetical protein BACCAC_03654 [Bacteroides caccae ATCC 

--------------------------------------------------------------------------------
Organism Report 
  Roseobacter sp. AzwK-3b [a-proteobacteria] taxid 351016
 ref|ZP_01903810.1| hypothetical protein RAZWK3B_15633 [Ros...     194  3e-48
 gb|EDM70842.1| hypothetical protein RAZWK3B_15633 [Roseoba...     194  3e-48

  Roseovarius sp. TM1035 [a-proteobacteria] taxid 391613
 ref|ZP_01880827.1| hypothetical protein RTM1035_07318 [Ros...     192  9e-48
 gb|EDM31017.1| hypothetical protein RTM1035_07318 [Roseova...     192  9e-48

  Sulfitobacter sp. EE-36 [a-proteobacteria] taxid 52598
 ref|ZP_00954804.1| hypothetical protein EE36_14922 [Sulfit...     191  2e-47
 gb|EAP84312.1| hypothetical protein EE36_14922 [Sulfitobac...     191  2e-47

  Ruegeria sp. R11 [a-proteobacteria] taxid 439497
 ref|ZP_05087921.1| conserved hypothetical protein [Ruegeri...     191  2e-47
 gb|EEB69613.1| conserved hypothetical protein [Ruegeria sp...     191  2e-47

  Ruegeria pomeroyi DSS-3 [a-proteobacteria] taxid 246200
 ref|YP_167343.1| hypothetical protein SPO2113 [Ruegeria po...     191  2e-47
 gb|AAV95384.1| hypothetical protein SPO2113 [Ruegeria pome...     191  2e-47

  Phaeobacter gallaeciensis BS107 [a-proteobacteria] taxid 391619
 ref|ZP_02143751.1| hypothetical protein RGBS107_14411 [Pha...     191  3e-47
 gb|EDQ14552.1| hypothetical protein RGBS107_14411 [Phaeoba...     191  3e-47

  Sulfitobacter sp. NAS-14.1 [a-proteobacteria] taxid 314267
 ref|ZP_00962283.1| hypothetical protein NAS141_05043 [Sulf...     190  3e-47
 gb|EAP80676.1| hypothetical protein NAS141_05043 [Sulfitob...     190  3e-47

  Phaeobacter gallaeciensis 2.10 [a-proteobacteria] taxid 383629
 ref|ZP_02148524.1| hypothetical protein RG210_16770 [Phaeo...     190  4e-47
 gb|EDQ09862.1| hypothetical protein RG210_16770 [Phaeobact...     190  4e-47

  Oceanicola granulosus HTCC2516 [a-proteobacteria] taxid 314256
 ref|ZP_01156870.1| hypothetical protein OG2516_03083 [Ocea...     190  4e-47
 gb|EAR50951.1| hypothetical protein OG2516_03083 [Oceanico...     190  4e-47

  Thalassiobium sp. R2A62 [a-proteobacteria] taxid 633131
 ref|ZP_05342802.1| conserved hypothetical protein [Thalass...     190  4e-47
 gb|EET48469.1| conserved hypothetical protein [Thalassiobi...     190  4e-47

  Rhodobacterales bacterium HTCC2150 [a-proteobacteria] taxid 388401
 ref|ZP_01742712.1| hypothetical protein RB2150_03134 [Rhod...     190  4e-47
 gb|EBA02925.1| hypothetical protein RB2150_03134 [Rhodobac...     190  4e-47

  Roseobacter sp. SK209-2-6 [a-proteobacteria] taxid 388739
 ref|ZP_01753393.1| hypothetical protein RSK20926_18517 [Ro...     189  6e-47
 gb|EBA17760.1| hypothetical protein RSK20926_18517 [Roseob...     189  6e-47

  Rhodobacter sphaeroides ATCC 17029 [a-proteobacteria] taxid 349101
 ref|YP_001043521.1| hypothetical protein Rsph17029_1639 [R...     189  6e-47
 gb|ABN76749.1| conserved hypothetical protein [Rhodobacter...     189  6e-47

  Silicibacter sp. TrichCH4B [a-proteobacteria] taxid 644076
 ref|ZP_05740588.1| conserved hypothetical protein [Silicib...     188  1e-46
 gb|EEW59884.1| conserved hypothetical protein [Silicibacte...     188  1e-46

  Rhodobacteraceae bacterium KLH11 [a-proteobacteria] taxid 467661
 ref|ZP_05124517.1| conserved hypothetical protein [Rhodoba...     188  1e-46
 gb|EEE39149.1| conserved hypothetical protein [Rhodobacter...     188  1e-46

  Rhodobacter sphaeroides KD131 [a-proteobacteria] taxid 557760
 ref|YP_002525678.1| hypothetical protein RSKD131_1317 [Rho...     188  2e-46
 gb|ACM01177.1| Hypothetical Protein RSKD131_1317 [Rhodobac...     188  2e-46

  Rhodobacter sphaeroides ATCC 17025 [a-proteobacteria] taxid 349102
 ref|YP_001168360.1| hypothetical protein Rsph17025_2165 [R...     188  2e-46
 gb|ABP71055.1| hypothetical protein Rsph17025_2165 [Rhodob...     188  2e-46

  Rhodobacter sphaeroides 2.4.1 [a-proteobacteria] taxid 272943
 ref|YP_353079.1| hypothetical protein RSP_0009 [Rhodobacte...     188  2e-46
 gb|ABA79178.1| conserved hypothetical protein [Rhodobacter...     188  2e-46

  Silicibacter lacuscaerulensis ITI-1157 [a-proteobacteria] taxid 644107
 ref|ZP_05787909.1| conserved hypothetical protein [Silicib...     187  2e-46
 gb|EEX11025.1| conserved hypothetical protein [Silicibacte...     187  2e-46

  Rhodobacterales bacterium Y4I [a-proteobacteria] taxid 439496
 ref|ZP_05079542.1| conserved hypothetical protein [Rhodoba...     187  3e-46
 gb|EDZ47521.1| conserved hypothetical protein [Rhodobacter...     187  3e-46

  Dinoroseobacter shibae DFL 12 [a-proteobacteria] taxid 398580
 ref|YP_001532935.1| hypothetical protein Dshi_1592 [Dinoro...     187  3e-46
 gb|ABV93334.1| conserved hypothetical protein [Dinoroseoba...     187  3e-46

  Ruegeria sp. TM1040 [a-proteobacteria] taxid 292414
 ref|YP_613373.1| hypothetical protein TM1040_1378 [Ruegeri...     186  6e-46
 gb|ABF64111.1| hypothetical protein TM1040_1378 [Ruegeria ...     186  6e-46

  Roseobacter litoralis Och 149 [a-proteobacteria] taxid 391595
 ref|ZP_02139047.1| hypothetical protein RLO149_19889 [Rose...     185  1e-45
 gb|EDQ19376.1| hypothetical protein RLO149_19889 [Roseobac...     185  1e-45

  Roseovarius sp. 217 [a-proteobacteria] taxid 314264
 ref|ZP_01033890.1| hypothetical protein ROS217_18632 [Rose...     183  6e-45
 gb|EAQ26571.1| hypothetical protein ROS217_18632 [Roseovar...     183  6e-45

  Sagittula stellata E-37 [a-proteobacteria] taxid 388399
 ref|ZP_01747128.1| hypothetical protein SSE37_06629 [Sagit...     182  1e-44
 gb|EBA07291.1| hypothetical protein SSE37_06629 [Sagittula...     182  1e-44

  Roseobacter denitrificans OCh 114 [a-proteobacteria] taxid 375451
 ref|YP_683000.1| hypothetical protein RD1_2783 [Roseobacte...     180  4e-44
 gb|ABG32314.1| hypothetical protein RD1_2783 [Roseobacter ...     180  4e-44

  Roseobacter sp. GAI101 [a-proteobacteria] taxid 391589
 ref|ZP_05101760.1| conserved hypothetical protein [Roseoba...     179  6e-44
 gb|EEB86062.1| conserved hypothetical protein [Roseobacter...     179  6e-44

  Oceanibulbus indolifex HEL-45 [a-proteobacteria] taxid 391624
 ref|ZP_02154523.1| hypothetical protein OIHEL45_12425 [Oce...     179  1e-43
 gb|EDQ04141.1| hypothetical protein OIHEL45_12425 [Oceanib...     179  1e-43

  Rhodobacter sp. SW2 [a-proteobacteria] taxid 371731
 ref|ZP_05843039.1| conserved hypothetical protein [Rhodoba...     178  2e-43
 gb|EEW25955.1| conserved hypothetical protein [Rhodobacter...     178  2e-43

  Roseobacter sp. MED193 [a-proteobacteria] taxid 314262
 ref|ZP_01056951.1| hypothetical protein MED193_04716 [Rose...     178  2e-43
 gb|EAQ45103.1| hypothetical protein MED193_04716 [Roseobac...     178  2e-43

  Rhodobacterales bacterium HTCC2654 [a-proteobacteria] taxid 314271
 ref|ZP_01012848.1| hypothetical protein RB2654_03529 [Rhod...     178  2e-43
 gb|EAQ13753.1| hypothetical protein RB2654_03529 [Rhodobac...     178  2e-43

  Roseovarius nubinhibens ISM [a-proteobacteria] taxid 89187
 ref|ZP_00960867.1| hypothetical protein ISM_16270 [Roseova...     178  2e-43
 gb|EAP76438.1| hypothetical protein ISM_16270 [Roseovarius...     178  2e-43

  Roseovarius sp. HTCC2601 [a-proteobacteria] taxid 314265
 ref|ZP_01445371.1| hypothetical protein R2601_25616 [Roseo...     175  2e-42
 gb|EAU44391.1| hypothetical protein R2601_25616 [Roseovari...     175  2e-42

  Roseobacter sp. CCS2 [a-proteobacteria] taxid 391593
 ref|ZP_01752182.1| hypothetical protein RCCS2_00167 [Roseo...     174  2e-42
 gb|EBA10849.1| hypothetical protein RCCS2_00167 [Roseobact...     174  2e-42

  Octadecabacter antarcticus 238 [a-proteobacteria] taxid 391616
 ref|ZP_05062928.1| conserved hypothetical protein [Octadec...     174  3e-42
 gb|EDY88167.1| conserved hypothetical protein [Octadecabac...     174  3e-42

  Citreicella sp. SE45 [a-proteobacteria] taxid 501479
 ref|ZP_05782011.1| conserved hypothetical protein [Citreic...     173  4e-42
 gb|EEX15775.1| conserved hypothetical protein [Citreicella...     173  4e-42

  Octadecabacter antarcticus 307 [a-proteobacteria] taxid 391626
 ref|ZP_05052605.1| hypothetical protein OA307_3981 [Octade...     172  1e-41
 gb|EDY78871.1| hypothetical protein OA307_3981 [Octadecaba...     172  1e-41

  Rhodobacterales bacterium HTCC2083 [a-proteobacteria] taxid 314270
 ref|ZP_05074698.1| conserved hypothetical protein [Rhodoba...     170  3e-41
 gb|EDZ42358.1| conserved hypothetical protein [Rhodobacter...     170  3e-41

  Paracoccus denitrificans PD1222 [a-proteobacteria] taxid 318586
 ref|YP_918258.1| hypothetical protein Pden_4498 [Paracoccu...     169  9e-41
 gb|ABL72562.1| conserved hypothetical protein [Paracoccus ...     169  9e-41

  Oceanicola batsensis HTCC2597 [a-proteobacteria] taxid 252305
 ref|ZP_00999087.1| hypothetical protein OB2597_01917 [Ocea...     168  2e-40
 gb|EAQ03337.1| hypothetical protein OB2597_01917 [Oceanico...     168  2e-40

  Loktanella vestfoldensis SKA53 [a-proteobacteria] taxid 314232
 ref|ZP_01003529.1| hypothetical protein SKA53_04523 [Lokta...     159  6e-38
 gb|EAQ06322.1| hypothetical protein SKA53_04523 [Loktanell...     159  6e-38

  Jannaschia sp. CCS1 [a-proteobacteria] taxid 290400
 ref|YP_510291.1| hypothetical protein Jann_2349 [Jannaschi...     150  3e-35
 gb|ABD55266.1| hypothetical protein Jann_2349 [Jannaschia ...     150  3e-35

  Rhodobacterales bacterium HTCC2255 [a-proteobacteria] taxid 367336
 ref|ZP_01448460.1| hypothetical protein OM2255_02942 [alph...     142  1e-32
 gb|EAU51188.1| hypothetical protein OM2255_02942 [alpha pr...     142  1e-32

  Gemmatimonas aurantiaca T-27 [bacteria] taxid 379066
 ref|YP_002760735.1| hypothetical protein GAU_1223 [Gemmati...     135  9e-31
 dbj|BAH38265.1| hypothetical protein [Gemmatimonas auranti...     135  9e-31

  Thioalkalivibrio sp. HL-EbGR7 [g-proteobacteria] taxid 396588
 ref|YP_002512580.1| hypothetical protein Tgr7_0496 [Thioal...     119  1e-25
 gb|ACL71593.1| conserved hypothetical protein [Thioalkaliv...     119  1e-25

  Thioalkalivibrio sp. K90mix [g-proteobacteria] taxid 396595
 ref|ZP_03689324.1| conserved hypothetical protein [Thioalk...     113  5e-24
 gb|EEG17163.1| conserved hypothetical protein [Thioalkaliv...     113  5e-24

  Thiobacillus denitrificans ATCC 25259 [b-proteobacteria] taxid 292415
 ref|YP_314003.1| hypothetical protein Tbd_0245 [Thiobacill...     105  1e-21
 gb|AAZ96198.1| conserved hypothetical protein [Thiobacillu...     105  1e-21

  Allochromatium vinosum DSM 180 [g-proteobacteria] taxid 572477
 ref|ZP_04774611.1| conserved hypothetical protein [Allochr...     104  4e-21
 gb|EER65416.1| conserved hypothetical protein [Allochromat...     104  4e-21

  Nitrosococcus oceani ATCC 19707 [g-proteobacteria] taxid 323261
 ref|YP_344685.1| hypothetical protein Noc_2702 [Nitrosococ...     102  1e-20
 gb|ABA59155.1| conserved hypothetical protein [Nitrosococc...     102  1e-20

  Nitrosococcus oceani AFC27 [g-proteobacteria] taxid 473788
 ref|ZP_05048511.1| hypothetical protein NOC27_2067 [Nitros...     102  1e-20
 gb|EDZ65387.1| hypothetical protein NOC27_2067 [Nitrosococ...     102  1e-20

  Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica) [g-proteobacteria] taxid 413404
 ref|YP_903293.1| hypothetical protein Rmag_0019 [Candidatu...      90  7e-17
 gb|ABL01822.1| conserved hypothetical protein [Candidatus ...      90  7e-17

  Candidatus Vesicomyosocius okutanii HA [g-proteobacteria] taxid 412965
 ref|YP_001218881.1| hypothetical protein COSY_0018 [Candid...      88  2e-16
 dbj|BAF61157.1| conserved hypothetical protein [Candidatus...      88  2e-16

  Cardiobacterium hominis ATCC 15826 [g-proteobacteria] taxid 638300
 ref|ZP_05705376.1| conserved hypothetical protein [Cardiob...      56  9e-07
 gb|EEV88472.1| conserved hypothetical protein [Cardiobacte...      56  9e-07
 ref|ZP_05705375.1| hypothetical protein HMPREF0198_1410 [C...      36  0.72
 gb|EEV88471.1| hypothetical protein HMPREF0198_1410 [Cardi...      36  0.72

  Lentisphaera araneosa HTCC2155 [bacteria] taxid 313628
 ref|ZP_01873825.1| hypothetical protein LNTAR_10216 [Lenti...      36  1.1
 gb|EDM28282.1| hypothetical protein LNTAR_10216 [Lentispha...      36  1.1

  Streptomyces griseoflavus Tu4000 [high GC Gram+] taxid 467200
 ref|ZP_05540510.1| solute-binding protein (secreted protei...      35  1.4

  Gluconobacter oxydans 621H [a-proteobacteria] taxid 290633
 ref|YP_191394.1| hypothetical protein GOX0966 [Gluconobact...      35  2.1
 gb|AAW60738.1| Hypothetical protein GOX0966 [Gluconobacter...      35  2.1

  Acaryochloris marina MBIC11017 [cyanobacteria] taxid 329726
 ref|YP_001520052.1| hydroxyacylglutathione hydrolase, puta...      34  4.2
 sp|B0BZI8.1|GLO2_ACAM1 RecName: Full=Hydroxyacylglutathion...      34  4.2
 gb|ABW30733.1| hydroxyacylglutathione hydrolase, putative ...      34  4.2

  Rhizobium sp. NGR234 [a-proteobacteria] taxid 394
 ref|YP_002822908.1| putative transcriptional regulator, Ly...      34  4.8
 gb|ACP22155.1| putative transcriptional regulator, LysR fa...      34  4.8

  Frankia sp. EuI1c [high GC Gram+] taxid 298654
 ref|ZP_06238860.1| von Willebrand factor type A [Frankia s...      33  4.8
 gb|EFA59795.1| von Willebrand factor type A [Frankia sp. E...      33  4.8

  Bacteroides sp. D1 [CFB group bacteria] taxid 556258
 ref|ZP_04546229.1| conserved hypothetical protein [Bactero...      33  7.4
 gb|EEO50248.1| conserved hypothetical protein [Bacteroides...      33  7.4

  Bacteroides sp. 2_1_22 [CFB group bacteria] taxid 469588
 ref|ZP_06085302.1| conserved hypothetical protein [Bactero...      33  7.4
 gb|EEZ02715.1| conserved hypothetical protein [Bacteroides...      33  7.4

  Bacteroides ovatus ATCC 8483 [CFB group bacteria] taxid 411476
 ref|ZP_02068267.1| hypothetical protein BACOVA_05282 [Bact...      33  7.4
 gb|EDO09420.1| hypothetical protein BACOVA_05282 [Bacteroi...      33  7.4

  Bacteroides caccae ATCC 43185 [CFB group bacteria] taxid 411901
 ref|ZP_01962008.1| hypothetical protein BACCAC_03654 [Bact...      33  7.5
 gb|EDM19331.1| hypothetical protein BACCAC_03654 [Bacteroi...      33  7.5


--------------------------------------------------------------------------------
Taxonomy Report 
Bacteria ............................................................   128 hits   63 orgs [root; cellular organisms]
. Proteobacteria ....................................................   110 hits   54 orgs 
. . Alphaproteobacteria .............................................    90 hits   45 orgs 
. . . Rhodobacterales ...............................................    86 hits   43 orgs 
. . . . Rhodobacteraceae ............................................    76 hits   38 orgs 
. . . . . Roseobacter ...............................................    14 hits    7 orgs 
. . . . . . Roseobacter sp. AzwK-3b .................................     2 hits    1 orgs 
. . . . . . Roseobacter sp. SK209-2-6 ...............................     2 hits    1 orgs 
. . . . . . Roseobacter litoralis Och 149 ...........................     2 hits    1 orgs [Roseobacter litoralis]
. . . . . . Roseobacter denitrificans OCh 114 .......................     2 hits    1 orgs [Roseobacter denitrificans]
. . . . . . Roseobacter sp. GAI101 ..................................     2 hits    1 orgs 
. . . . . . Roseobacter sp. MED193 ..................................     2 hits    1 orgs 
. . . . . . Roseobacter sp. CCS2 ....................................     2 hits    1 orgs 
. . . . . Roseovarius ...............................................     8 hits    4 orgs 
. . . . . . Roseovarius sp. TM1035 ..................................     2 hits    1 orgs 
. . . . . . Roseovarius sp. 217 .....................................     2 hits    1 orgs 
. . . . . . Roseovarius nubinhibens ISM .............................     2 hits    1 orgs [Roseovarius nubinhibens]
. . . . . . Roseovarius sp. HTCC2601 ................................     2 hits    1 orgs 
. . . . . Sulfitobacter .............................................     4 hits    2 orgs 
. . . . . . Sulfitobacter sp. EE-36 .................................     2 hits    1 orgs 
. . . . . . Sulfitobacter sp. NAS-14.1 ..............................     2 hits    1 orgs 
. . . . . Ruegeria ..................................................    10 hits    5 orgs 
. . . . . . Ruegeria sp. R11 ........................................     2 hits    1 orgs 
. . . . . . Ruegeria pomeroyi DSS-3 .................................     2 hits    1 orgs [Ruegeria pomeroyi]
. . . . . . Silicibacter sp. TrichCH4B ..............................     2 hits    1 orgs 
. . . . . . Silicibacter lacuscaerulensis ITI-1157 ..................     2 hits    1 orgs [Ruegeria lacuscaerulensis]
. . . . . . Ruegeria sp. TM1040 .....................................     2 hits    1 orgs 
. . . . . Phaeobacter gallaeciensis .................................     4 hits    2 orgs [Phaeobacter]
. . . . . . Phaeobacter gallaeciensis BS107 .........................     2 hits    1 orgs 
. . . . . . Phaeobacter gallaeciensis 2.10 ..........................     2 hits    1 orgs 
. . . . . Oceanicola ................................................     4 hits    2 orgs 
. . . . . . Oceanicola granulosus HTCC2516 ..........................     2 hits    1 orgs [Oceanicola granulosus]
. . . . . . Oceanicola batsensis HTCC2597 ...........................     2 hits    1 orgs [Oceanicola batsensis]
. . . . . Thalassiobium sp. R2A62 ...................................     2 hits    1 orgs [Thalassobium]
. . . . . Rhodobacter ...............................................    10 hits    5 orgs 
. . . . . . Rhodobacter sphaeroides .................................     8 hits    4 orgs 
. . . . . . . Rhodobacter sphaeroides ATCC 17029 ....................     2 hits    1 orgs 
. . . . . . . Rhodobacter sphaeroides KD131 .........................     2 hits    1 orgs 
. . . . . . . Rhodobacter sphaeroides ATCC 17025 ....................     2 hits    1 orgs 
. . . . . . . Rhodobacter sphaeroides 2.4.1 .........................     2 hits    1 orgs 
. . . . . . Rhodobacter sp. SW2 .....................................     2 hits    1 orgs 
. . . . . Rhodobacteraceae bacterium KLH11 ..........................     2 hits    1 orgs [unclassified Rhodobacteraceae]
. . . . . Dinoroseobacter shibae DFL 12 .............................     2 hits    1 orgs [Dinoroseobacter; Dinoroseobacter shibae]
. . . . . Sagittula stellata E-37 ...................................     2 hits    1 orgs [Sagittula; Sagittula stellata]
. . . . . Oceanibulbus indolifex HEL-45 .............................     2 hits    1 orgs [Oceanibulbus; Oceanibulbus indolifex]
. . . . . Octadecabacter antarcticus ................................     4 hits    2 orgs [Octadecabacter]
. . . . . . Octadecabacter antarcticus 238 ..........................     2 hits    1 orgs 
. . . . . . Octadecabacter antarcticus 307 ..........................     2 hits    1 orgs 
. . . . . Citreicella sp. SE45 ......................................     2 hits    1 orgs [Citreicella]
. . . . . Paracoccus denitrificans PD1222 ...........................     2 hits    1 orgs [Paracoccus; Paracoccus denitrificans]
. . . . . Loktanella vestfoldensis SKA53 ............................     2 hits    1 orgs [Loktanella; Loktanella vestfoldensis]
. . . . . Jannaschia sp. CCS1 .......................................     2 hits    1 orgs [Jannaschia]
. . . . unclassified Rhodobacterales ................................    10 hits    5 orgs 
. . . . . Rhodobacterales bacterium HTCC2150 ........................     2 hits    1 orgs 
. . . . . Rhodobacterales bacterium Y4I .............................     2 hits    1 orgs 
. . . . . Rhodobacterales bacterium HTCC2654 ........................     2 hits    1 orgs 
. . . . . Rhodobacterales bacterium HTCC2083 ........................     2 hits    1 orgs 
. . . . . Rhodobacterales bacterium HTCC2255 ........................     2 hits    1 orgs 
. . . Gluconobacter oxydans 621H ....................................     2 hits    1 orgs [Rhodospirillales; Acetobacteraceae; Gluconobacter; Gluconobacter oxydans]
. . . Rhizobium sp. NGR234 ..........................................     2 hits    1 orgs [Rhizobiales; Rhizobiaceae; Rhizobium/Agrobacterium group; Rhizobium]
. . Gammaproteobacteria .............................................    18 hits    8 orgs 
. . . Chromatiales ..................................................    10 hits    5 orgs 
. . . . Thioalkalivibrio ............................................     4 hits    2 orgs [Ectothiorhodospiraceae]
. . . . . Thioalkalivibrio sp. HL-EbGR7 .............................     2 hits    1 orgs 
. . . . . Thioalkalivibrio sp. K90mix ...............................     2 hits    1 orgs 
. . . . Chromatiaceae ...............................................     6 hits    3 orgs 
. . . . . Allochromatium vinosum DSM 180 ............................     2 hits    1 orgs [Allochromatium; Allochromatium vinosum]
. . . . . Nitrosococcus oceani ......................................     4 hits    2 orgs [Nitrosococcus]
. . . . . . Nitrosococcus oceani ATCC 19707 .........................     2 hits    1 orgs 
. . . . . . Nitrosococcus oceani AFC27 ..............................     2 hits    1 orgs 
. . . sulfur-oxidizing symbionts ....................................     4 hits    2 orgs [unclassified Gammaproteobacteria]
. . . . Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica) .     2 hits    1 orgs [Candidatus Ruthia magnifica]
. . . . Candidatus Vesicomyosocius okutanii HA ......................     2 hits    1 orgs [Calyptogena okutanii thioautotrophic gill symbiont]
. . . Cardiobacterium hominis ATCC 15826 ............................     4 hits    1 orgs [Cardiobacteriales; Cardiobacteriaceae; Cardiobacterium; Cardiobacterium hominis]
. . Thiobacillus denitrificans ATCC 25259 ...........................     2 hits    1 orgs [Betaproteobacteria; Hydrogenophilales; Hydrogenophilaceae; Thiobacillus; Thiobacillus denitrificans]
. Gemmatimonas aurantiaca T-27 ......................................     2 hits    1 orgs [Gemmatimonadetes; Gemmatimonadetes (class); Gemmatimonadales; Gemmatimonadaceae; Gemmatimonas; Gemmatimonas aurantiaca]
. Lentisphaera araneosa HTCC2155 ....................................     2 hits    1 orgs [Chlamydiae/Verrucomicrobia group; Lentisphaerae; Lentisphaerales; Lentisphaeraceae; Lentisphaera; Lentisphaera araneosa]
. Actinomycetales ...................................................     3 hits    2 orgs [Actinobacteria; Actinobacteria (class); Actinobacteridae]
. . Streptomyces griseoflavus Tu4000 ................................     1 hits    1 orgs [Streptomycineae; Streptomycetaceae; Streptomyces; Streptomyces griseoflavus]
. . Frankia sp. EuI1c ...............................................     2 hits    1 orgs [Frankineae; Frankiaceae; Frankia]
. Acaryochloris marina MBIC11017 ....................................     3 hits    1 orgs [Cyanobacteria; unclassified Cyanobacteria; Acaryochloris; Acaryochloris marina]
. Bacteroides .......................................................     8 hits    4 orgs [Bacteroidetes/Chlorobi group; Bacteroidetes; Bacteroidia; Bacteroidales; Bacteroidaceae]
. . Bacteroides sp. D1 ..............................................     2 hits    1 orgs 
. . Bacteroides sp. 2_1_22 ..........................................     2 hits    1 orgs 
. . Bacteroides ovatus ATCC 8483 ....................................     2 hits    1 orgs [Bacteroides ovatus]
. . Bacteroides caccae ATCC 43185 ...................................     2 hits    1 orgs [Bacteroides caccae]
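
The Taxonomy Report above can be summarised programmatically. The following is a minimal sketch, not part of the original annotation: it assumes the report is available as plain text in the layout shown, and the names `REPORT`, `LINE_RE` and `parse_taxonomy_report` are hypothetical. It reads the indentation depth and hit counts of each line and computes the share of hits that fall in Rhodobacterales.

```python
import re

# A few lines abridged from the Taxonomy Report (dot leaders shortened).
REPORT = """\
Bacteria ..........   128 hits   63 orgs [root; cellular organisms]
. Proteobacteria ..........   110 hits   54 orgs
. . Alphaproteobacteria ..........    90 hits   45 orgs
. . . Rhodobacterales ..........    86 hits   43 orgs
. . . . . . Roseobacter sp. AzwK-3b ..........     2 hits    1 orgs
"""

# Optional ". " indent units, taxon name, dot leader, hit count, organism count.
LINE_RE = re.compile(r"^((?:\. )*)(.+?) \.+\s+(\d+) hits\s+(\d+) orgs")


def parse_taxonomy_report(text):
    """Return one dict per report line: depth, taxon, hits, orgs."""
    rows = []
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match is None:
            continue  # skip blank lines and separators
        indent, taxon, hits, orgs = match.groups()
        rows.append({
            "depth": len(indent) // 2,  # each ". " is one level
            "taxon": taxon,
            "hits": int(hits),
            "orgs": int(orgs),
        })
    return rows


rows = parse_taxonomy_report(REPORT)
by_taxon = {row["taxon"]: row for row in rows}
total_hits = by_taxon["Bacteria"]["hits"]
share = by_taxon["Rhodobacterales"]["hits"] / total_hits
print(f"Rhodobacterales: {share:.0%} of {total_hits} hits")
```

With the counts in the report (86 of 128 hits), about two thirds of all hits fall in Rhodobacterales.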

BLAST

PROTOCOL

BLAST

Query description : GOS_1149010 Translation [66-764 indirect strand]

Molecule type : amino acid

Query length : 233

Database name : nr

Database description : All non-redundant GenBank CDS translations + PDB + SwissProt + PIR + PRF, excluding environmental samples from WGS projects

Program : BLASTP 2.2.22+


RESULTS ANALYSIS


I ran BLASTP against the non-redundant protein sequences (nr) database, with the maximum number of target sequences set to 1000.
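
For readers who want to repeat the search from the command line, the following sketch builds an equivalent `blastp` call. It assumes a local NCBI BLAST+ installation and a local copy of nr, neither of which the annotation mentions; the file names and the helper `build_blastp_command` are hypothetical.

```python
import shlex


def build_blastp_command(query_fasta, out_path, db="nr", max_target_seqs=1000):
    """Build a blastp argument list matching the protocol parameters."""
    return [
        "blastp",
        "-query", query_fasta,                     # protein query in FASTA format
        "-db", db,                                 # non-redundant protein database
        "-max_target_seqs", str(max_target_seqs),  # report up to 1000 targets
        "-out", out_path,                          # where the report is written
    ]


# Hypothetical file names; replace with the real query and output paths.
cmd = build_blastp_command("GOS_1149010.faa", "GOS_1149010_blastp.txt")
print(shlex.join(cmd))
```

The list form can be passed to `subprocess.run(cmd, check=True)` once BLAST+ and the database are in place.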


a) The BLASTP results show that this sequence is highly similar to hypothetical protein Rsph17029_1639 [Rhodobacter sphaeroides ATCC 17029] (bit score 189, e-value 6e-47).

b) This organism is classified as Bacteria; Proteobacteria; Alphaproteobacteria; Rhodobacterales; Rhodobacteraceae; Rhodobacter.

c) The sequence is therefore a hypothetical protein-coding gene.

d) The sequence corresponds to a conserved, putative, uncharacterized hypothetical protein found in species such as Rhodobacter sphaeroides, Thalassiobium sp. and Rhodobacterales bacterium.

e) The homology indicates that the translated sequence most likely belongs to the Rhodobacterales (86 of the 128 hits in the Taxonomy Report fall in this order).

The identity with the conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] is 58%, which suggests that my metagenomic sequence may have the same function.

f) The best e-value is 3e-48, with a bit score of 194, obtained for hypothetical protein RAZWK3B_15633 [Roseobacter sp. AzwK-3b]; this also indicates good similarity.
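
The ranking used in this analysis can be checked against the hit table under RAW RESULTS. The sketch below is an illustration only, assuming the table is available as plain text in the layout shown; `HITS`, `HIT_RE` and `parse_hits` are hypothetical names. It parses three rows copied from that table and picks the hit with the highest bit score.

```python
import re

# Three rows copied from the RAW RESULTS hit table.
HITS = """\
ref|ZP_01903810.1|  hypothetical protein RAZWK3B_15633 [Roseob...   194    3e-48
ref|ZP_01880827.1|  hypothetical protein RTM1035_07318 [Roseov...   192    9e-48
ref|YP_001043521.1|  hypothetical protein Rsph17029_1639 [Rhod...   189    6e-47
"""

# Accession, free-text description, bit score, e-value (plain or exponent form).
HIT_RE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\d+(?:\.\d+)?(?:e-?\d+)?)\s*$")


def parse_hits(text):
    """Return one dict per table row: accession, description, bits, evalue."""
    hits = []
    for line in text.splitlines():
        match = HIT_RE.match(line)
        if match is None:
            continue  # skip headers and blank lines
        accession, description, bits, evalue = match.groups()
        hits.append({
            "accession": accession,
            "description": description,
            "bits": int(bits),
            "evalue": float(evalue),
        })
    return hits


hits = parse_hits(HITS)
best = max(hits, key=lambda hit: hit["bits"])
rsph = next(hit for hit in hits if "Rsph17029" in hit["description"])
print("top hit:", best["accession"], best["bits"], best["evalue"])
print("R. sphaeroides ATCC 17029:", rsph["bits"], rsph["evalue"])
```

On these rows the top hit is the Roseobacter sp. AzwK-3b entry, while the Rhodobacter sphaeroides ATCC 17029 entry scores slightly lower.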



RAW RESULTS
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|ZP_01903810.1|  hypothetical protein RAZWK3B_15633 [Roseob...   194    3e-48
ref|ZP_01880827.1|  hypothetical protein RTM1035_07318 [Roseov...   192    9e-48
ref|ZP_00954804.1|  hypothetical protein EE36_14922 [Sulfitoba...   191    2e-47
ref|ZP_05087921.1|  conserved hypothetical protein [Ruegeria s...   191    2e-47
ref|YP_167343.1|  hypothetical protein SPO2113 [Ruegeria pomer...   191    2e-47 
ref|ZP_02143751.1|  hypothetical protein RGBS107_14411 [Phaeob...   191    3e-47
ref|ZP_00962283.1|  hypothetical protein NAS141_05043 [Sulfito...   190    3e-47
ref|ZP_02148524.1|  hypothetical protein RG210_16770 [Phaeobac...   190    4e-47
ref|ZP_01156870.1|  hypothetical protein OG2516_03083 [Oceanic...   190    4e-47
ref|ZP_05342802.1|  conserved hypothetical protein [Thalassiob...   190    4e-47
ref|ZP_01742712.1|  hypothetical protein RB2150_03134 [Rhodoba...   190    4e-47
ref|ZP_01753393.1|  hypothetical protein RSK20926_18517 [Roseo...   189    6e-47
ref|YP_001043521.1|  hypothetical protein Rsph17029_1639 [Rhod...   189    6e-47 
ref|ZP_05740588.1|  conserved hypothetical protein [Silicibact...   188    1e-46
ref|ZP_05124517.1|  conserved hypothetical protein [Rhodobacte...   188    1e-46
ref|YP_002525678.1|  hypothetical protein RSKD131_1317 [Rhodob...   188    2e-46 
ref|YP_001168360.1|  hypothetical protein Rsph17025_2165 [Rhod...   188    2e-46 
ref|YP_353079.1|  hypothetical protein RSP_0009 [Rhodobacter s...   188    2e-46 
ref|ZP_05787909.1|  conserved hypothetical protein [Silicibact...   187    2e-46
ref|ZP_05079542.1|  conserved hypothetical protein [Rhodobacte...   187    3e-46
ref|YP_001532935.1|  hypothetical protein Dshi_1592 [Dinoroseo...   187    3e-46 
ref|YP_613373.1|  hypothetical protein TM1040_1378 [Ruegeria s...   186    6e-46 
ref|ZP_02139047.1|  hypothetical protein RLO149_19889 [Roseoba...   185    1e-45
ref|ZP_01033890.1|  hypothetical protein ROS217_18632 [Roseova...   183    6e-45
ref|ZP_01747128.1|  hypothetical protein SSE37_06629 [Sagittul...   182    1e-44
ref|YP_683000.1|  hypothetical protein RD1_2783 [Roseobacter d...   180    4e-44 
ref|ZP_05101760.1|  conserved hypothetical protein [Roseobacte...   179    6e-44
ref|ZP_02154523.1|  hypothetical protein OIHEL45_12425 [Oceani...   179    1e-43
ref|ZP_05843039.1|  conserved hypothetical protein [Rhodobacte...   178    2e-43
ref|ZP_01056951.1|  hypothetical protein MED193_04716 [Roseoba...   178    2e-43
ref|ZP_01012848.1|  hypothetical protein RB2654_03529 [Rhodoba...   178    2e-43
ref|ZP_00960867.1|  hypothetical protein ISM_16270 [Roseovariu...   178    2e-43
ref|ZP_01445371.1|  hypothetical protein R2601_25616 [Roseovar...   175    2e-42
ref|ZP_01752182.1|  hypothetical protein RCCS2_00167 [Roseobac...   174    2e-42
ref|ZP_05062928.1|  conserved hypothetical protein [Octadecaba...   174    3e-42
ref|ZP_05782011.1|  conserved hypothetical protein [Citreicell...   173    4e-42
ref|ZP_05052605.1|  hypothetical protein OA307_3981 [Octadecab...   172    1e-41
ref|ZP_05074698.1|  conserved hypothetical protein [Rhodobacte...   170    3e-41
ref|YP_918258.1|  hypothetical protein Pden_4498 [Paracoccus d...   169    9e-41 
ref|ZP_00999087.1|  hypothetical protein OB2597_01917 [Oceanic...   168    2e-40
ref|ZP_01003529.1|  hypothetical protein SKA53_04523 [Loktanel...   159    6e-38
ref|YP_510291.1|  hypothetical protein Jann_2349 [Jannaschia s...   150    3e-35 
ref|ZP_01448460.1|  hypothetical protein OM2255_02942 [alpha p...   142    1e-32
ref|YP_002760735.1|  hypothetical protein GAU_1223 [Gemmatimon...   135    9e-31 
ref|YP_002512580.1|  hypothetical protein Tgr7_0496 [Thioalkal...   119    1e-25 
ref|ZP_03689324.1|  conserved hypothetical protein [Thioalkali...   113    5e-24
ref|YP_314003.1|  hypothetical protein Tbd_0245 [Thiobacillus ...   105    1e-21 
ref|ZP_04774611.1|  conserved hypothetical protein [Allochroma...   104    4e-21
ref|YP_344685.1|  hypothetical protein Noc_2702 [Nitrosococcus...   102    1e-20 
ref|ZP_05048511.1|  hypothetical protein NOC27_2067 [Nitrosoco...   102    1e-20
ref|YP_903293.1|  hypothetical protein Rmag_0019 [Candidatus R...  90.1    7e-17 
ref|YP_001218881.1|  hypothetical protein COSY_0018 [Candidatu...  88.6    2e-16 
ref|ZP_05705376.1|  conserved hypothetical protein [Cardiobact...  56.6    9e-07
ref|ZP_05705375.1|  hypothetical protein HMPREF0198_1410 [Card...  37.0    0.72 
ref|ZP_01873825.1|  hypothetical protein LNTAR_10216 [Lentisph...  36.2    1.1  
ref|ZP_05540510.1|  solute-binding protein (secreted protein) ...  35.8    1.4  
ref|YP_191394.1|  hypothetical protein GOX0966 [Gluconobacter ...  35.4    2.1   
ref|YP_001520052.1|  hydroxyacylglutathione hydrolase, putativ...  34.3    4.2   
ref|YP_002822908.1|  putative transcriptional regulator, LysR ...  34.3    4.8   
ref|ZP_06238860.1|  von Willebrand factor type A [Frankia sp. ...  33.9    4.8  
ref|ZP_04546229.1|  conserved hypothetical protein [Bacteroide...  33.5    7.4  
ref|ZP_02068267.1|  hypothetical protein BACOVA_05282 [Bactero...  33.5    7.4  
ref|ZP_01962008.1|  hypothetical protein BACCAC_03654 [Bactero...  33.5    7.5  
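
The summary table above can be read programmatically. The following is a minimal sketch, assuming each hit line starts with `ref|` and ends with the bit score and the E-value as its last two whitespace-separated fields. The helper name `parse_summary` and the 1e-5 cutoff are illustrative assumptions, not settings used in the original annotation.

```python
# A few rows copied from the summary table above (description, bit score, E-value).
SUMMARY = """\
ref|ZP_01903810.1|  hypothetical protein RAZWK3B_15633 [Roseob...   194    3e-48
ref|YP_001043521.1|  hypothetical protein Rsph17029_1639 [Rhod...   189    6e-47
ref|YP_903293.1|  hypothetical protein Rmag_0019 [Candidatus R...  90.1    7e-17
ref|ZP_05705375.1|  hypothetical protein HMPREF0198_1410 [Card...  37.0    0.72
"""

def parse_summary(text):
    """Return a list of (accession, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("ref|"):
            continue  # skip headers and blank lines
        # The last two fields are the bit score and the E-value.
        description, score, evalue = line.rsplit(None, 2)
        accession = description.split("|")[1]
        hits.append((accession, float(score), float(evalue)))
    return hits

hits = parse_summary(SUMMARY)
best = min(hits, key=lambda h: h[2])           # smallest E-value
significant = [h for h in hits if h[2] <= 1e-5]  # assumed cutoff, for illustration
print(best)
print(len(significant))
```

On these sample rows the hit with the smallest E-value is ZP_01903810.1 (bit score 194, E-value 3e-48), which matches the best hit cited in the commentary.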

-----------------------------------------------------------------------------------------

>ref|YP_001043521.1|  hypothetical protein Rsph17029_1639 [Rhodobacter sphaeroides 
ATCC 17029]
 gb|ABN76749.1|  conserved hypothetical protein [Rhodobacter sphaeroides ATCC 
17029]
Length=289

 GENE ID: 4895276 Rsph17029_1639 | hypothetical protein
[Rhodobacter sphaeroides ATCC 17029]

 Score =  301 bits (771),  Expect = 3e-80, Method: Compositional matrix adjust.
 Identities = 135/232 (58%), Positives = 170/232 (73%), Gaps = 0/232 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D++K +AMK P+LRELL  DS++I+SNI+FDNLAPLS YLGKPGN  +GGL+  
Sbjct  54   MGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGKPGNPNRGGLAFA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            +Y +RQ QH  AE+ AL DTP FI ++  IYGY NFVCD+GGSICEVV+PD+PNDPV+  
Sbjct  114  DYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVVDPDNPNDPVMSD  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA N LMVWI+GS  HT EL++RFD  PKPM Y P FL   W +Y      + E VDPDD
Sbjct  174  LAGNLLMVWIKGSDAHTAELVRRFDRAPKPMYYQPGFLHAAWTDYRVEKGLTEEQVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            F+RWTY  A+AHR P Y++MA WGITV A+ +++ +    F  LI  T+ R+
Sbjct  234  FIRWTYARALAHRQPRYEAMARWGITVTAEEVAEARDAQGFVDLIARTLDRT  285


>ref|ZP_05342802.1|  conserved hypothetical protein [Thalassiobium sp. R2A62]
 gb|EET48469.1|  conserved hypothetical protein [Thalassiobium sp. R2A62]
Length=286

 Score =  300 bits (768),  Expect = 7e-80, Method: Compositional matrix adjust.
 Identities = 138/229 (60%), Positives = 176/229 (76%), Gaps = 0/229 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K +AMK P+LRE+L+ DSI+I+SNI+F+NLAPLS YLGKPG+   GGLSI+
Sbjct  54   MGELIADNAKHEAMKVPFLREMLMSDSIYIASNITFENLAPLSAYLGKPGDPELGGLSID  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ QH  AEI ALLDTP+F++++  IYGY +FVCDTGGSICEVV+P + ND VL T
Sbjct  114  EYRRRQDQHRAAEISALLDTPHFMDRAQAIYGYDHFVCDTGGSICEVVDPSNKNDEVLNT  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+ + L+VWIEGS  H  EL +RFD  PKPM Y P+FL+  W EYL  NK S  +V+PDD
Sbjct  174  LSNDLLIVWIEGSDAHAAELARRFDLAPKPMYYQPEFLEAAWAEYLNNNKVSEGDVNPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTI  229
            FVRWTY  A+AHR P+Y+ +A WG+ VQA  I+QVK+   F +LI ST+
Sbjct  234  FVRWTYSRALAHRAPLYREIAQWGVAVQAHDIAQVKSAGAFEALIASTL  282


>ref|YP_002525678.1|  hypothetical protein RSKD131_1317 [Rhodobacter sphaeroides KD131]
 gb|ACM01177.1|  Hypothetical Protein RSKD131_1317 [Rhodobacter sphaeroides KD131]
Length=289

 GENE ID: 7359055 RSKD131_1317 | hypothetical protein
[Rhodobacter sphaeroides KD131] (10 or fewer PubMed links)

 Score =  300 bits (767),  Expect = 1e-79, Method: Compositional matrix adjust.
 Identities = 134/232 (57%), Positives = 170/232 (73%), Gaps = 0/232 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D++K +AMK P+LRELL  DS++I+SNI+FDNLAPLS YLGKPGN  +GGL+  
Sbjct  54   MGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGKPGNPNRGGLAFA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            +Y +RQ QH  AE+ AL DTP FI ++  IYGY NFVCD+GGSICEVV+PD+PNDPV+  
Sbjct  114  DYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVVDPDNPNDPVMSD  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA N LMVWI+GS  HT EL++RFD  PKPM Y P FL   W +Y      + E VDPDD
Sbjct  174  LAGNLLMVWIKGSEAHTAELVRRFDRAPKPMYYQPGFLHAAWTDYRVEKGLTEEQVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            F+RWTY  A+AHR P Y++MA WGITV A+ +++ +    F  LI  T+ ++
Sbjct  234  FIRWTYARALAHRQPRYEAMARWGITVTAEEVAEARDAQGFVDLIARTLDQA  285


>ref|YP_353079.1|  hypothetical protein RSP_0009 [Rhodobacter sphaeroides 2.4.1]
 gb|ABA79178.1|  conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1]
Length=289

 GENE ID: 3720263 RSP_0009 | hypothetical protein
[Rhodobacter sphaeroides 2.4.1]

 Score =  300 bits (767),  Expect = 1e-79, Method: Compositional matrix adjust.
 Identities = 134/232 (57%), Positives = 170/232 (73%), Gaps = 0/232 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D++K +AMK P+LRELL  DS++I+SNI+FDNLAPLS YLGKPGN  +GGL+  
Sbjct  54   MGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGKPGNPNRGGLAFA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            +Y +RQ QH  AE+ AL DTP FI ++  IYGY NFVCD+GGSICEVV+PD+PNDPV+  
Sbjct  114  DYLERQGQHREAEVAALHDTPRFIRRATEIYGYQNFVCDSGGSICEVVDPDNPNDPVMSD  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA N LMVWI+GS  HT EL++RFD  PKPM Y P FL   W +Y      + E VDPDD
Sbjct  174  LAGNLLMVWIKGSDAHTAELVRRFDRAPKPMYYQPGFLHAAWTDYRVEKGLTEEQVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            F+RWTY  A+AHR P Y++MA WGITV A+ +++ +    F  LI  T+ ++
Sbjct  234  FIRWTYARALAHRQPRYEAMARWGITVTAEEVAEARDAQGFVDLIARTLDQA  285


>ref|ZP_01903810.1|  hypothetical protein RAZWK3B_15633 [Roseobacter sp. AzwK-3b]
 gb|EDM70842.1|  hypothetical protein RAZWK3B_15633 [Roseobacter sp. AzwK-3b]
Length=286

 Score =  298 bits (764),  Expect = 3e-79, Method: Compositional matrix adjust.
 Identities = 132/232 (56%), Positives = 173/232 (74%), Gaps = 0/232 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K  AM+ P+LR+LLL DSI+I SNI+FDNLAP+S YLGKPG+  KGGL I+
Sbjct  54   MGELIADNAKAHAMQVPFLRDLLLTDSIYIGSNITFDNLAPVSTYLGKPGDPAKGGLPID  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q   AEI AL+DT +FI+++ R+YGYPNFVCDTGGSICE V+ DDP DP+L  
Sbjct  114  EYRRRQQQFRTAEIAALMDTAHFIDRAERLYGYPNFVCDTGGSICEWVDGDDPADPLLTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+ + L+++IEGS  HT  LI+RFD  PKPM Y P+FLD  W +YL  NKC  + VDP  
Sbjct  174  LSRHCLLIYIEGSEAHTQSLIRRFDRAPKPMAYQPEFLDQAWVQYLNENKCEADEVDPGA  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            F+RWTY  A+AHR P Y++MA WGITV AD +++ +    F++L+ + + RS
Sbjct  234  FIRWTYARALAHRKPRYEAMAKWGITVTADEVAETRDADAFDALVATALERS  285


>ref|ZP_01753393.1|  hypothetical protein RSK20926_18517 [Roseobacter sp. SK209-2-6]
 gb|EBA17760.1|  hypothetical protein RSK20926_18517 [Roseobacter sp. SK209-2-6]
Length=295

 Score =  298 bits (762),  Expect = 4e-79, Method: Compositional matrix adjust.
 Identities = 136/230 (59%), Positives = 179/230 (77%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK+P+LR+LLL DSI+I SNISF+NL P+++YLGKPG+  KGGL IE
Sbjct  62   MGEYIADNAKAEAMKNPFLRDLLLSDSIYIGSNISFENLTPVASYLGKPGDPAKGGLPIE  121

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q  +AEI ALLDT  FI++++R+YGY NF+CDTGGSICE V+ +DPNDPVL  
Sbjct  122  EYRRRQEQFRLAEIHALLDTEYFIDRADRLYGYSNFICDTGGSICEWVDANDPNDPVLSE  181

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++ LMVWI+G   HT+EL++RFD  PKPM Y P+FL   W EYLK    S  + DPD 
Sbjct  182  LSKHCLMVWIKGDDAHTEELVRRFDRAPKPMSYQPEFLGRVWEEYLKEMSISEGDADPDA  241

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y+SMA +WG+TV AD IS V++ ++FN LI  T+
Sbjct  242  FIRWTYAQALAHRQPRYESMARNWGVTVTADQISAVQSEADFNELIARTL  291


>ref|YP_001168360.1|  hypothetical protein Rsph17025_2165 [Rhodobacter sphaeroides 
ATCC 17025]
 gb|ABP71055.1|  hypothetical protein Rsph17025_2165 [Rhodobacter sphaeroides 
ATCC 17025]
Length=289

 GENE ID: 5084180 Rsph17025_2165 | hypothetical protein
[Rhodobacter sphaeroides ATCC 17025]

 Score =  296 bits (758),  Expect = 1e-78, Method: Compositional matrix adjust.
 Identities = 134/232 (57%), Positives = 167/232 (71%), Gaps = 0/232 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D++K +AMK P+LRELL  DS++I+SNI+FDNLAPLS YLGKPGN  +GGL   
Sbjct  54   MGEHIADNFKREAMKVPFLRELLRTDSVYIASNITFDNLAPLSTYLGKPGNPNRGGLDFA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            +Y +RQ QH  AE+ AL DT  FI ++  IYGY NFVCD+GGSICEVV+PD+PNDPV+  
Sbjct  114  DYLERQGQHREAEVAALHDTQRFIRRATEIYGYQNFVCDSGGSICEVVDPDNPNDPVMSD  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA N LMVWI+GS  HT EL++RFD+ PKPM Y P FL   W +Y      + E VDPDD
Sbjct  174  LAGNLLMVWIKGSEAHTAELVRRFDKAPKPMYYQPAFLRAAWTDYRVEKGLTEEQVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            F+RWTY  A+AHR P Y +MA WGITV A+ +++      F  LI  T+ R+
Sbjct  234  FIRWTYARALAHRQPRYAAMARWGITVTAEEVAEAGNAPAFVDLIARTLDRA  285


>ref|ZP_05079542.1|  conserved hypothetical protein [Rhodobacterales bacterium Y4I]
 gb|EDZ47521.1|  conserved hypothetical protein [Rhodobacterales bacterium Y4I]
Length=287

 Score =  294 bits (753),  Expect = 4e-78, Method: Compositional matrix adjust.
 Identities = 135/230 (58%), Positives = 175/230 (76%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LRELLL DSI I SNI+F+NL P++ YLGKPG+  KGGL I 
Sbjct  54   MGEYIADNAKAEAMKVPFLRELLLSDSIFIGSNITFENLTPVAAYLGKPGDVAKGGLPIA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EYK+RQ Q  +AEI+ALLDT  F +++ ++YGYPNF+CDTGGSICE V+ DDP D ++  
Sbjct  114  EYKRRQEQFRVAEIRALLDTEYFEDRAQQLYGYPNFICDTGGSICEWVDVDDPKDQIMSE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++TLMVWI+G   HT+EL++RFD  PKPM Y P+FLD  W+EYLK N  +  +VDPD 
Sbjct  174  LSKHTLMVWIKGDEAHTEELVRRFDRAPKPMSYQPEFLDRVWQEYLKENGIAEGDVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y++MA +WGITV AD IS VK  + F  LI S +
Sbjct  234  FIRWTYAQALAHRQPRYQAMADNWGITVTADQISTVKDEAGFVDLIASAL  283


>ref|ZP_02148524.1|  hypothetical protein RG210_16770 [Phaeobacter gallaeciensis 2.10]
 gb|EDQ09862.1|  hypothetical protein RG210_16770 [Phaeobacter gallaeciensis 2.10]
Length=287

 Score =  293 bits (749),  Expect = 1e-77, Method: Compositional matrix adjust.
 Identities = 132/230 (57%), Positives = 174/230 (75%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LLL DSI+I SNISF+NL P+++YLGKPGN  KGGL+++
Sbjct  54   MGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGKPGNPAKGGLAMQ  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ Q   AE+ AL DT  FI+++ R+Y YPNF+CDTGGSICE V+ +DPNDP+L  
Sbjct  114  EYTRRQDQFRTAELNALRDTGYFIDRAARLYDYPNFICDTGGSICEWVDANDPNDPLLTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++TLMVWI+G   HT ELI+RFD  PKPM Y P FL   W+EYLK N  S  +VDPD 
Sbjct  174  LSQHTLMVWIKGDEAHTQELIRRFDRAPKPMAYEPAFLARVWQEYLKENTLSEADVDPDS  233

Query  181  FVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y +MA+ WG+TV AD IS ++T ++F  +I   +
Sbjct  234  FIRWTYAQALAHRQPRYAAMAAHWGVTVTADQISGIRTEADFKDVIAGAL  283


>ref|ZP_02143751.1|  hypothetical protein RGBS107_14411 [Phaeobacter gallaeciensis 
BS107]
 gb|EDQ14552.1|  hypothetical protein RGBS107_14411 [Phaeobacter gallaeciensis 
BS107]
Length=287

 Score =  292 bits (747),  Expect = 2e-77, Method: Compositional matrix adjust.
 Identities = 132/230 (57%), Positives = 172/230 (74%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LLL DSI+I SNISF+NL P+++YLGKPG+  KGGL+++
Sbjct  54   MGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGKPGDPAKGGLAMQ  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ Q   AE+ AL DT  FI+++ R+Y YPNF+CDTGGSICE V+ +DP DP+L  
Sbjct  114  EYTRRQDQFRTAELNALRDTGYFIDRAARLYDYPNFICDTGGSICEWVDANDPGDPLLTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA+NTLMVWI+G   HT ELI+RFD  PKPM Y P FL   W+EYLK N  S  +VDPD 
Sbjct  174  LAKNTLMVWIKGDEAHTQELIRRFDRAPKPMAYEPAFLTRVWQEYLKENTLSEADVDPDS  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y +MA  WG+TV AD IS ++T ++F  +I   +
Sbjct  234  FIRWTYAQALAHRQPRYAAMAEKWGVTVTADQISTIRTEADFTDVIAGAL  283


>ref|ZP_01880827.1|  hypothetical protein RTM1035_07318 [Roseovarius sp. TM1035]
 gb|EDM31017.1|  hypothetical protein RTM1035_07318 [Roseovarius sp. TM1035]
Length=290

 Score =  292 bits (747),  Expect = 2e-77, Method: Compositional matrix adjust.
 Identities = 132/231 (57%), Positives = 168/231 (72%), Gaps = 0/231 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K  AM+ P+LR+LLL DSI+I SNI+F NL P+S YLGKPGN  KGGL IE
Sbjct  54   MGELIADNAKAHAMQVPFLRDLLLSDSIYIGSNITFHNLTPVSTYLGKPGNPAKGGLPIE  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY+ RQ Q   AEI AL+DT  FI +++ +YGYP+F+CDTGGSICE V+ DDP D +LK 
Sbjct  114  EYRIRQEQFRQAEIHALMDTAYFIRRADALYGYPHFICDTGGSICEWVDGDDPEDTILKE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+ + L+++IEGS  HT ELI+RFD  PKPM Y P FLD  W EYLK N C  + VDPD 
Sbjct  174  LSAHCLLIYIEGSEAHTAELIRRFDRAPKPMAYQPAFLDAAWAEYLKENNCEEDEVDPDS  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISR  231
            F+RWTY  A+AHR P Y++M  WG++V A+ ++ VKT  EF +LI + + R
Sbjct  234  FIRWTYARALAHRQPRYEAMVKWGVSVTANEVATVKTAQEFEALIATALER  284


>ref|ZP_01156870.1|  hypothetical protein OG2516_03083 [Oceanicola granulosus HTCC2516]
 gb|EAR50951.1|  hypothetical protein OG2516_03083 [Oceanicola granulosus HTCC2516]
Length=287

 Score =  291 bits (746),  Expect = 3e-77, Method: Compositional matrix adjust.
 Identities = 129/231 (55%), Positives = 168/231 (72%), Gaps = 0/231 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K +AM+ P LRELL+ DSI+I SNI+FDNLAPLS YLGKPG+  +GGL   
Sbjct  54   MGELIADNAKREAMRVPLLRELLMTDSIYIGSNITFDNLAPLSTYLGKPGDPARGGLPFA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            +Y +RQA+H  AE  ALLDT +FI+++  +YGYP+FVCDTGGSICEVV+PDDP DP+L  
Sbjct  114  DYVRRQAEHRDAETAALLDTGHFIDRAGSLYGYPHFVCDTGGSICEVVDPDDPADPILTH  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+E  LMVWI GS  HTD L++RFD  PKPM YAP FL+  W  YL     +   VDPD 
Sbjct  174  LSERLLMVWIRGSEAHTDALVRRFDRAPKPMYYAPAFLEACWSAYLAETGTAPGAVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISR  231
            F+RWTY  AMAHR+P+Y  MA WG+T++A  +++V+  ++  +L+   + R
Sbjct  234  FIRWTYARAMAHREPLYAGMARWGVTLEASDVAEVRDEADITALVADALDR  284


>ref|ZP_05087921.1|  conserved hypothetical protein [Ruegeria sp. R11]
 gb|EEB69613.1|  conserved hypothetical protein [Ruegeria sp. R11]
Length=290

 Score =  291 bits (744),  Expect = 5e-77, Method: Compositional matrix adjust.
 Identities = 132/230 (57%), Positives = 172/230 (74%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LLL DSI+I SNISF+NL P+++YLGKPGN  KGGL + 
Sbjct  54   MGEYIADNAKAEAMKVPFLRDLLLSDSIYIGSNISFENLTPVASYLGKPGNPDKGGLPMA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ Q   AE+ AL DT  FI+++ R+Y YPNF+CDTGGSICE V  DDP+DP+L  
Sbjct  114  EYTRRQDQFRTAELNALRDTGYFIDRAERLYDYPNFICDTGGSICEWVEADDPSDPLLTQ  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L++ TLMVWI+G   HT ELI+RFD  PKPM Y P FL   W+EYLK N  S E VDPD 
Sbjct  174  LSQQTLMVWIKGDEAHTQELIRRFDRAPKPMAYQPAFLARVWQEYLKENALSEEQVDPDT  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y++MA +WG+TV AD I+ V++ ++F  +I + +
Sbjct  234  FIRWTYAQALAHRQPRYEAMAKNWGVTVTADQIASVRSEADFTDVIANAL  283


>ref|ZP_05124517.1|  conserved hypothetical protein [Rhodobacteraceae bacterium KLH11]
 gb|EEE39149.1|  conserved hypothetical protein [Rhodobacteraceae bacterium KLH11]
Length=290

 Score =  291 bits (744),  Expect = 5e-77, Method: Compositional matrix adjust.
 Identities = 132/230 (57%), Positives = 170/230 (73%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LRELLL DSI+I SNI+F NL P+S YLGKPGN  +GGL I 
Sbjct  54   MGEYITDNAKAEAMKVPFLRELLLTDSIYIGSNITFQNLTPVSAYLGKPGNPERGGLEIS  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY+ RQ Q   AEI ALLDT  F+ ++ R+Y YPNF+CDTGGSICE V P+DPND +L  
Sbjct  114  EYQLRQEQFRQAEIYALLDTKYFVERARRLYSYPNFICDTGGSICEWVEPEDPNDQILTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+  TLM+WI+G   HT+ELI+RFD+ PKPM Y   FL   W EYL +N CS + VDPD 
Sbjct  174  LSSQTLMIWIKGDDAHTEELIRRFDKAPKPMSYQAAFLQRVWSEYLALNTCSGDEVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY  A+AHR P Y++MA +WG+T+ AD +++V+  ++FN LI  T+
Sbjct  234  FIRWTYARALAHRQPRYEAMARNWGVTIAADDVAKVRDSADFNELIERTL  283


>ref|ZP_05787909.1|  conserved hypothetical protein [Silicibacter lacuscaerulensis 
ITI-1157]
 gb|EEX11025.1|  conserved hypothetical protein [Silicibacter lacuscaerulensis 
ITI-1157]
Length=290

 Score =  289 bits (740),  Expect = 2e-76, Method: Compositional matrix adjust.
 Identities = 132/230 (57%), Positives = 175/230 (76%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LLL DSI+I SNISF+NL P+S YLGKPGN   GGL + 
Sbjct  54   MGEYITDNAKAEAMKVPFLRDLLLTDSIYIGSNISFENLTPVSAYLGKPGNPDLGGLPMP  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q  +AEI ALLDT  FI+++ R+YGYP+F+CDTGGSICE V+PDDP D VL  
Sbjct  114  EYRRRQEQFRLAEIHALLDTEYFIDRAQRLYGYPHFICDTGGSICEWVDPDDPADRVLSE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+ +TLM+WI+G   HT ELI+RFD+ PKPM Y  +FL   W++YL  + C+  +V+PDD
Sbjct  174  LSRHTLMIWIKGDDDHTAELIRRFDKAPKPMSYQAEFLTRVWQDYLDRHGCAEADVNPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y++MA +WG+TV AD I QV+   +F+ LI  T+
Sbjct  234  FIRWTYAQALAHRQPRYRAMAENWGVTVTADQIGQVRDAGDFDELIERTL  283


>ref|ZP_05843039.1|  conserved hypothetical protein [Rhodobacter sp. SW2]
 gb|EEW25955.1|  conserved hypothetical protein [Rhodobacter sp. SW2]
Length=293

 Score =  289 bits (739),  Expect = 2e-76, Method: Compositional matrix adjust.
 Identities = 131/230 (56%), Positives = 169/230 (73%), Gaps = 0/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D++K +AM+ P+LRELLL DS++I+SNI+F+NLAPLS YLGKPGN  KGGL+  
Sbjct  54   MGEYIADNFKREAMRVPFLRELLLTDSVYIASNITFNNLAPLSTYLGKPGNPDKGGLAFA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ+QH  AEI ALLDT  FI+++  IYGY NFVCD+GGSICEVV+P++P DPV+K+
Sbjct  114  EYLERQSQHREAEIAALLDTHRFIDRARDIYGYANFVCDSGGSICEVVDPENPADPVMKS  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+EN L+VWI+GS  HT ELI+RFD  PKPM Y P+FL   W  Y      +   VDPD 
Sbjct  174  LSENLLLVWIKGSDAHTAELIRRFDRAPKPMYYQPEFLHDAWEAYRSQTGQTEAQVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTIS  230
            FVRWTY  A+AHR P Y +MA WG T+ A+ ++ +        LIG+ I+
Sbjct  234  FVRWTYARALAHRQPRYAAMARWGATITAEQVAGLTDAQGLIDLIGTAIT  283


>ref|YP_001532935.1|  hypothetical protein Dshi_1592 [Dinoroseobacter shibae DFL 12]
 gb|ABV93334.1|  conserved hypothetical protein [Dinoroseobacter shibae DFL 12]
Length=290

 GENE ID: 5712736 Dshi_1592 | hypothetical protein
[Dinoroseobacter shibae DFL 12]

 Score =  288 bits (737),  Expect = 3e-76, Method: Compositional matrix adjust.
 Identities = 127/230 (55%), Positives = 178/230 (77%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D++K +AM+ P+LR+LL+ DSI+I SNI+ +NL+P+S YLGKPGN   GG+ I+
Sbjct  54   MGEHIVDNFKREAMQVPFLRDLLMSDSIYIGSNITLENLSPVSTYLGKPGNPALGGVPID  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY+ RQAQ   AEI AL DT +FI+++  +YGYP+FVCD+GGSICE V+ +DP DP+L+ 
Sbjct  114  EYRLRQAQFRRAEIAALQDTAHFIDRAKALYGYPHFVCDSGGSICEWVDAEDPEDPLLRD  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+  TL+VWI+G + HT+ELI+RFD +PKPM Y P+FLD  W EYL  N+ + E VDPD 
Sbjct  174  LSAVTLLVWIKGDAAHTEELIRRFDRDPKPMAYQPEFLDRVWAEYLAQNRMAPEAVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y +MA +WG+TV+AD I++++  ++F +LI   +
Sbjct  234  FIRWTYAQALAHRQPRYAAMAQNWGVTVEADDIARMRDAADFEALIADAL  283


>ref|ZP_01742712.1|  hypothetical protein RB2150_03134 [Rhodobacterales bacterium 
HTCC2150]
 gb|EBA02925.1|  hypothetical protein RB2150_03134 [Rhodobacterales bacterium 
HTCC2150]
Length=288

 Score =  288 bits (736),  Expect = 4e-76, Method: Compositional matrix adjust.
 Identities = 130/232 (56%), Positives = 172/232 (74%), Gaps = 1/232 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D+ K +AMK P++  LL  DSI+I SNISF+NLAPLS YLGKPG+  KGG+  +
Sbjct  54   MGEHILDNIKREAMKEPFIANLLRTDSIYIGSNISFNNLAPLSTYLGKPGDPAKGGVPFD  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQA H  AEI +LLD+  FI+++  IYGYP+F+CDTGGSICEVV P++P+DPVL +
Sbjct  114  EYMRRQALHREAEIASLLDSRYFIDRAQAIYGYPHFICDTGGSICEVVEPNNPDDPVLSS  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++ L+VWI+G   H D L  RFD  PKPM Y+P FL  KW +YL +N  + + VDPDD
Sbjct  174  LSQDNLLVWIKGDDSHIDRLAARFDRAPKPMYYSPAFLTEKWSQYLSLNGVAEDKVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTISR  231
            FVRWTYR+AM HR+PIY +MA  WG+TV AD ++ V   ++F+ LI   + +
Sbjct  234  FVRWTYRQAMKHREPIYAAMAEKWGVTVMADDVANVSNATDFDRLIADALGK  285


>ref|ZP_05740588.1|  conserved hypothetical protein [Silicibacter sp. TrichCH4B]
 gb|EEW59884.1|  conserved hypothetical protein [Silicibacter sp. TrichCH4B]
Length=290

 Score =  286 bits (731),  Expect = 2e-75, Method: Compositional matrix adjust.
 Identities = 133/230 (57%), Positives = 172/230 (74%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LLL DSI ISSNI+F+NL+ ++ YLG PG+  KGGL IE
Sbjct  54   MGEYIADNAKAEAMKVPFLRDLLLSDSIDISSNITFENLSAVTAYLGVPGDARKGGLPIE  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY+ RQ Q   AEI ALLDT  FI++S RIYGYP+F+CD+GGSICE V+ DDPND +L  
Sbjct  114  EYRTRQDQFRRAEIDALLDTRYFIDRSERIYGYPHFICDSGGSICEWVDADDPNDKLLTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+ +TL+VWI+G+  HT ELI+RFD  PKPM Y P FL   W EYL  N  S E+VDP  
Sbjct  174  LSAHTLLVWIKGTEDHTQELIRRFDRAPKPMAYQPAFLTRVWEEYLSENNISDEDVDPHS  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y++MA +WG+TV AD ++QV++  +F  LI S +
Sbjct  234  FLRWTYAQALAHRQPRYEAMAKNWGVTVTADQVAQVRSTLDFEELIASAL  283


>ref|ZP_05062928.1|  conserved hypothetical protein [Octadecabacter antarcticus 238]
 gb|EDY88167.1|  conserved hypothetical protein [Octadecabacter antarcticus 238]
Length=287

 Score =  285 bits (730),  Expect = 2e-75, Method: Compositional matrix adjust.
 Identities = 132/231 (57%), Positives = 174/231 (75%), Gaps = 1/231 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LRELLL DSIHI SNISFDNL P+S YLGKPG++ KGGL++ 
Sbjct  54   MGEYIADNAKAEAMKVPFLRELLLSDSIHIGSNISFDNLTPVSTYLGKPGDRSKGGLALA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q   AEI AL DTP FI+++  +YGYP+FVCDTGGSICE V+ +DP DPVLK 
Sbjct  114  EYQRRQDQFRRAEIDALRDTPYFIDRAQALYGYPHFVCDTGGSICEWVDANDPADPVLKM  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++ LMVWI+GS  HT +L++RFD+ PKPM Y P+FL   W EYL  NK    +VDPD 
Sbjct  174  LSDSCLMVWIKGSDDHTSDLVQRFDKAPKPMAYEPEFLLAAWNEYLIENKFVEGDVDPDT  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTIS  230
            F+RWTY  A+AHR P Y+++A +WGI VQA  +  V+  ++F ++I   ++
Sbjct  234  FIRWTYARALAHRQPRYEAIAKNWGIFVQASDMGSVRDEADFVAVIAQGLA  284


>ref|ZP_02139047.1|  hypothetical protein RLO149_19889 [Roseobacter litoralis Och 
149]
 gb|EDQ19376.1|  hypothetical protein RLO149_19889 [Roseobacter litoralis Och 
149]
Length=282

 Score =  285 bits (729),  Expect = 3e-75, Method: Compositional matrix adjust.
 Identities = 127/229 (55%), Positives = 168/229 (73%), Gaps = 0/229 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LL+ DSI+I SNI+FDNL+P++ YLGKPGN  K GL  +
Sbjct  54   MGEYIADNAKLEAMKVPFLRDLLMSDSIYIGSNITFDNLSPVATYLGKPGNPAKSGLPFQ  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q   AEI AL DT +F  ++  +YGYPNF+CDTGGSICE V+P+D  DP++  
Sbjct  114  EYERRQDQFQRAEIAALQDTAHFAKRAEDLYGYPNFICDTGGSICEWVDPEDDADPLMTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA + LM+WI+G   HT ELI+RFD  PKPM Y P FL   WREYL    C+   VDPDD
Sbjct  174  LANHCLMIWIKGDEAHTQELIRRFDRAPKPMAYQPAFLTRVWREYLSEKNCTETEVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+R+TY +A+AHR P Y++M+ WGITV A  +S +KTP +F +L+ S +
Sbjct  234  FIRFTYAKALAHRQPRYQAMSRWGITVTASEVSALKTPDDFYALVASYL  282


>ref|ZP_01056951.1|  hypothetical protein MED193_04716 [Roseobacter sp. MED193]
 gb|EAQ45103.1|  hypothetical protein MED193_04716 [Roseobacter sp. MED193]
Length=290

 Score =  285 bits (728),  Expect = 4e-75, Method: Compositional matrix adjust.
 Identities = 129/230 (56%), Positives = 170/230 (73%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK+P+LR+LLL DSI+I SNI+F+NL P+S+YLGKPG+  KGGL I 
Sbjct  59   MGEYIADNAKAEAMKNPFLRDLLLSDSIYIGSNITFENLTPVSSYLGKPGDPAKGGLEIN  118

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q  +AE+ ALLDT  F +++ R+Y YPNF+CDTGGSICE V+ DDP D +L  
Sbjct  119  EYRRRQDQFRMAEVHALLDTQYFADRATRLYDYPNFICDTGGSICEWVDADDPKDQILTE  178

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L++  LMVWI+G   HT+ELI+RFD  PKPM Y P+FL   W +YL     S  +VDPD 
Sbjct  179  LSKQCLMVWIKGDDAHTEELIRRFDRAPKPMSYQPEFLQTCWEDYLNQMGLSERDVDPDA  238

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y +MA +WG+TV AD I  +K+  +FN LI  T+
Sbjct  239  FIRWTYSQALAHRQPRYAAMAKNWGVTVTADQIYAIKSERDFNELIARTL  288


>ref|ZP_00954804.1|  hypothetical protein EE36_14922 [Sulfitobacter sp. EE-36]
 gb|EAP84312.1|  hypothetical protein EE36_14922 [Sulfitobacter sp. EE-36]
Length=284

 Score =  283 bits (725),  Expect = 8e-75, Method: Compositional matrix adjust.
 Identities = 126/230 (54%), Positives = 171/230 (74%), Gaps = 0/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            +GE I D+ K +AMK P+LR+LLL DSI+I SNI+FDNL+P++ +LGKPG+  KGGL + 
Sbjct  54   LGETIADNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVAAWLGKPGDAAKGGLPMA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY  RQ     AEI AL DT +F  ++  +YGYP+F+CDTGGSICE V+PDD NDP+++T
Sbjct  114  EYASRQEAFKQAEIAALRDTGHFARRAQTLYGYPHFICDTGGSICEWVDPDDDNDPLMQT  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+   L VWI+G   HT ELI+RFD+ PKPM Y P+FL   W+EYL+ N C  ++VDPD 
Sbjct  174  LSAECLPVWIKGDHAHTQELIRRFDKAPKPMAYQPEFLLRVWQEYLRENNCQEDDVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTIS  230
            F+RWTY +A+AHR P Y++M+ WG+TV AD ++ +K+  +FN LI  T+S
Sbjct  234  FIRWTYAQALAHRQPRYEAMSRWGVTVTADQVAGLKSEDDFNDLIAGTLS  283


>ref|YP_613373.1|  hypothetical protein TM1040_1378 [Ruegeria sp. TM1040]
 gb|ABF64111.1|  hypothetical protein TM1040_1378 [Ruegeria sp. TM1040]
Length=269

 GENE ID: 4075871 TM1040_1378 | hypothetical protein [Silicibacter sp. TM1040]

 Score =  283 bits (724),  Expect = 9e-75, Method: Compositional matrix adjust.
 Identities = 131/230 (56%), Positives = 172/230 (74%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LLL DSI ISSNI+F+NL+ ++ YLG PG+  KGGL IE
Sbjct  33   MGEYIADNAKAEAMKVPFLRDLLLSDSIDISSNITFENLSAVTAYLGVPGDAQKGGLPIE  92

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY+ RQ Q   AEI ALLDT  FI++S+RIYGYP+F+CD+GGSICE V+ +DPNDP+L  
Sbjct  93   EYRTRQDQFRRAEIDALLDTRYFIDRSDRIYGYPHFICDSGGSICEWVDANDPNDPLLSE  152

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+ +TL+VWI+G+  HT ELI+RFD  PKPM Y P FL   W EYL  N  +  +VDP  
Sbjct  153  LSAHTLLVWIKGTEDHTQELIRRFDRAPKPMAYQPAFLTRVWEEYLNENNITDADVDPHA  212

Query  181  FVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y++MA+ WG+TV AD ++ V+  S+F  LI S +
Sbjct  213  FLRWTYAQALAHRQPRYEAMAAHWGVTVTADQVAHVQNTSDFEELIASAL  262


>ref|ZP_00962283.1|  hypothetical protein NAS141_05043 [Sulfitobacter sp. NAS-14.1]
 gb|EAP80676.1|  hypothetical protein NAS141_05043 [Sulfitobacter sp. NAS-14.1]
Length=284

 Score =  283 bits (724),  Expect = 1e-74, Method: Compositional matrix adjust.
 Identities = 126/230 (54%), Positives = 170/230 (73%), Gaps = 0/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            +GE I D+ K +AMK P+LR+LLL DSI+I SNI+FDNL+P++ +LGKPG+  KGGL + 
Sbjct  54   LGETIADNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVATWLGKPGDAAKGGLPMA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY  RQ     AEI AL DT +F  ++  +YGYP+F+CDTGGSICE V+PDD NDP+++T
Sbjct  114  EYASRQDAFKQAEIAALRDTGHFARRAQALYGYPHFICDTGGSICEWVDPDDDNDPLMQT  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+   L VWI+G   HT ELI+RFD+ PKPM Y P+FL   W+EYL  N C  ++VDPD 
Sbjct  174  LSAECLPVWIKGDDAHTQELIRRFDKAPKPMAYQPEFLLRVWQEYLSENNCQEDDVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTIS  230
            F+RWTY +A+AHR P Y++M+ WG+TV AD ++ +K+  +FN LI  T+S
Sbjct  234  FIRWTYAQALAHRQPRYEAMSRWGVTVTADQVAGLKSEDDFNDLIAGTLS  283


>ref|ZP_01033890.1|  hypothetical protein ROS217_18632 [Roseovarius sp. 217]
 gb|EAQ26571.1|  hypothetical protein ROS217_18632 [Roseovarius sp. 217]
Length=290

 Score =  281 bits (718),  Expect = 5e-74, Method: Compositional matrix adjust.
 Identities = 127/231 (54%), Positives = 166/231 (71%), Gaps = 0/231 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K  AM+ P+LR+LLL DSI+I SNI+F NL P+S YLGKPGN  KGGL IE
Sbjct  54   MGELIADNAKAHAMQVPFLRDLLLSDSIYIGSNITFHNLTPVSTYLGKPGNPVKGGLPIE  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY+ RQ Q   AEI AL+DT  FI ++  +YGYP+F+CDTGGSICE V+ ++ +DP+L  
Sbjct  114  EYRIRQEQFRQAEIHALMDTAYFIRRAGALYGYPHFICDTGGSICEWVDGNNHDDPILTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+   L+++IEGS  HT ELI+RFD  PKPM Y P FLD  W EYL+ NKC   +V+PD 
Sbjct  174  LSAQCLLIYIEGSEAHTAELIRRFDRAPKPMAYQPAFLDAAWAEYLRENKCEEGDVNPDS  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISR  231
            F+RWTY  A+AHR P Y++M  WG++V A  ++ VKT  +F +LI + + R
Sbjct  234  FIRWTYARALAHRQPRYEAMVKWGVSVSATEVAGVKTAQDFEALIATALER  284


>ref|ZP_05052605.1|  hypothetical protein OA307_3981 [Octadecabacter antarcticus 307]
 gb|EDY78871.1|  hypothetical protein OA307_3981 [Octadecabacter antarcticus 307]
Length=287

 Score =  280 bits (717),  Expect = 7e-74, Method: Compositional matrix adjust.
 Identities = 128/231 (55%), Positives = 173/231 (74%), Gaps = 1/231 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LRELLL DSIHI SNISFDNL P+S YLGKPG+  KGGL + 
Sbjct  54   MGEYIADNAKAEAMKVPFLRELLLSDSIHIGSNISFDNLTPVSTYLGKPGDPAKGGLPMA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q   AE+ AL DTP FI+++  +YGYP+FVCDTGGSICE V+  DP DPVL+ 
Sbjct  114  EYQRRQDQFRRAELDALRDTPYFIDRAQALYGYPHFVCDTGGSICEWVDAGDPADPVLRM  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++ LMVWI+GS  HT +L++RFD+ PKPM Y P+FL   W EY++ NK    +VDP+ 
Sbjct  174  LSDSCLMVWIKGSDDHTVDLVERFDKAPKPMAYEPEFLLAAWGEYMRENKYLAVDVDPNS  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTIS  230
            F+RW +  A+AHR P Y++MA +WGITVQA  ++ V+  ++F ++I   ++
Sbjct  234  FIRWIFARALAHRQPRYEAMANNWGITVQASDMASVRDEADFVAVIAQGLA  284


>ref|ZP_01012848.1|  hypothetical protein RB2654_03529 [Rhodobacterales bacterium HTCC2654]
 gb|EAQ13753.1|  hypothetical protein RB2654_03529 [Rhodobacterales bacterium HTCC2654]
Length=286

 Score =  280 bits (716),  Expect = 9e-74, Method: Compositional matrix adjust.
 Identities = 129/230 (56%), Positives = 173/230 (75%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D++K +AMK+P+LRELLL DS++I+SNI+F+NLAPLS YLGKPG+  K GL  +
Sbjct  54   MGELISDNFKSEAMKNPFLRELLLTDSVYIASNITFNNLAPLSTYLGKPGDPSKHGLPWD  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
             Y +RQA+H  AEI ALLDT  F+ ++  +YGY NFVCD+GGSICEVV+P DP DPVL  
Sbjct  114  VYMQRQAEHRDAEINALLDTVRFVRRAKSLYGYDNFVCDSGGSICEVVDPHDPTDPVLTA  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L++N LMVWIEGS  HT+EL++RF + PKPM Y P FL+  W +YL       + VDPDD
Sbjct  174  LSDNLLMVWIEGSEAHTEELVRRFIKAPKPMYYQPAFLEACWADYLAEKGLREDQVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            FVR TY +A+ HR P+Y +MA  WG+ V+AD +++V++P +F+ LI + +
Sbjct  234  FVRSTYAKALHHRHPLYAAMAEKWGVRVKADDVAKVRSPDDFDDLIAAAL  283


>ref|YP_918258.1|  hypothetical protein Pden_4498 [Paracoccus denitrificans PD1222]
 gb|ABL72562.1|  conserved hypothetical protein [Paracoccus denitrificans PD1222]
Length=300

 GENE ID: 4583048 Pden_4498 | hypothetical protein [Paracoccus denitrificans PD1222]

 Score =  279 bits (713),  Expect = 2e-73, Method: Compositional matrix adjust.
 Identities = 125/231 (54%), Positives = 167/231 (72%), Gaps = 0/231 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D++K +AMK P+LRELLL DS++I+SNI+F+NLAPLS YLGKPG+  +GGL  +
Sbjct  54   MGELIADNFKREAMKVPFLRELLLSDSVYIASNITFENLAPLSTYLGKPGSVTRGGLEFD  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ QH +AEI ALLDT +FI ++  IY   NFVCD+GGS+CEVV+P    DPVL  
Sbjct  114  EYMRRQDQHRMAEIAALLDTGHFIRRAQEIYNLGNFVCDSGGSVCEVVDPFAQTDPVLDA  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L    L+VWI+GS  HT+EL++RFD  PKPM Y P+FLD  W +Y        + V+PDD
Sbjct  174  LERELLLVWIKGSDAHTEELVRRFDRAPKPMYYQPEFLDRAWTDYRVEKGLREDQVNPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTISR  231
            F+RWTY  A+AHR P Y++MA  G+T+ A+ +++V  P++  +LIG  I R
Sbjct  234  FIRWTYARALAHRQPRYEAMARRGVTLLAEEVAEVTNPADLIALIGHAIER  284


>ref|ZP_01747128.1|  hypothetical protein SSE37_06629 [Sagittula stellata E-37]
 gb|EBA07291.1|  hypothetical protein SSE37_06629 [Sagittula stellata E-37]
Length=285

 Score =  276 bits (707),  Expect = 8e-73, Method: Compositional matrix adjust.
 Identities = 127/230 (55%), Positives = 164/230 (71%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K  AM+ P+LR+LLL DSI+I SNISF+NL P+S YLGKPG+  +GGL + 
Sbjct  54   MGEYIVDNAKRQAMQVPFLRDLLLSDSIYIGSNISFNNLTPVSTYLGKPGDPDRGGLPMH  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ Q   AE  ALLDT  FI+++  +YGYP+FVCDTGGSICE V+P D  DP+L  
Sbjct  114  EYVRRQGQFERAERLALLDTEYFIDRAQDLYGYPHFVCDTGGSICEWVDPQDAEDPILTA  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+   LMVW+ GS  HT++LI+RFD  PKPM Y PDFL   W  YL     S E +DPD 
Sbjct  174  LSSVALMVWVRGSEAHTEDLIRRFDRAPKPMAYRPDFLSSAWTRYLDATGTSEEGIDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P+Y++MAS WGITV A  +S V+T ++F  +I   +
Sbjct  234  FIRWTYAQALAHRQPLYEAMASNWGITVDASDMSGVRTAADFEDIIAKAL  283


>ref|YP_683000.1|  hypothetical protein RD1_2783 [Roseobacter denitrificans OCh 114]
 gb|ABG32314.1|  hypothetical protein RD1_2783 [Roseobacter denitrificans OCh 114]
Length=282

 GENE ID: 4195534 RD1_2783 | hypothetical protein [Roseobacter denitrificans OCh 114]

 Score =  276 bits (707),  Expect = 9e-73, Method: Compositional matrix adjust.
 Identities = 122/229 (53%), Positives = 167/229 (72%), Gaps = 0/229 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LL+ DSI+I SNI+FDNL+P++ YLGKPG+  K GL   
Sbjct  54   MGEYIADNAKLEAMKVPFLRDLLMSDSIYIGSNITFDNLSPVATYLGKPGDPAKSGLPFA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q   AEI AL DT +F  ++  +YGYPNF+CDTGGSICE ++P+D  DP+++ 
Sbjct  114  EYERRQDQFRRAEIAALRDTAHFAKRAEDLYGYPNFICDTGGSICEWIDPEDDADPLMQE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA++ LM+WI+G   HT ELI+RFD  PKPM Y P FL   W +YL    C+   VDPDD
Sbjct  174  LAQHCLMIWIKGDEAHTQELIRRFDRAPKPMAYQPAFLTRVWHDYLNEKNCTETEVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+R+TY +A+AHR P Y++M+ WGITV A  +S +KT  +F +L+ S +
Sbjct  234  FIRFTYAKALAHRQPRYRAMSRWGITVTASQVSALKTHDDFYALVASCL  282


>ref|ZP_01445371.1|  hypothetical protein R2601_25616 [Roseovarius sp. HTCC2601]
 gb|EAU44391.1|  hypothetical protein R2601_25616 [Roseovarius sp. HTCC2601]
Length=285

 Score =  275 bits (704),  Expect = 2e-72, Method: Compositional matrix adjust.
 Identities = 125/230 (54%), Positives = 167/230 (72%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LRELL+ DSI+I SNI+FDNL P+S YLGKPG+  KGGL +E
Sbjct  54   MGEYIADNAKREAMKVPFLRELLMSDSIYIGSNITFDNLTPVSTYLGKPGDTAKGGLPME  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
             Y+ RQ Q   AE +ALLDT  FI+++  +YGYP+FVCDTGGSICE V+ +DP DP+L  
Sbjct  114  AYQLRQDQFRRAEEQALLDTGYFIDRAEELYGYPHFVCDTGGSICEWVDAEDPRDPILSE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+ + LMVWI G+  HT+ELI+RFD  PKPM Y  +FL   WR YL     S  +VDPD 
Sbjct  174  LSSHALMVWIRGTEAHTEELIQRFDRAPKPMAYQSEFLAAAWRAYLDETGLSETSVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY  A+AHR P Y++MA +WG+TV+A  ++ V+  ++F  +I + +
Sbjct  234  FIRWTYARALAHRQPRYEAMAKNWGVTVEAADMATVRDEADFAQVIAAAL  283


>ref|ZP_05101760.1|  conserved hypothetical protein [Roseobacter sp. GAI101]
 gb|EEB86062.1|  conserved hypothetical protein [Roseobacter sp. GAI101]
Length=284

 Score =  275 bits (702),  Expect = 3e-72, Method: Compositional matrix adjust.
 Identities = 119/230 (51%), Positives = 168/230 (73%), Gaps = 0/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            +GE I D+ K +AMK P+LR+LLL DSI+I SNI+FDNL+P++ +LGKPG+  KGGL + 
Sbjct  54   LGETITDNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVATWLGKPGDPAKGGLPMA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY  RQ     AEI ALLDT +F  ++  +Y YP+F+CDTGGSICE V+P+D +DP+++ 
Sbjct  114  EYAARQDAFKQAEIAALLDTAHFARRAKALYDYPHFICDTGGSICEWVDPEDDHDPLIQA  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+E  L++WI+G + HT ELI+RFD+ PKPM Y P+FL   W+EYL  N C   +VDPD 
Sbjct  174  LSEECLLIWIKGDAAHTQELIRRFDKAPKPMAYQPEFLARVWQEYLSENNCEEADVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTIS  230
            F+RWTY +A+AHR P Y++MA WG+T+ AD +  + + ++F  LI   ++
Sbjct  234  FIRWTYAQALAHRQPRYEAMARWGVTITADQVGALTSEADFVDLIADVMT  283


>ref|ZP_00960867.1|  hypothetical protein ISM_16270 [Roseovarius nubinhibens ISM]
 gb|EAP76438.1|  hypothetical protein ISM_16270 [Roseovarius nubinhibens ISM]
Length=286

 Score =  273 bits (697),  Expect = 1e-71, Method: Compositional matrix adjust.
 Identities = 119/229 (51%), Positives = 167/229 (72%), Gaps = 0/229 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K  AM+ P+LR+LLL DSI+I SNI+F+NLAP+S YLGKPG+  +GGL   
Sbjct  54   MGELIADNAKAHAMQVPFLRDLLLSDSIYIGSNITFNNLAPVSTYLGKPGDPTRGGLPYA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ Q   AEI AL+DT +FI+++  +YGYP+F+CDTGGSICE  + DDPNDP+L  
Sbjct  114  EYTRRQEQFRHAEISALMDTSHFISRAQSLYGYPHFICDTGGSICEWADGDDPNDPLLTE  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++ L+++++GS  HT+ LI+RFD  PKPM Y P+FL   W +YL     + + VDPD 
Sbjct  174  LSQHCLLIYLQGSDAHTEALIRRFDRAPKPMAYQPEFLAEAWTQYLAEQNVTADKVDPDS  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY  A+AHR P Y++MA WGIT+ A  +S V++  +F++L+   +
Sbjct  234  FIRWTYARALAHRKPRYEAMAKWGITLPASEVSDVRSAEDFDALVARAL  282


>ref|YP_167343.1|  hypothetical protein SPO2113 [Ruegeria pomeroyi DSS-3]
 gb|AAV95384.1|  hypothetical protein SPO2113 [Ruegeria pomeroyi DSS-3]
Length=307

 GENE ID: 3192971 SPO2113 | hypothetical protein [Silicibacter pomeroyi DSS-3]

 Score =  272 bits (695),  Expect = 2e-71, Method: Compositional matrix adjust.
 Identities = 134/230 (58%), Positives = 176/230 (76%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K +AMK P+LR+LL+ DSI+I SNISF+NL P+S YLGKPG  GKGGL I 
Sbjct  73   MGEFITDNAKAEAMKVPFLRDLLMSDSIYIGSNISFENLTPVSTYLGKPGAPGKGGLPIA  132

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY++RQ Q   AEI ALLDT  FI+++ R+YGYP+F+CDTGGSICE V+P+DP D VL  
Sbjct  133  EYRRRQDQFRHAEIYALLDTEYFIDRAQRLYGYPHFICDTGGSICEWVDPEDPRDQVLSE  192

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+++TLM+WIEGS  HT ELI+RFD+ PKPM Y P+FL   W  YL  N C+ + VDPD 
Sbjct  193  LSKHTLMIWIEGSEAHTAELIRRFDKAPKPMSYQPEFLSRVWEAYLAENDCTPDEVDPDA  252

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY +A+AHR P Y++MA +WGITV A+ I  ++  ++F++LI S++
Sbjct  253  FIRWTYAQALAHRQPRYRAMARNWGITVTAEQIGAIRDAADFDALIESSL  302


>ref|ZP_05782011.1|  conserved hypothetical protein [Citreicella sp. SE45]
 gb|EEX15775.1|  conserved hypothetical protein [Citreicella sp. SE45]
Length=285

 Score =  270 bits (689),  Expect = 1e-70, Method: Compositional matrix adjust.
 Identities = 125/230 (54%), Positives = 166/230 (72%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AM+ P+LRELLL DSI+I SNI+F NL+P+S YLGKPG+  KGGL +E
Sbjct  54   MGEYIVDNAKREAMRVPFLRELLLSDSIYIGSNITFHNLSPVSTYLGKPGDPDKGGLPME  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
             Y+ RQ Q   AE +ALLDT  FI+++ ++YGYP+FVCDTGGSICE V+P++  DP+L  
Sbjct  114  IYRHRQDQFQRAEEQALLDTGYFIDRAEQLYGYPHFVCDTGGSICEWVDPENEADPILSA  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L+   LMVWI GS  HT ELI+RFD  PKPM Y P FL   W  YL     S   VDPD 
Sbjct  174  LSGQALMVWIRGSEDHTAELIRRFDRAPKPMAYQPAFLTACWDAYLDETGLSPARVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RWTY  A+AHR P+Y++MA +WG+TV A  ++ V+  ++FN++I + +
Sbjct  234  FIRWTYARALAHRQPLYEAMARNWGVTVDASDVATVRDEADFNAVIAAAL  283


>ref|ZP_02154523.1|  hypothetical protein OIHEL45_12425 [Oceanibulbus indolifex HEL-45]
 gb|EDQ04141.1|  hypothetical protein OIHEL45_12425 [Oceanibulbus indolifex HEL-45]
Length=286

 Score =  268 bits (686),  Expect = 3e-70, Method: Compositional matrix adjust.
 Identities = 117/226 (51%), Positives = 164/226 (72%), Gaps = 0/226 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            +GE I D+ K +AMK P+LR+LLL DSI+I SNI+FDNL+P++ +LGKPG+  +GGL + 
Sbjct  54   LGELIADNAKAEAMKVPFLRDLLLSDSIYIGSNITFDNLSPVATWLGKPGDPARGGLPMA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            +Y  RQ     AEI AL DT  F N++ ++YGYP+F+CDTGGSICE V+P+DPNDP+L+ 
Sbjct  114  DYATRQEAFRRAEISALKDTAYFANRAEQLYGYPHFICDTGGSICEWVDPEDPNDPLLQQ  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA+  L+VWI+G   HT+ELI+RFD+ PKPM Y P+FL   W +YL     + ++V+PD+
Sbjct  174  LADECLLVWIKGDEAHTEELIRRFDKAPKPMAYQPEFLSRAWADYLAEKNITEDSVNPDE  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLIG  226
            F+RWTY  A++HR P Y +M+ WG+T+    I  +K P +F  LI 
Sbjct  234  FIRWTYAAALSHRQPRYAAMSKWGVTITPADIDALKAPEDFTDLIA  279


>ref|ZP_00999087.1|  hypothetical protein OB2597_01917 [Oceanicola batsensis HTCC2597]
 gb|EAQ03337.1|  hypothetical protein OB2597_01917 [Oceanicola batsensis HTCC2597]
Length=285

 Score =  267 bits (683),  Expect = 5e-70, Method: Compositional matrix adjust.
 Identities = 124/225 (55%), Positives = 161/225 (71%), Gaps = 0/225 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K  AM+ P+LR+LL  DSI+I SNI+F+NLAPLS YLGKPG+  +GGL  +
Sbjct  54   MGELIADNAKAHAMQVPFLRDLLKSDSIYIGSNITFNNLAPLSTYLGKPGDPTRGGLPFD  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
             Y++RQ QH +AEI ALLDT  F  ++ R+YGYP+FVCD+GGSICEVV+P D  DPVL T
Sbjct  114  LYRERQEQHRMAEISALLDTERFAERAVRLYGYPHFVCDSGGSICEVVDPWDDEDPVLDT  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L +  L+VWI+G + H  EL++RFD  PKPM Y P+FL   W +Y      + + VDPD 
Sbjct  174  LTDMALLVWIQGDAAHRQELVRRFDRAPKPMYYQPEFLRQAWTDYRDETGEAEDAVDPDA  233

Query  181  FVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEFNSLI  225
            FVRWTY  A+ HR P Y++MA  GITV AD ++QV   + F++LI
Sbjct  234  FVRWTYARALDHRQPRYEAMARRGITVTADQVAQVTDAASFDALI  278


>ref|ZP_01752182.1|  hypothetical protein RCCS2_00167 [Roseobacter sp. CCS2]
 gb|EBA10849.1|  hypothetical protein RCCS2_00167 [Roseobacter sp. CCS2]
Length=287

 Score =  264 bits (674),  Expect = 6e-69, Method: Compositional matrix adjust.
 Identities = 126/230 (54%), Positives = 161/230 (70%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE I D+ K +AMK P+LR+LL  DSI+I SNISF++L P+S+YLGKPG+   GGL  +
Sbjct  54   MGELIADNAKREAMKVPFLRDLLRSDSIYIGSNISFNDLRPMSSYLGKPGDPHLGGLPWD  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ Q   AEI AL DT +FI +S+R+Y YP+FVCDTGGSICE V+ DDP+DP+L  
Sbjct  114  EYTRRQDQFQQAEIAALHDTSHFIERSSRLYQYPHFVCDTGGSICEWVDADDPHDPILTD  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA   L+VWIEG+  HT ELI+RFD+ PKPM Y P FL   W  YL     + + V+PDD
Sbjct  174  LASKNLLVWIEGTDAHTAELIRRFDKEPKPMSYDPAFLLATWHAYLAEFSVAPDKVNPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTI  229
            F+RW + +A+AHR P YK+MA  WGITVQ   + QVK    F  +I   +
Sbjct  234  FIRWAFAKALAHRQPRYKAMADKWGITVQQSAVEQVKDTEGFVGMIADAL  283


>ref|ZP_05074698.1|  conserved hypothetical protein [Rhodobacterales bacterium HTCC2083]
 gb|EDZ42358.1|  conserved hypothetical protein [Rhodobacterales bacterium HTCC2083]
Length=288

 Score =  255 bits (651),  Expect = 3e-66, Method: Compositional matrix adjust.
 Identities = 121/231 (52%), Positives = 158/231 (68%), Gaps = 1/231 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D+ K +AMK+P+LR++L+ D IH   N+   +L P++ YLGKPGN  +GGL + 
Sbjct  54   MGEHIIDNLKREAMKTPFLRKMLMQDLIHFGVNVHDHDLRPVAAYLGKPGNPDQGGLPLA  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY+ RQ Q   AEI AL DT +FI +   IYGY NF+CDTGGSICE V+P DPNDP+L  
Sbjct  114  EYQHRQDQFRKAEIAALNDTAHFIERGQSIYGYDNFICDTGGSICEWVDPTDPNDPLLSK  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LAE+ LMVWI+G   HTD L+ RFD+ PKPM Y P+FL   W EYL  N     +VDPD+
Sbjct  174  LAEDCLMVWIKGGEAHTDSLVARFDKEPKPMSYQPEFLQRIWGEYLNDNNIKEGDVDPDE  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTIS  230
            FVRWTY +A+AHR P Y +MA ++GITV A     + + + F  +I   ++
Sbjct  234  FVRWTYAQALAHRQPRYANMAQNFGITVDACAWETITSEATFVDVIADALA  284


>ref|ZP_01003529.1|  hypothetical protein SKA53_04523 [Loktanella vestfoldensis SKA53]
 gb|EAQ06322.1|  hypothetical protein SKA53_04523 [Loktanella vestfoldensis SKA53]
Length=287

 Score =  255 bits (651),  Expect = 3e-66, Method: Compositional matrix adjust.
 Identities = 127/232 (54%), Positives = 165/232 (71%), Gaps = 1/232 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGE+I D+ K +AMK P+LR+LL  DSI+I SNISF++L P+S+YLGKPG+  KGGL  +
Sbjct  54   MGEYIADNAKREAMKVPFLRDLLRSDSIYIGSNISFNDLRPMSSYLGKPGDPAKGGLPWD  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ Q   AE+ AL DT +FI +S+ +Y YP+FVCDTGGSICE V+  DPNDP++  
Sbjct  114  EYTRRQDQFQRAEVAALHDTGHFITRSHDLYQYPHFVCDTGGSICEWVDVTDPNDPIMTD  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA  TLMVWIEG+  HT ELI+RFD  PKPM Y P FL   W  YL     + ++VDPDD
Sbjct  174  LASKTLMVWIEGTDAHTAELIRRFDREPKPMSYDPVFLLATWTAYLAEFNAAPDDVDPDD  233

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTISR  231
            F+RW + +A+AHR P YK+MA  WGITV  D ++QVK  + F  +I   + +
Sbjct  234  FIRWAFAKALAHRQPRYKAMADRWGITVAQDDVAQVKDAAGFIDMIAQALEK  285


>ref|YP_510291.1|  hypothetical protein Jann_2349 [Jannaschia sp. CCS1]
 gb|ABD55266.1|  hypothetical protein Jann_2349 [Jannaschia sp. CCS1]
Length=301

 GENE ID: 3934805 Jann_2349 | hypothetical protein [Jannaschia sp. CCS1]

 Score =  248 bits (632),  Expect = 4e-64, Method: Compositional matrix adjust.
 Identities = 117/233 (50%), Positives = 157/233 (67%), Gaps = 1/233 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D+ K  AM+ P+L +L   DSI+I SNI+F NLAPLS YLGKPG+  KGGL   
Sbjct  68   MGEHITDNLKAQAMQVPFLADLFRSDSIYIGSNITFGNLAPLSTYLGKPGDPSKGGLPFA  127

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQA H  AE+ ALLDT  FI+++  +YGY NF+CDTGGSICEVV+P D +D VL+T
Sbjct  128  EYTRRQALHRRAEVNALLDTVPFIHRATSLYGYGNFICDTGGSICEVVDPADADDEVLRT  187

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            LA+NTLM+WIE    H  ELI+RF  NPKP+ Y P+ L+  W+ YL  +  S E VDPD 
Sbjct  188  LAQNTLMIWIESPEGHDAELIRRFKLNPKPIYYRPEMLETLWQGYLDDHGVSEEAVDPDA  247

Query  181  FVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            F    +   + +R P+Y +MA  WG++V A  +  V+  ++   L+   ++R+
Sbjct  248  FAVHAFANVIHNRAPLYAAMAQGWGVSVTASEVEAVRDAADAVDLVADALART  300


>ref|YP_002760735.1|  hypothetical protein GAU_1223 [Gemmatimonas aurantiaca T-27]
 dbj|BAH38265.1|  hypothetical protein [Gemmatimonas aurantiaca T-27]
Length=313

 GENE ID: 7707472 GAU_1223 | hypothetical protein [Gemmatimonas aurantiaca T-27]

 Score =  246 bits (629),  Expect = 1e-63, Method: Compositional matrix adjust.
 Identities = 114/235 (48%), Positives = 169/235 (71%), Gaps = 4/235 (1%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D++K +AMK P+LR+LL  DSI+I SNISF+NL+PLS YLGKPGN+ +GG+   
Sbjct  72   MGEHIVDNFKREAMKVPFLRQLLRSDSIYIRSNISFENLSPLSTYLGKPGNEARGGIPYA  131

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQAQH  AEI ALLD P F+ ++  IYGY +FV D+GGS+CEVV+ +DP+D VL+ 
Sbjct  132  EYCRRQAQHRQAEISALLDIPEFVERAREIYGYTHFVSDSGGSLCEVVDVNDPHDAVLQC  191

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVEN--VDP  178
            L+++T++V+I G++ H   L++RF ++PKPM Y P FLD KWREY K  +   ++  VDP
Sbjct  192  LSQHTMLVYIRGTADHARMLVERFRKHPKPMYYNPGFLDAKWREY-KTERGIGDDAMVDP  250

Query  179  DDFVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTISRS  232
            DDF  W + + + HR P+Y+++A  +G  ++ + +  ++T ++  + +  TI  S
Sbjct  251  DDFAVWGFEQLLDHRTPLYEAIAERYGYVIEMEDVPLLRTEADVLAHLARTIDAS  305


>ref|ZP_01448460.1|  hypothetical protein OM2255_02942 [alpha proteobacterium HTCC2255]
 gb|EAU51188.1|  hypothetical protein OM2255_02942 [alpha proteobacterium HTCC2255]
Length=285

 Score =  233 bits (595),  Expect = 9e-60, Method: Compositional matrix adjust.
 Identities = 110/230 (47%), Positives = 160/230 (69%), Gaps = 1/230 (0%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            MGEHI D++K++AM +P+L  LL  D+I+IS+N+ F +L+PLS+YLGKPG+  KGG+  +
Sbjct  54   MGEHIADNFKKEAMSNPFLANLLQTDAIYISANMKFSDLSPLSSYLGKPGDPSKGGIPFK  113

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            EY +RQ  H  AE  A+LDT +FI ++  +Y Y  FVCDT GSI E+V+ +D ND V+KT
Sbjct  114  EYLRRQRLHRDAETNAMLDTVHFIQRAKSLYDYDKFVCDTSGSIVEIVDCNDKNDNVMKT  173

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCSVENVDPDD  180
            L++  L VWIEG+  HT+EL+KRF ++PKPM Y   FL   W  + K    S   VDP++
Sbjct  174  LSQYVLPVWIEGTEEHTEELVKRFIKSPKPMYYNEKFLIECWDNFSKEKNISESKVDPNE  233

Query  181  FVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTI  229
            F+ W YR+ + +R P Y+ +A  WGIT++A  +++VK+  EF SLI + +
Sbjct  234  FIIWGYRKLLENRSPRYRKIAKDWGITLRASEVAKVKSADEFTSLISANL  283


>ref|YP_002512580.1|  hypothetical protein Tgr7_0496 [Thioalkalivibrio sp. HL-EbGR7]
 gb|ACL71593.1|  conserved hypothetical protein [Thioalkalivibrio sp. HL-EbGR7]
Length=279

 GENE ID: 7317680 Tgr7_0496 | hypothetical protein [Thioalkalivibrio sp. HL-EbGR7]

 Score =  203 bits (517),  Expect = 9e-51, Method: Compositional matrix adjust.
 Identities = 103/233 (44%), Positives = 152/233 (65%), Gaps = 7/233 (3%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + E I D+ K  AM+ P+LRELL  DSI+I +NI+ DNL P+S++LGK G+  +GGL + 
Sbjct  52   LDEPILDNIKHQAMQVPFLRELLRSDSIYILNNITVDNLQPVSSFLGKLGDPERGGLPLT  111

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            E+K+RQA HH AEI A+ D P FI K+  IYGY +FV D GGS+CE+   DDP    L+ 
Sbjct  112  EFKRRQALHHTAEIAAMRDVPEFIGKARSIYGYEHFVNDAGGSVCEL---DDPE--TLEL  166

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCS-VENVDPD  179
            LAE+TL+++I+ +    +ELI+R +  PKP+ Y   FLD + REY + +    V  +DPD
Sbjct  167  LAEHTLLIYIQATRQDEEELIRRAESAPKPLYYREAFLDEQLREYKQAHDIEYVAQIDPD  226

Query  180  DFVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTISR  231
            DFVRW +      R P Y+++A  +G T+  + + +VK+  +F +LI + + R
Sbjct  227  DFVRWMFPRLFRSRIPRYEAIARDYGYTITTEELGRVKSEQDFLALIETALER  279


>ref|ZP_03689324.1|  conserved hypothetical protein [Thioalkalivibrio sp. K90mix]
 gb|EEG17163.1|  conserved hypothetical protein [Thioalkalivibrio sp. K90mix]
Length=283

 Score =  194 bits (494),  Expect = 5e-48, Method: Compositional matrix adjust.
 Identities = 97/233 (41%), Positives = 150/233 (64%), Gaps = 7/233 (3%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + E I D+ K  AM  P+LR+LL  DSI I +NI+ DNL P++++LGK GN   GGL + 
Sbjct  52   LSEPILDNIKAQAMAVPFLRDLLRSDSIQIINNITVDNLHPVASFLGKLGNPELGGLPLT  111

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            E+K+RQ  HH AE++A+LD P+FI+KS  ++GYP+FV D GGS+CE+ +P      VL+T
Sbjct  112  EFKRRQNLHHEAEVQAMLDVPDFIHKSQTLFGYPHFVNDAGGSVCELDSPG-----VLET  166

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCS-VENVDPD  179
            LAE+TL+++I+ +     +LI+R + +PKP+ Y   FLD +  EY++      V  +DPD
Sbjct  167  LAEHTLILYIKATDEDERQLIERAEHDPKPLYYREAFLDQQLAEYMRDEGLDYVAQIDPD  226

Query  180  DFVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTISR  231
             FVRW +      R P Y+++A + G T+    ++QV+   +F+ L+   + R
Sbjct  227  AFVRWVFSRLFRARLPRYEAIAEAHGYTISTTELAQVRNAQDFDDLVCMALER  279


>ref|ZP_05048511.1|  hypothetical protein NOC27_2067 [Nitrosococcus oceani AFC27]
 gb|EDZ65387.1|  hypothetical protein NOC27_2067 [Nitrosococcus oceani AFC27]
Length=297

 Score =  179 bits (454),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 97/236 (41%), Positives = 149/236 (63%), Gaps = 11/236 (4%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + E I D+ K+  M  P+LR+LL  DSI+I +NI+ DNL P+S++LGK GN  +GGL + 
Sbjct  68   LDEAILDNIKQKVMGIPFLRDLLYSDSIYIRNNINVDNLKPVSSFLGKLGNPEQGGLGLT  127

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            E+K+RQ  H  AEI A+ D P FI ++ +IY YP+FV D GGSI E+   DDP   VL+T
Sbjct  128  EFKRRQELHRQAEIAAMKDVPVFIRRAQQIYDYPHFVNDAGGSISEL---DDPE--VLQT  182

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLK---WREYLKINKCSVENVD  177
            L ++TL+++I+ +     +LI+R +E PKP+ Y   FLD +   +RE  K +   V  ++
Sbjct  183  LDQHTLILYIKATQRDERKLIQRAEEEPKPLYYRESFLDEQLAIYRE--KHDLPYVALIE  240

Query  178  PDDFVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            PD+FVRW +      R P Y+++A  +G TV  D ++QV   + F  L+ + +++ 
Sbjct  241  PDEFVRWIFPRLFYSRLPRYQAIADEYGYTVSTDQLAQVNDETGFLELLETVVAKQ  296


>ref|YP_344685.1|  hypothetical protein Noc_2702 [Nitrosococcus oceani ATCC 19707]
 gb|ABA59155.1|  conserved hypothetical protein [Nitrosococcus oceani ATCC 19707]
Length=281

 GENE ID: 3704459 Noc_2702 | hypothetical protein [Nitrosococcus oceani ATCC 19707]

 Score =  179 bits (454),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 97/236 (41%), Positives = 149/236 (63%), Gaps = 11/236 (4%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + E I D+ K+  M  P+LR+LL  DSI+I +NI+ DNL P+S++LGK GN  +GGL + 
Sbjct  52   LDEAILDNIKQKVMGIPFLRDLLYSDSIYIRNNINVDNLKPVSSFLGKLGNPEQGGLGLT  111

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            E+K+RQ  H  AEI A+ D P FI ++ +IY YP+FV D GGSI E+   DDP   VL+T
Sbjct  112  EFKRRQELHRQAEIAAMKDVPVFIRRAQQIYDYPHFVNDAGGSISEL---DDPE--VLQT  166

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLK---WREYLKINKCSVENVD  177
            L ++TL+++I+ +     +LI+R +E PKP+ Y   FLD +   +RE  K +   V  ++
Sbjct  167  LDQHTLILYIKATQRDERKLIQRAEEEPKPLYYRESFLDEQLAIYRE--KHDLPYVALIE  224

Query  178  PDDFVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLIGSTISRS  232
            PD+FVRW +      R P Y+++A  +G TV  D ++QV   + F  L+ + +++ 
Sbjct  225  PDEFVRWIFPRLFYSRLPRYQAIADEYGYTVSTDQLAQVNDETGFLELLETVVAKQ  280


>ref|ZP_04774611.1|  conserved hypothetical protein [Allochromatium vinosum DSM 180]
 gb|EER65416.1|  conserved hypothetical protein [Allochromatium vinosum DSM 180]
Length=281

 Score =  168 bits (425),  Expect = 4e-40, Method: Compositional matrix adjust.
 Identities = 86/234 (36%), Positives = 142/234 (60%), Gaps = 7/234 (2%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + E I D  K  AM+ P+LR+LL  D I I + I  ++L P+ +++GK GN  +GGL ++
Sbjct  52   LDEPILDLIKSRAMRDPFLRDLLRRDWITIRNVIKINDLGPVLSFIGKLGNPEQGGLPLD  111

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            E+ +RQA +  AEI A+ D P F+ K+  +YGY + V D GGS+CE+  P      V+  
Sbjct  112  EFSRRQALYREAEIAAMFDLPEFVRKAQEVYGYAHLVNDVGGSLCELEEPR-----VIDM  166

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKCS-VENVDPD  179
            LA ++L+++I        +LI+R   +PKP+ Y P+FL    ++YL++ +   V  ++PD
Sbjct  167  LARHSLILYIRVPEADEIKLIERAQADPKPLYYRPEFLQTAVKDYLELRQLDYVAQIEPD  226

Query  180  DFVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTISRS  232
            DF RW +      R P Y+++A   G TV ++ ++QV+  ++F +L+ S I+RS
Sbjct  227  DFTRWVFPRLFHSRVPRYEAIAQPHGYTVSSEDVAQVRDEADFLALVESAIARS  280


>ref|YP_314003.1|  hypothetical protein Tbd_0245 [Thiobacillus denitrificans ATCC 25259]
 gb|AAZ96198.1|  conserved hypothetical protein [Thiobacillus denitrificans ATCC 25259]
Length=283

 GENE ID: 3671387 Tbd_0245 | hypothetical protein
[Thiobacillus denitrificans ATCC 25259] (10 or fewer PubMed links)

 Score =  166 bits (421),  Expect = 1e-39, Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 143/234 (61%), Gaps = 8/234 (3%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + E I D  K+ AM+ P+LR+LL  D I I +NI   +L P+  ++GK G    GGL ++
Sbjct  52   LDEPILDLIKQQAMQVPFLRDLLRRDWIDIKNNIKIHDLGPVLTFVGKLGGPEWGGLPLD  111

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
            E+ +RQA +  AEI A+ D P FI K   IYGYP+FV D GGS+CE+   D+P   V++ 
Sbjct  112  EFTRRQAAYREAEIAAMRDVPGFIRKGQEIYGYPHFVNDVGGSLCEL---DEPG--VIEL  166

Query  121  LAENTLMVWIEGSSHHTDE-LIKRFDENPKPMCYAPDFLDLKWREYLKINKCS-VENVDP  178
            LAE+TL+++I+ +S   +E LI+R   +PKP+ + P FL+     YL+    + V  ++P
Sbjct  167  LAEHTLILYIQTTSREEEETLIRRAQSDPKPLYFRPAFLEENLPRYLEEKGLAYVAQIEP  226

Query  179  DDFVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFNSLIGSTISR  231
            +DF RW +      R P Y+++A   G TV ++  +QV+   +F +L+ + I+R
Sbjct  227  NDFARWIFPRLFHSRIPRYEAIAGPHGYTVTSEEAAQVRDERDFLALLEAAIAR  280


>ref|YP_001218881.1|  hypothetical protein COSY_0018 [Candidatus Vesicomyosocius okutanii HA]
 dbj|BAF61157.1|  conserved hypothetical protein [Candidatus Vesicomyosocius okutanii HA]
Length=278

 GENE ID: 5172183 COSY_0018 | hypothetical protein
[Candidatus Vesicomyosocius okutanii HA] (10 or fewer PubMed links)

 Score =  146 bits (369),  Expect = 2e-33, Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 135/227 (59%), Gaps = 8/227 (3%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + + I  + K    +  +L++LL  +SI I ++I+FDNL+ +S +LGK GN   GG SI+
Sbjct  53   LNDKILGNIKNHVKQDKWLKDLLDKESISIQNHITFDNLSSVSTFLGKTGNPELGGTSID  112

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
             +  RQ  H  AEIK +LD P FI K+ +  G+ +F+ D GGS+CE+      N+ V +T
Sbjct  113  TFIARQIMHLNAEIKTMLDVPQFIKKA-KAQGFNHFINDAGGSLCEL-----DNEQVYQT  166

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKC-SVENVDPD  179
            L++NT++++I  S  +   LIKR   +PKP+ Y  +FL  +   YL+ N+   +  ++P+
Sbjct  167  LSDNTVILYIRASKTNEIALIKRAQTHPKPLYYQANFLKHELDIYLQENQLIYIAQINPN  226

Query  180  DFVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLI  225
             FVRW +   + HR P Y+++A  +G  +  + + Q ++ +E   LI
Sbjct  227  KFVRWVFPRLLEHRKPKYEAIAKKYGYIIDNEDLYQCRSANEIFELI  273


>ref|YP_903293.1|  hypothetical protein Rmag_0019 [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)]
 gb|ABL01822.1|  conserved hypothetical protein [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)]
Length=278

 GENE ID: 4555345 Rmag_0019 | hypothetical protein
[Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)]
(10 or fewer PubMed links)

 Score =  146 bits (369),  Expect = 2e-33, Method: Compositional matrix adjust.
 Identities = 81/227 (35%), Positives = 133/227 (58%), Gaps = 8/227 (3%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + + I D+ K    +  +L++LL  +SI I ++I+FDNL+ +S +LGK GN   GG  I 
Sbjct  53   LNDKILDNIKNYIRQDKWLKDLLDNESISIQNHITFDNLSSVSAFLGKAGNPELGGTPIG  112

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
             +  RQ  H  AE KA+LD P FI K+ +  G+ +F+ D GGS+CE+      N+ V +T
Sbjct  113  TFIARQTMHLNAETKAMLDVPQFIQKA-KTQGFNHFINDAGGSLCEL-----DNEQVYQT  166

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAPDFLDLKWREYLKINKC-SVENVDPD  179
            LA NT++++I  S  +   L++    +PKP+ Y  +FL  +   YL+ N+   V  +DP+
Sbjct  167  LANNTVILYIRASKTNETALVECAQTHPKPLYYQANFLKQQLSIYLQKNQLIYVAQIDPN  226

Query  180  DFVRWTYREAMAHRDPIYKSMA-SWGITVQADLISQVKTPSEFNSLI  225
            +FVRW +   + HR P Y+++A  +G T+ ++ + + K   E   LI
Sbjct  227  EFVRWVFPRLLEHRKPKYEAIAKKYGYTIDSEGLYRCKNADEVFELI  273


>ref|ZP_05705376.1|  conserved hypothetical protein [Cardiobacterium hominis ATCC 15826]
 gb|EEV88472.1|  conserved hypothetical protein [Cardiobacterium hominis ATCC 15826]
Length=194

 Score = 88.6 bits (218),  Expect = 5e-16, Method: Compositional matrix adjust.
 Identities = 54/155 (34%), Positives = 79/155 (50%), Gaps = 8/155 (5%)

Query  1    MGEHIEDSYKEDAMKSPYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIE  60
            + E I D     A + P L  LL  D++ +   +S + L  LS ++GK    G+ G    
Sbjct  43   LNEPISDWLTVLARREPTLAALLRDDAVSVKGKVSIERLHILSAFVGK---LGRDGYDAR  99

Query  61   EYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT  120
             + +RQ   + AE  A+ D P FI ++ R+YGY  F+ D GGSICE+   DD  D +++T
Sbjct  100  TFIERQRLFNEAERAAMYDVPAFIERAERLYGYKAFINDAGGSICEI---DD--DALMQT  154

Query  121  LAENTLMVWIEGSSHHTDELIKRFDENPKPMCYAP  155
            LA +TL V+I+       EL  R    P P    P
Sbjct  155  LAAHTLFVYIDTDEALYAELEARAMPTPNPSATTP  189


>ref|ZP_05705375.1|  hypothetical protein HMPREF0198_1410 [Cardiobacterium hominis ATCC 15826]
 gb|EEV88471.1|  hypothetical protein HMPREF0198_1410 [Cardiobacterium hominis ATCC 15826]
Length=77

 Score = 37.0 bits (84),  Expect = 1.5, Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query  174  ENVDPDDFVRWTYREAMAHRDPIYKSMAS-WGITVQADLISQVKTPSEFN  222
            +  + DDF+R+   + MAHR   Y  +A+ +G+T+ A  + QV+  ++FN
Sbjct  15   DRFESDDFIRFVTPKMMAHRRARYLQLAARYGVTLGAREVWQVRDAADFN  64


>emb|CAG04611.1|  unnamed protein product [Tetraodon nigroviridis]
Length=666

 Score = 36.6 bits (83),  Expect = 2.4, Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 53/113 (46%), Gaps = 10/113 (8%)

Query  17   PYLRELLLGDSIHISSNISFDNLAPLSNYLGKPGNQGKGGLSIEEYKKRQAQHHIAEIKA  76
            P+LR+ L  D+++++  + ++N   +   + K G        IE   + Q +H + E   
Sbjct  312  PHLRDRLHADAVNLAKQVGYENAGTVEFLVDKHGKH----YFIEVNSRLQVEHTVTE--E  365

Query  77   LLDT-PNFINKSNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKT-LAENTLM  127
            + D  P  +++S  +   P   C TG S   V  P  P+ P+  T  +EN LM
Sbjct  366  ITDCHPVVMHRSTAV--APARRCQTGPSPAGVNLPSPPHPPICFTRRSENQLM  416


>ref|YP_191394.1|  hypothetical protein GOX0966 [Gluconobacter oxydans 621H]
 gb|AAW60738.1|  Hypothetical protein GOX0966 [Gluconobacter oxydans 621H]
Length=623

 GENE ID: 3249023 GOX0966 | hypothetical protein [Gluconobacter oxydans 621H]
(10 or fewer PubMed links)

 Score = 35.8 bits (81),  Expect = 3.8, Method: Compositional matrix adjust.
 Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 15/90 (16%)

Query  88   NRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKTLAENTLMVWIEGSSHHTDELIKRFDEN  147
            +R+ G  +F   T  +        D +D +LK LAEN  + WI  +S+    +  +F   
Sbjct  296  HRVIGIKDFFIYTNDNT-------DGSDALLKVLAENGEITWINNTSNPVPGINMQFKAY  348

Query  148  PKPMCYAPDFLDLKWREYLKINKCSVENVD  177
               +   P+ LD +W        C+V ++D
Sbjct  349  NHALTTLPEILDFRW--------CAVVDID  370


>ref|ZP_05540510.1|  solute-binding protein (secreted protein) [Streptomyces griseoflavus Tu4000]
Length=411

 Score = 35.8 bits (81),  Expect = 4.0, Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 0/63 (0%)

Query  162  WREYLKINKCSVENVDPDDFVRWTYREAMAHRDPIYKSMASWGITVQADLISQVKTPSEF  221
            W +  +  K  V+  DP D  +       A +D +YK+     I VQAD +SQ+ T  E 
Sbjct  67   WNDVDRKVKEMVDAGDPPDMAQIGAYADYAAKDMLYKAGDLLSIPVQADFVSQLSTAGEV  126

Query  222  NSL  224
            NS+
Sbjct  127  NSV  129


>ref|ZP_04566622.1|  trifunctional thioredoxin/methionine sulfoxide reductase A/B protein [Fusobacterium mortiferum ATCC 9817]
 gb|EEO36741.1|  trifunctional thioredoxin/methionine sulfoxide reductase A/B protein [Fusobacterium mortiferum ATCC 9817]
Length=499

 Score = 35.4 bits (80),  Expect = 5.3, Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 12/101 (11%)

Query  49   PGNQGKGGLSIEEYKKRQAQHHIAEIKALLDTPNFINKSNRIYGYPNFV-CDTGGSICEV  107
            PG  G+  +S EE+KK  +      IK L+D    + K  RI  +P  +  D  G I  V
Sbjct  88   PGKSGE--MSKEEFKKWYSSLGYKNIKVLVDEKGELLKKARIRAFPTSIFIDETGEIKGV  145

Query  108  VNPDDPNDPVLKTLAENTLMVWIEGSSHHTDELIKRFDENP  148
            V    P + +LK +  +         S   +E++K+ D  P
Sbjct  146  VPGQLPKEQILKIMGVD---------SQKKEEVVKKEDNVP  177


>ref|ZP_01873825.1|  hypothetical protein LNTAR_10216 [Lentisphaera araneosa HTCC2155]
 gb|EDM28282.1|  hypothetical protein LNTAR_10216 [Lentisphaera araneosa HTCC2155]
Length=158

 Score = 35.0 bits (79),  Expect = 5.4, Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 5/86 (5%)

Query  87   SNRIYGYPNFVCDTGGSICEVVNPDDPNDPVLKTLAENTLMVWIEGSSHHT--DELIKRF  144
            SN +  Y  FV   GG + ++   D  ND V + +   T++   EG+      +E+I  F
Sbjct  67   SNEVQDYAKFVGTGGGGVAQIFQEDVNNDGVKEFV---TVLYKGEGNGDEVFWNEVIFWF  123

Query  145  DENPKPMCYAPDFLDLKWREYLKINK  170
             +  +   +  DF   K R YLK  K
Sbjct  124  KDGTQKRTWVKDFDHKKMRHYLKTIK  149