GOS 706020
From Metagenes
| Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary! |
| Sequence | |||
|---|---|---|---|
| CAMERA AccNum : | JCVI_READ_1092343625359 | ||
| Annotathon code: | GOS_706020 | ||
| Sample : |
| ||
| Authors | |||
| Team : | BioCell2008 | ||
| Username : | chalia | ||
| Annotated on : | 2009-02-05 17:29:59
| ||
Synopsis
- Gene symbol: soxA
- Biological Process: protein biosynthesis GO:0006412
- Molecular Function: enzyme regulator activity GO:0030234
- Taxonomy: Proteobacteria (NCBI info)
Rank: phylum - Genetic Code: Bacterial and Plant Plastid - NCBI Identifier: 1224
Kingdom: Bacteria - Phylum: Proteobacteria - Class: - Order:
Bacteria; Proteobacteria;
Genomic Sequence
>JCVI_READ_1092343625359 GOS_706020 genomic DNA
GTTAAACGACCAGAATTAGGTGAAGAAATATCTGATCACGATTGGGATAATTTTGTTTACAATAGAAAAAGCTTGAGAGGAAAGCATTGGGAGTTATGGC
AACATTTATCAGGTTGCAGACAATGGATTAAAGTTCAGAGAGATACAGCTACACACGAAATTTTTAAAACTCTTAAAGCAAACGAAGATATTTCATAATG
ACACAAAGTTTTAGATTAGAAACTGGTGGATTAATAAATAGAGATAAAAAAATTTCTTTTAAATTTAATGGTAAAAATTATTTTGGTTATGAGGGAGACA
CTCTTGCTTCTGCATTAATTGCCAATGGAGTTCATTTAATTGGAAGAAGTTTCAAATATCATAGACCAAGAGGTTTTTTTGGTGCTGGGGTTGATGAGCC
ATATGCAATAGTTCAATTATACAGAAACGGTGAAACAGAGCCAAATATTAAAGCTACTGAACAAGAACTTTTTGAAGGTCTTGAAGCAAAAAGTGTTAAT
TGTTGGCCGAGTGTGAATTTTGATGTTGGAGCTATAAATAATTTTTTAAAGATATTTCTTCCTGCAGGCTTTTATTACAAGACTTTTATGTGGCCAAAAA
GTTTTTGGTATAAAATTTATGAACCATTCATCAGAAAAGCTGCTGGTTTAGGCACTGCATCTATAAAACATGATAAAGAAAGATATGAACATAAATATGA
ATATTGTGATCTGCTAATCACAGGCTCACGTCCATCTGGATTAGCGAGTGCTTATTCAGCTGCAAAAAATGGTGCTAAAGTAATTCTCGCAGAGGACAAA
TCACGATTTGGTGGAACTCTATTAACCAGTGATGTCAATATAGGGAATCAATCAGTAAAGAGTGGGCAGATAGTATTGTTTCAGAACTTAAAGAAATGTC
TAATGTTACTATAAAAATAGGTC
Translation
[198 - 911/923] direct strand
>GOS_706020 Translation [198-911 direct strand]
MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSV
NCWPSVNFDVGAINNFLKIFLPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASAYSAAKNGAKVILAED
KSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKKCLMLL
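The reading frame quoted above can be checked by hand. The following is a minimal Python sketch, not the Annotathon tool's own code. It assumes NCBI translation table 11 (the "Bacterial and Plant Plastid" code listed in the Synopsis), whose amino-acid assignments match the standard code. The helper name `translate` is hypothetical. The sketch translates the first 33 bases of the ORF (read positions 198 to 230) and checks the coordinate arithmetic.

```python
# Codon table built from the compact NCBI layout (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First 33 bases of the ORF, copied from read positions 198-230.
orf_start = "ATGACACAAAGTTTTAGATTAGAAACTGGTGGA"
peptide = translate(orf_start)

# ORF coordinates as given on this page (1-based, inclusive).
start, end = 198, 911
orf_length = end - start + 1
codon_count = orf_length // 3

print(peptide, orf_length, codon_count)
```

The peptide should match the first eleven residues of the translation (MTQSFRLETGG), and the interval corresponds to 714 bp, that is 238 codons.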
Phylogeny
PROTOCOL:
a) Phylogeny.fr / ProtPars method
b) Phylogeny.fr / ProtDist/DnaDist-Neighbor method
---------------------------------------------------------------------------------------------------
ANALYSIS OF RESULTS:
a)
We cannot assign a taxonomic group, because our sequence is not clearly related to any one group.
In addition, our tree is unrooted: our attempts to root it failed.
b) We can conclude that our sequence belongs to the Proteobacteria, since in this tree it is
related to the proteobacteria, in particular the a-proteobacteria. In addition, the gene phylogeny appears
to be consistent with the species phylogeny.
---------------------------------------------------------------------------------------------------
RAW RESULTS:
a) Parsimony
Protein parsimony algorithm, version 3.66
One most parsimonious tree found:
+--------------------------------------------------------------------Ppacifica[d-proteobacteria]
|
| +-----------------------------------------------------------------ma_sequence
23 |
| | +--Rxylanophilus[actinobacteria]
| | +----------------------------------------------------------22
| | | +--Tsp[g-proteobacteria]
+--2 |
| | +--------------------------------------------------------Bparapertussis[b-proteobacteria]
| | |
| | | +--------------------------------------Rbacterium[a-proteobacteria]
| | | |
| | | | +-----------------------------------Rlitoralis[a-proteobacteria]
+-21 | +-------20 |
| | | | | +-----Asp[actinobacteria]
| | | | | +-------------------------18
| | | +-19 | | +--Rhsp[actinobacteria]
| | | | | +-17
| | | | | +--Serythraea[actinobacteria]
| | | | |
| | | +-16 +-----------Bphymatum[b-proteobacteria]
+-----3 | | |
| | | +----------15 +--Bthailandensis[b-proteobacteria]
| | | | | +----14
| | | | | | +--Bpseudomallei[b-proteobacteria]
| | | | +-13
| +-----7 +-------11 | +--Bcenocepacia[b-proteobacteria]
| | | | +----12
| | | | +--Bdolosa[b-proteobacteria]
| | | |
| | | | +--------Cpsychrerythraea[g-proteobacteria]
| | | +-------------10
| | | | +-----Paeruginosa[g-proteobacteria]
| | | +--9
| | | | +--Pentomophila[g-proteobacteria]
+--4 | +--8
| | +--Pmendocina[g-proteobacteria]
| |
| | +-----Smeliloti[a-proteobacteria]
| +-----------------------------------------6
| | +--Ssp[a-proteobacteria]
| +--5
| +--R_sp[a-proteobacteria]
|
| +--CPelagibacteru[a-proteobacteria]
+--------------------------------------------------1
+--CPelagibactersp[a-proteobacteria]
---------------------------------------------------------------------------------------------------
b) ProtDist
+-----------Tsp[g-proteobacteria]
+------------1
! +-----------Rxylanophilus[actinobacteria]
!
! +-CPelagibactersp[a-proteobacteria]
! +-2
! +----------3 +-CPelagibacteru[a-proteobacteria]
! ! !
! +-22 +-ma_sequence
! ! !
! ! +-------------Bparapertussis[b-proteobacteria]
! +-21
+--17 ! ! +----R_sp[a-proteobacteria]
! ! ! ! +------4
! ! ! +-20 +---Ssp[a-proteobacteria]
! ! ! !
! ! ! +-----------Smeliloti[a-proteobacteria]
! ! !
! ! ! +--Pmendocina[g-proteobacteria]
! ! ! +-9
! ! ! +--10 +--Pentomophila[g-proteobacteria]
! ! ! ! !
! ! ! +--11 +--Paeruginosa[g-proteobacteria]
! ! ! ! !
! +-19 ! +-------Cpsychrerythraea[g-proteobacteria]
! ! !
! ! +-15 +Bdolosa[b-proteobacteria]
! ! ! ! +-6
! ! ! ! ! +Bcenocepacia[b-proteobacteria]
! ! ! ! +-7
! ! ! ! ! ! +Bpseudomallei[b-proteobacteria]
! ! ! +-----8 +-5
! ! +-16 ! +Bthailandensis[b-proteobacteria]
! ! ! ! !
! ! ! ! +--Bphymatum[b-proteobacteria]
! ! ! !
! ! ! ! +------Serythraea[actinobacteria]
! +-18 ! +-12
! ! +--13 +---------Rhsp[actinobacteria]
! ! !
! ! +----------Asp[actinobacteria]
! !
! +------------Rlitoralis[a-proteobacteria]
!
14----------Rbacterium[a-proteobacteria]
!
+-----------------------------------------------Ppacifica[d-proteobacteria]
Annotator commentaries
Our sequence comes from the Tropical South Pacific sample and is 923 base pairs (bp) long.
The ORF we chose is complete because it starts with a methionine and ends with a stop codon. We confirmed that the ORF is complete with Blastp: the methionine aligns with the start of the homologous sequences. Our ORF is coding because it is fairly long (714 bp) and gives good Blastp results. However, we cannot determine the molecular weight, because it is impossible to tell whether the frameshift is a mutation or a sequencing error. We therefore do not know whether to take the molecular weight of the truncated protein or to sum the molecular weights of the two "discontinuous" parts of the protein.
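The completeness criterion stated above (a start codon at the first position and a stop codon right after the last sense codon) can be written as a small check. This is a sketch under stated assumptions: the function name `orf_is_complete` is hypothetical, coordinates are 1-based and inclusive and exclude the stop codon (as for the [198 - 911] interval on this page, where bases 912 to 914 of the read are TAA), and the read in the example is a made-up toy string, not the GOS read. A check of this kind says nothing about internal frameshifts.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_is_complete(read, start, end):
    """Return True if read[start..end] (1-based, inclusive, stop codon excluded)
    begins with ATG, has a length divisible by 3, and is followed by a stop codon."""
    read = read.upper()
    first_codon = read[start - 1:start + 2]
    next_codon = read[end:end + 3]  # the three bases right after the ORF
    length_ok = (end - start + 1) % 3 == 0
    return first_codon == "ATG" and length_ok and next_codon in STOP_CODONS


# Toy read (hypothetical, for illustration only):
# 2 flanking bases, ATG AAA, the stop codon TAA, 2 flanking bases.
toy_read = "CCATGAAATAAGG"

complete = orf_is_complete(toy_read, 3, 8)     # ORF immediately followed by TAA
overrun = orf_is_complete(toy_read, 3, 11)     # interval wrongly includes the stop codon
print(complete, overrun)
```

With the toy read, the first call returns True and the second returns False, because the three bases after position 11 do not form a stop codon.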
Our hypothesis for the function of the protein is "sarcosine oxidase", because this is the function that predominates among the hits with the best E-values.
We found one protein domain, NAD(P)-binding, using InterPro.
According to InterProScan, our fragment would have "protein biosynthesis" as its biological process, based on the following definition: "The chemical reactions and pathways, including anabolism and catabolism, by which living organisms transform chemical substances. Metabolic processes typically transform small molecules, but also include macromolecular processes such as DNA repair and replication, and protein synthesis and degradation". It could also be assigned several other biological processes, such as protein degradation.
We can also suppose that the molecular function of the starting fragment is "enzyme regulator activity", based on the definition we found: "Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic." This function is in full agreement with the one found by BLAST, because sarcosine oxidase is an enzyme that, in the presence of water and oxygen, converts sarcosine into glycine, formaldehyde and hydrogen peroxide.
We assigned "soxA" as the gene name after looking at the GenBank record of a homolog with an E-value of 1e-118. The same gene name is found for many of the other homologs.
For our tree, we had some difficulty choosing our study group and our outgroup. We chose the proteobacteria as the study group because they give very good results (1e-118), and the actinobacteria as the outgroup because they have acceptable E-values (5e-30). The most coherent tree is the one built with ProtDist, since it suggests that our sequence does belong to the proteobacteria. The parsimony tree does not allow us to determine a taxonomic group, because in it our sequence is not related to any taxonomic group.
Multiple Alignment
PROTOCOL:
ClustalW2
---------------------------------------------------------------------------------------------------
ANALYSIS OF RESULTS:
We observe that our sequence is much shorter than the other sequences in our multiple alignment.
We observe similarities within well-conserved domains, for example from position +33 to +40.
We can therefore suppose that our ORF (= ma_sequence) fits correctly into the family of its homologs.
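The similarity noted above can be quantified on the first block of the ClustalW2 alignment. The sketch below is illustrative only: `pairwise_identity` is a hypothetical helper, the two rows are copied from the first alignment block for ma_sequence and CPelagibacteru, and columns where either row has a gap are skipped, which is only one of several possible conventions. It is not the distance measure used by ProtDist.

```python
def pairwise_identity(row_a, row_b):
    """Count compared columns and identical residues between two aligned rows.
    Columns with a gap ('-') in either row are ignored."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    compared = identical = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
    return compared, identical


# Rows copied from the first block of the alignment (50 columns each).
ma_sequence    = "--MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLI"
cpelagibacteru = "--MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLI"

compared, identical = pairwise_identity(ma_sequence, cpelagibacteru)
print(f"{identical}/{compared} identical = {100 * identical / compared:.1f}%")
```

For these two rows the count is 40 identical residues over 48 compared columns. Residues 33 to 40 of ma_sequence (GDTLASAL) are identical in both rows.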
---------------------------------------------------------------------------------------------------
RAW RESULTS:
CLUSTAL 2.0.10 multiple sequence alignment
CPelagibactersp[a-proteobacter --MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLV 48
CPelagibacteru[a-proteobacteri --MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLI 48
ma_sequence --MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLI 48
Bparapertussis[b-proteobacteri --MTQQYRLNHGGLVDRRRPLTFRFDGIQYQGYHGDTLASALLANGVHLV 48
R_sp[a-proteobacterie] --MTEVNRLD-GGQINRAKEVSFTFDGHRYKGYEGDTLASALLANGERLM 47
Ssp[a-proteobacterie] --MTQVNRIS-GGLIDRSTELNFTFDGKNYQGYAGDTLASALLANGVRLM 47
Smeliloti[a-proteobacterie] --MSSYRLPK-RGLVDRNVPLSFTFDGRPMQGLEGDTLASALLANGRMLV 47
Pmendocina -MSQVNRLA-QGGRIDRSQPLTFSFNGQTYQGYAGDTLAAALLANGVDVI 48
Pentomophila -MSQTYRLA-SGGRIDRSKVLNFSFNGKTYQGYAGDTLAAALLANGVDIV 48
Paeruginosa[g-proteobacteria] -MSQINRLS-SGGRIDRNRPLTFSFNGQHYQGYAGDTLAAALLANGVDIV 48
Cpsychrerythraea[g-proteobacte -MSQVNRIAGSSKRINRNRTLTFSFNGKEYTGFEGDTVASALLANGVDVV 49
Bdolosa[b-proteobacteria] -MSQKDRLG-TGGRINRAIPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bcenocepacia[b-proteobacteria] -MSQKDRLG-TGGRINRAIPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bpseudomallei[b-proteobacteria -MSQKDRLG-AGGRINRAQPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bthailandensis[b-proteobacteri -MSQKDRLG-AGGRINRAQPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bphymatum[b-proteobacteria] -MSQKNRLG-AGGRINRAIPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Serythraea[actinobacteria] -MSNEFRLA-EGGRIDRDRPLSFRFDGREYVGYEGDTLASALLANGVHQV 48
Rhsp[actinobacteria] -MNAPFRTR-QGGRLDRNTSYTFTFDGRELTGHPGDTLGSALLANGVHQI 48
Asp[actinobacteria] MTSQNARLA-AGGRIDRSISWRFTVDGEEFTGHPGDTLASALLANGRIAA 49
Rlitoralis[a-proteobacterie] --MNEFRVE-GRGRVNADKPVKFTFDGEIYKGFEGDTVASALLANGVHLM 47
Rbacterium[a-proteobacteria] ---MSHRLDGKGRLIDRSKKLRFTFNGKAMTGYAGDTLASALLGSGQSVM 47
Tsp[g-proteobacteria] --MAKRLPARDGEWIDRSRTLRFSFEGREYSAFAGDTISSALLANGVRVL 48
Rxylanophilus[actinobacteria] --MSSRLPYQEGEWIDRSKPLTFSFEGKRFTGFSGDTITSALWASGERVL 48
Ppacifica[d-proteobacteria] ---------MLEPDRETGETVHIRFDETLIAARPEDTLATALIGAGELMT 41
: : .: . **: :** . *
CPelagibactersp[a-proteobacter GRSFKYHRPRGFFGAGVDEPYAIVQLYRNGETE---PNIKATEQELFEGL 95
CPelagibacteru[a-proteobacteri GRSFKYHRPRGFFGAGVDEPYAIVQLYRNNETE---PNVKATEQELFEGL 95
ma_sequence GRSFKYHRPRGFFGAGVDEPYAIVQLYRNGETE---PNIKATEQELFEGL 95
Bparapertussis[b-proteobacteri GRSFKYHRPRGIYTAGVEEMNALVDVLKEGQAD---PNTRATVVELEDGI 95
R_sp[a-proteobacterie] GRSFKYHRPRGVLTAGSEEPNALVELRKGGRQE---PNTRATVIELFDGL 94
Ssp[a-proteobacterie] GRSFKYHRPRGVLAAGSEEPNALVELRSGGRQE---PNTRATVAEIYEGL 94
Smeliloti[a-proteobacterie] GRSFKYHRPRGILTAGAAEPNALVTVGRGGRAE---PNTRATMQELYEGL 94
Pmendocina GRSFKYSRPRGIVAAGAEEPNAVLQIGSTEAAQ--IPNVRATQQALYANL 96
Pentomophila GRSFKYSRPRGIIAAGTEEPNAILQIGSSEATQ--IPNVRATQQALYAGL 96
Paeruginosa[g-proteobacteria] GRSFKYSRARGIVAAGAEEPNAILQIGSREATQ--IPNVRATQQALYGGL 96
Cpsychrerythraea[g-proteobacte GRSFKYSRPRGIITSDSQEPNAIFQIGSTQATT--IPNPRATQTDLYQGL 97
Bdolosa[b-proteobacteria] ARSFKYHRPRGIVTAGVEEPNAVVQLETG-PYT--VPNARATEIELYQGL 95
Bcenocepacia[b-proteobacteria] ARSFKYHRPRGIVTADVAEPNAVVQLETG-PYT--VPNARATEIELYQGL 95
Bpseudomallei[b-proteobacteria ARSFKYHRPRGIVTAGVDEPNAVVQLETG-AYT--VPNARATEVELYQGL 95
Bthailandensis[b-proteobacteri ARSFKYHRPRGIVTAGVDEPNAVVQLETG-AHT--VPNARATEIELYQGL 95
Bphymatum[b-proteobacteria] ARSFKYHRPRGIVTADVAEPNAVVQLERG-AYT--VPNARATEIELYQGL 95
Serythraea[actinobacteria] GTSIKHGRPRGIMAAGVEEPNALVQIEKPFP----EPMLTATTVPLRDGL 94
Rhsp[actinobacteria] TTSIKLGRPRGITAAWAEDTGGLVQIEEPFP----EPMLLATTIELFDGL 94
Asp[actinobacteria] GNSLYEDRPRGIMSAGVEESNALVRVEARFPGHVAESMLPATTVTLVDGL 99
Rlitoralis[a-proteobacterie] GRSFKYHRPRGVVTAGSEEPNALIGTTRGKGRF--EPNTRATIQEIYEGL 95
Rbacterium[a-proteobacteria] GRSFKYHRPRGVVASGVEEPNALMNLGEGGRFE---PNQRATTTPLFDGL 94
Tsp[g-proteobacteria] GRSFKYHRPRGVFSAANHDSNVLLQSDSD-------FNIRGDVTAVADGM 91
Rxylanophilus[actinobacteria] GRSFKYHRPRGVLSFANHDVNVMVQNGAV-------PNIRADVTLIKSNQ 91
Ppacifica[d-proteobacteria] SRSPKYRRPRGAYCLAGDCGTCLVRVDGR-------PNVRACMTPVREGM 84
* *.** :. . : .
CPelagibactersp[a-proteobacter EAKSVNCWPSVNFDVGAINNFLKI-FLPAGFYYKTFMWPKSFWYKVYEPF 144
CPelagibacteru[a-proteobacteri EATSVNCWPSVNFDIGAINNLLKI-FLPAGFYYKTFMWPKSFWYKVYEPF 144
ma_sequence EAKSVNCWPSVNFDVGAINNFLKI-FLPAGFYYKTFMWPKSFWYKIYEPF 144
Bparapertussis[b-proteobacteri EVSSQNRWPSLRFDVRSFHGMISR-LIPAGFYYKTFMWPAKFWPK-YEHM 143
R_sp[a-proteobacterie] EAAPQNAWPSLRFDAMAVNDRFSN-FLTAGFYYKTFMWPKAFWEKIYEPI 143
Ssp[a-proteobacterie] SANSQNRWPSLKHDVMAINDRFSA-FLSAGFYYKTFMWPRAFWEKLYEPV 143
Smeliloti[a-proteobacterie] EARSQNRWPSLAFDIGALNGLLSP-FLGAGFYYKTFMWPAPLWEKLYEPV 143
Pmendocina TATSTNGWPSVNTDLMGILGKVGGGMMPPGFYYKTFMYPQNLWL-TYEKY 145
Pentomophila VATSTNGWPNVNNDMMGIIGKVGGNMMPPGFYYKTFMYPKSFWM-TYEKY 145
Paeruginosa[g-proteobacteria] VATSTNGWPNVQNDLMGIFGKVGGKLMPPGFYYKTFMYPQSMWM-TYEKY 145
Cpsychrerythraea[g-proteobacte TASSTNGWPNVDFDLMGTVGKLGGSMMPPGFYYKTFMFPQSLWM-SYEHL 146
Bdolosa[b-proteobacteria] VATSVNAEPSLENDKYAINQKLSR-FLPAGFYYKTFMWPRRMWP-KYEEK 143
Bcenocepacia[b-proteobacteria] VATSVNAEPTLENDKYAINQKFSR-FMPAGFYYKTFMWPRNMWP-KYEEK 143
Bpseudomallei[b-proteobacteria VATSVNAKPSLEHDRMAVMQKLAR-FLPAGFYYKTFMWPRNLWP-KYEEK 143
Bthailandensis[b-proteobacteri VATSVNAKPSLEHDRMAVMQKFAR-FLPAGFYYKTFMWPRNLWP-KYEEK 143
Bphymatum[b-proteobacteria] VATSVNAEPNLEHDRMAINQKFAR-FMPAGFYYKTFMWPAKWWP-KYEEK 143
Serythraea[actinobacteria] EATGLP-------------------------------------------- 100
Rhsp[actinobacteria] VARGIP-------------------------------------------- 100
Asp[actinobacteria] KADLLN-------------------------------------------- 105
Rlitoralis[a-proteobacterie] DTESQNKWPTLQFDLGAINDRLYM-LFSAGFYYKTFMWPRSFWDSVYEPL 144
Rbacterium[a-proteobacteria] TATSQNHWPSLEFDIGAVNDLAAR-FLPAGFYYKTFMFPRFAWKHLFEPF 143
Tsp[g-proteobacteria] RLSAINTQGGLDKDRGRFLDRLSP-LLPVGFYYKTFHRPKALFP-FWENQ 139
Rxylanophilus[actinobacteria] NLRAVNTIGGLKLDLGQINNRLSR-FLPVGFYYKAFHKPARLFP-LWEKF 139
Ppacifica[d-proteobacteria] RVSSQNTYRPRRLDPTAIVDKVFV----KGMDHHHLMVRPRIANQIMQEF 130
CPelagibactersp[a-proteobacter IRKAAGLGVASTKHDKERYEHKYEYCDLLIAGSGPSGLASAYAAAKNGAR 194
CPelagibacteru[a-proteobacteri IRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPSGLASAYAAAKNGAR 194
ma_sequence IRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASAYSAAKNGAK 194
Bparapertussis[b-proteobacteri IRHAAGLGRAPLVRDRDRYEKQHAYCDVLVVGAGPAGLAAARSACQAGLR 193
R_sp[a-proteobacterie] IRKAAGLGSISFEEDPDLYDKGFLHCDLLIIGSGPSGLAAALTAGRSGAR 193
Ssp[a-proteobacterie] IRKAAGLGSLSGEGDPDAYDKGYLHCDLLVIGAGPAGLSAALTAGRGGAQ 193
Smeliloti[a-proteobacterie] IRRAAGLGKASYEADPDAYEKSWAHCDLLVIGAGPTGLAAALTAGRAGAR 193
Pmendocina IRKAAGLGRSPKENDPDIYDYMNQHCDVLVVGAGPAGLAAALAAGRSGAR 195
Pentomophila IRKAAGLGRAPLQNDPDSYDYMNQHCDVLIVGAGPAGLAAALAAARSGAR 195
Paeruginosa[g-proteobacteria] IRKAAGLGRAPTEVDPDSYDWMNHHCDVLVVGGGPAGLAAALAAARSGAR 195
Cpsychrerythraea[g-proteobacte IRKGAGLGASPQQNDPDSYDKMHHHCDVMIVGGGPAGLAAALSAAQTGAR 196
Bdolosa[b-proteobacteria] IREAAGLGKAPDTLDADRYDKRYAHCDVLVVGGGPSGLAAAHAAATAGAR 193
Bcenocepacia[b-proteobacteria] IREAAGLGKAPEVLDADRYDKCYAHCDVLVVGGGPSGLAAAHAAATAGAR 193
Bpseudomallei[b-proteobacteria IREAAGLGKAPDTLDADRYDKCYAHCDVLVVGGGPTGLAAAHAAAVNGAR 193
Bthailandensis[b-proteobacteri IREAAGLGKAPDTLDADRYDKCYAHCDVLVVGGGPAGLAAAHAAAVNGAR 193
Bphymatum[b-proteobacteria] IREAAGLGKAPEVLDADRYDKCYAHCDVLVVGGGPTGLAAAHAAASSGAR 193
Serythraea[actinobacteria] -----GQGRLAEEADPARYDTMHAHCDVLVVGAGPAGLSAALSAARSGAR 145
Rhsp[actinobacteria] -----GQGRLAEIADSAKYDAKHVHTDLLVAGAGPAGLAAALTAARAGAR 145
Asp[actinobacteria] -----GLGRLDPEEDRAEYDKKFVHTDVLVIGGGPAGLAAAREAVRTGAR 150
Rlitoralis[a-proteobacterie] IRKAAGLGKAPTEVDPDHYASRYLHCDVLIVGAGPSGIAAALTAGRAGSK 194
Rbacterium[a-proteobacteria] IRQSAGLGQVPKEPDADRYEHVYHHTDVLVIGGGVAGLAAARAAAAGGAK 193
Tsp[g-proteobacteria] IRKRAGLGRIDTQWPELRLPKRHGFCDLLVVGAGPSGLSAAIAAAESGAR 189
Rxylanophilus[actinobacteria] IRKAAGLGYVNVNSKRRLWSKAYGFADVLVIGAGAAGLSAAISAAEAGAK 189
Ppacifica[d-proteobacteria] ARNLTGFGELPEVVGERGCEHIAHELPVLIIGAGPAGRALAARLREAGID 180
* * ::: *. :* : * *
CPelagibactersp[a-proteobacter VILAEDKSRFGGTLLT------SDVNIGNQTGKEWADGIISELKEMPNVT 238
CPelagibacteru[a-proteobacteri VILAEDKPRFGGTLLT------SEVNIGNQTGKEWAENIISELKEMPNVI 238
ma_sequence VILAEDKSRFGGTLLT------SDVNIGNQS-----------VKSGQIVL 227
Bparapertussis[b-proteobacteri VLLVDEKSRVGGTLPG------SNTEIEGVAGAKWATAVERELRESGHAS 237
R_sp[a-proteobacterie] VILADEDFRMGGRLNS------ETLALGDQSGADWAAAAIAELADMPNVR 237
Ssp[a-proteobacterie] VILADEDFQLGGRLLS------DAQSLCNQSNAEWVAATQAELIALPNVR 237
Smeliloti[a-proteobacterie] VILVDEGSLPGGSLLS------DTATIDGKAAADFARDTSDELRSMPNVQ 237
Pmendocina VILADEQEEFGGSLLS------TREMLDDKPAADWAVKAIAELQKMPEVT 239
Pentomophila VILADEQEEFGGTLLD------SRETLDGKPAAEWVNAVVAELESLPEVT 239
Paeruginosa[g-proteobacteria] VILADEQEEFGGSLLD------TRETLDGKPAAEWVADAVAELQGLPEVI 239
Cpsychrerythraea[g-proteobacte VIISDEQNEFGGSLLC------STQQIDGQLPSQWVEKTVAQLSEMDNVM 240
Bdolosa[b-proteobacteria] VMLVDDQRELGGSLLS------CRAEIDGKPALQWVEKIEAELRKLPDVT 237
Bcenocepacia[b-proteobacteria] VILVDDQRELGGSLLS------CRAEIDAKPALQWVEKIEAELRKLPDVT 237
Bpseudomallei[b-proteobacteria VILVDDQRELGGSLLA------CRAEIDGKPALQWVEKIEAELAKLPDMS 237
Bthailandensis[b-proteobacteri VILVDDQRELGGSLLA------CRAEIDGKPALQWVEKIEAELSKLPDVK 237
Bphymatum[b-proteobacteria] VILVDDQRELGGSLLS------CKTEIDGHAALSWVEKIEAELSRMPDVK 237
Serythraea[actinobacteria] VIVADADAEFGGSLLG------IGERLDDAPATEWVRRAVAELATYPEVR 189
Rhsp[actinobacteria] VVLVDEQSEAGGDLLG------STDLIDGAPALDWVAAAVAELATYPDVL 189
Asp[actinobacteria] VMLLDDQPELGGTLLSGSTAPDLAEAIEGKPSLEWVADVEAELVSAAECT 200
Rlitoralis[a-proteobacterie] VVLVDENTEMGGTLLS-----EPAVSIEGQSAWDWLAAATNELDQLPNVR 239
Rbacterium[a-proteobacteria] VMVLEQTAHWGGRAPVD------GGQIDGLDPETWVNNAVQELETAENVT 237
Tsp[g-proteobacteria] VWLVDENARAGGSLN-------------DRQDGALRDQLLARLADLPNLT 226
Rxylanophilus[actinobacteria] VVLVDENPRVGGSLTY--------AKTINNNGTSVLADLARKVESYPNIE 231
Ppacifica[d-proteobacteria] HAIVDRLDRPQLRAAP-----ALGAEAPALAPVEDVLADTGVFGVYPGPK 225
: : .
CPelagibactersp[a-proteobacter VKNRSQVFGYYDHNMLVMSERISDHL-PSTKKFHPKQRLWYIRAKEVLIS 287
CPelagibacteru[a-proteobacteri VKNRSQVFGYYDHNMLVMSEKLSDHL-PKTKKYNPKQRLWYIRAKEVLIS 287
ma_sequence FQNLKKCL------MLL--------------------------------- 238
Bparapertussis[b-proteobacteri VMLRTTAFGYYDHDTVALAQQCDT----PTNPHGATQRLWYVHAKQVVLA 283
R_sp[a-proteobacterie] LMSRTTIVGAFDHGTYGAVERVQDHV-AVPQEGKPRQIFWRIYSRRALLC 286
Ssp[a-proteobacterie] VMPRTTVFGAYDHGVYGAVERNADHL-VAPEENKPRQTLWRIYSRRAVVA 286
Smeliloti[a-proteobacterie] VLVRTTAFGWYDGNVFGAVERVQKHV-REPASHLPVERLWRIVAGKALLA 286
Pmendocina LLPRATVNGYHDHNFLTIHQRLTDHLGEVAPMGQPRQRMHRVRAGRVVLA 289
Pentomophila LLPRSTVNGYHDHNFLTIHERLTDHLGDRAPIGQVRQRVHRVRANRVVLA 289
Paeruginosa[g-proteobacteria] LLPRSTVNGYHDHNFLTIHERRTDHLGEVAPLGQVRQRVHRVRAKRVVLA 289
Cpsychrerythraea[g-proteobacte LLPRSTVFGYYDHNLVGINERRTDHLGEHQ-LQSTRQRVHKVRAKQVILA 289
Bdolosa[b-proteobacteria] ILSRSTAFGYQDHNLVTITQRLTDHL-PVSMRKGTRELLWKVRAKRVILA 286
Bcenocepacia[b-proteobacteria] ILSRSTAFGYQDHNLVTVTQRLTDHL-PVSMRKGTRELLWKVRAKRVILA 286
Bpseudomallei[b-proteobacteria ILTRSTAFGYQDHNLVTVVQRLTDHL-PVSMRKGTREMIWKVRAKRVILA 286
Bthailandensis[b-proteobacteri ILTRSTAFGYQDHNLVTVVQRLTDHL-PVSMRKGTREMIWKVRAKRVILA 286
Bphymatum[b-proteobacteria] ILSRSTAFGYQDHNLVTVTQRLTDHQ-PVSMRKGTRELLWKIRAKRVILA 286
Serythraea[actinobacteria] QLPSTTVFGHYDDNYLVAVENR----GEDAP---SRQRIWRVRAREVVLA 232
Rhsp[actinobacteria] HLQRTTAFGNYDDGFVLALQRRTDHLGVEAPAALSRQRVWRIRARHILVA 239
Asp[actinobacteria] VLNRTTAFGAYDANYIVAVQNRTDHLSSPAAPGVSRQRIWHIRAKQVVVA 250
Rlitoralis[a-proteobacterie] LMTRTTAMGYYHQNMIGMVQKLTDHM-ADIPDGAPRERMWRVRAHEVVLA 288
Rbacterium[a-proteobacteria] LRLGTMGAGVYDHGYVLGYERVAD---ATPGDDRPRHRLWRIRAKQIVTA 284
Tsp[g-proteobacteria] FLPDTVAAGWYADHYVPLVTPKG---------------LIRLRARAVIVA 261
Rxylanophilus[actinobacteria] FWSDTVASAYFEDQWVPLVHSDGG--------------MTKMRAKSVVVA 267
Ppacifica[d-proteobacteria] LGLEGEEEGP-DRALVAASEGGESSN---------HERLYAFRPRHLVFA 265
CPelagibactersp[a-proteobacter SGSIERPLVFGNNDTPGVMLSSAAKEYLKVYGVLVGKKPLVFTNNDSGYE 337
CPelagibacteru[a-proteobacteri SGSIERPLVFGNNDTPGVMLSSAAKEYLKVYGVLVGKKPLIFTNNDSGYE 337
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri AGAIERPCVFANNDLPGVMLASAARTYCNEFGVAVGRRVLVLANNDSAYE 333
R_sp[a-proteobacterie] AGAMERPIAFADNDRPGVMLASAVRSYLNRWAAAPAQEIAIFTNNDDGHR 336
Ssp[a-proteobacterie] IGAIERPIAFENNDRPGVMLAGATRAYANRWAVTPARSVVVFANNDDAHQ 336
Smeliloti[a-proteobacterie] TGAEERPLVFGGNDRPGVMMAGAMRAYLNRYGVAPGRTPAIFTTNDTGYT 336
Pmendocina TGAHERPLVYANNDVPGNMLADAVSTYVRRYGVAPGQKLVLSTNNDYAYR 339
Pentomophila AGAHERPLVYGNNDLPGNMLAGAVSTYVRRYGVAPGRKLVLSTNNDHAYR 339
Paeruginosa[g-proteobacteria] AGAHERPLVYGNNDLPGNMLAGAVSTYVRRYGVAPGKKLVLATNNDYAYR 339
Cpsychrerythraea[g-proteobacte TGAHERPLVYGNNDVPGCMLANAISTYINRYDVVPGKQLVLMTTNDNAYK 339
Bdolosa[b-proteobacteria] TGAHERPIVFGNNDLPGVMLAGAVSTYVHRFGVLPGRNAVVFTNNDRAYQ 336
Bcenocepacia[b-proteobacteria] TGAHERPIVFGNNDLPGVMLAGAVSTYVHRFGVLPGRNVVVFTNNDRAYQ 336
Bpseudomallei[b-proteobacteria TGAHERPLVFGNNDLPGVMTASAVSAYIHRYGVLPGRVAVVATNNDRGYQ 336
Bthailandensis[b-proteobacteri TGAHERPLVFGNNDLPGVMTASAVSTYIHRYGVLPGRVAVVATNNDRGYQ 336
Bphymatum[b-proteobacteria] TGAHERPIVFGNNDLPGVMLASAVSTYIHRFGVMPGRNAVVFTNNDAGYR 336
Serythraea[actinobacteria] TGSHERPLVFAGNDRPGTMLAGSARTYLHRYGVVPGRRAVVFTANDSAYA 282
Rhsp[actinobacteria] AGAHERPVVFTDNDRPGIMLAHGARTFLHRYGVKVGEQAVVFTTNDSAYE 289
Asp[actinobacteria] PGAHERPLVFENNDRPGIMLASAVRSYLNRYAVAAGQRVVISTTNDSAYA 300
Rlitoralis[a-proteobacterie] QGAIERPMVFDGNDCPGVMMAGAAQTFLNRFGVLVGRRPVVLTSHDSAWY 338
Rbacterium[a-proteobacteria] TGAIERPLSFPGNDVPGVMLASAVRDYVVNWGVAPGRRTVIVTNNDDAYL 334
Tsp[g-proteobacteria] GGVYEQPAVFRNNDLPGVMLASAALRLARRYGVAACESAVILAANSDAYR 311
Rxylanophilus[actinobacteria] SGVMEQPAVFRNNDLPGIMLGSAAQRLIYRYAVKPFDRGIVLAANSDAYG 317
Ppacifica[d-proteobacteria] TGCREPMIPFANNDLPGVVGARGLLAALRRAGSRLSGRCVVVGEGEAAEG 315
CPelagibactersp[a-proteobacter TAIEFKKNGVDPI-ILDTRK-DPHSEIIDEAKNLGINIKFSYVVVAAQGY 385
CPelagibacteru[a-proteobacteri TAIEFKKNGVDPI-ILDTRK-EPKSEIIDEAKKLDIEIKFSYVVVAAKGY 385
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri AALDLKRAGIDIVGVVDQRE-AVSASLSETLASLRIPHHRGSTIKKATGR 382
R_sp[a-proteobacterie] TAADLIAKGVSVPAVIDVR--------ADAPSVAGTELLAGAEVIGTSGR 378
Ssp[a-proteobacterie] TAKDLIAKGIEVHAVVDTR--------SDAPGIEGTELLAGAQIIGTKGR 378
Smeliloti[a-proteobacterie] LAQELEAAGVDVVAIVDSRP-A-----AGVDYRGKARLVREAVVCGTKGG 380
Pmendocina VVLDWLDAGRQVVAVADARS-NPRGSWVEEARRRGVRVLTGSAVVEARGS 388
Pentomophila CALDWHDAGLQVVAIADARH-NPRGSLVEEARAKGIRILTSSAVIEAKGS 388
Paeruginosa[g-proteobacteria] VALDWQEAGLQVVAIADARA-NPRGEWVEEARQRGMRVITGSSVIEARGG 388
Cpsychrerythraea[g-proteobacte TAIDWHQAGRKVVAIVDTRS-TSNGDLVNKVKKLGIDIIFGHGVIEVKGS 388
Bdolosa[b-proteobacteria] TALDLKACG-AKVTVVDSRA-SSNGALPAAAKRQGVTVMSGAVVTAASGK 384
Bcenocepacia[b-proteobacteria] TALDLKACG-AKVTVVDSRA-SSNGALPAAAKRQGVTVMSGAVVTAASGK 384
Bpseudomallei[b-proteobacteria CALDLKACG-AKVTVVDARA-STRGALPAVAKRHGITVMSGAAVSAAAGK 384
Bthailandensis[b-proteobacteri CALDLKACG-AKVTVVDARA-STRGALPAVAKRNGVTVMSGAVVSAAAGK 384
Bphymatum[b-proteobacteria] CALDMKACG-ASVTVVDPRA-QGNGALQAAARRHGVKIMNNAAVMTAHGK 384
Serythraea[actinobacteria] AAVDLHDAGVAIAAIIDVRD-VVSTRWASHCIERGIPIHPEAAVVSTSGT 331
Rhsp[actinobacteria] AAIDLHDAGVRINAIVEARD-DAPARWQRECDARGITIRAASVVSGTRGN 338
Asp[actinobacteria] LASDLRAAGVKVAAVVDAR--PRLTEVAAAAVESGTRVLIGSAVANTSAS 348
Rlitoralis[a-proteobacterie] SAFDMADAGAEVVAIVDTRP-EVAPSLVQQAMKRGIETLVGHTATGTKGR 387
Rbacterium[a-proteobacteria] TALALKEAKLEVPAIIDVRA-TLVGPLADRARKAGIKLMHGKAVVGVKGK 383
Tsp[g-proteobacteria] NALELKALGIPVKAIVDLDAPETRGDLHDQVRAAGIAVHGRSTVYSAEGE 361
Rxylanophilus[actinobacteria] LVLDLLSAGVEVAAVVDLRHEGEDSALAEVVQESGVKIYRGHCIYEALPT 367
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter K------KVKSADIAKISD-DKEQLGTIENIKCDCICVSGFWTPTIHLAS 428
CPelagibacteru[a-proteobacteri K------KVKSAEVAKISD-DKNELGTLENINCDCICVSGFWTPTIHLAS 428
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri H------RVRCAVIVDQS-------GLRQTVRCDAILVSGGWTPSVHLHS 419
R_sp[a-proteobacterie] L------GLSSVTVRLAN-------GQTRKVNCGALAVSGGWNPNVHLTC 415
Ssp[a-proteobacterie] L------GLTSVTVRLLD-------GRTRDITCGALAMSGGWNPNLGLTC 415
Smeliloti[a-proteobacterie] K------AISAIEVHHG--------GRTETIAVDALAMAGGFDPIIHLAC 416
Pmendocina K------RVTGARICAIDLVSHKVTSPGETVDCDLIVSSGGYSPVVHLAS 432
Pentomophila K------HVTGARVAAIDVQAHKVTSPGETLECDLIATSGGYSPVVHLAS 432
Paeruginosa[g-proteobacteria] K------RVSGAKVARIDLQAMRASG-GEWLDCDLIASSGGYSPVVHLAS 431
Cpsychrerythraea[g-proteobacte K------RVKGVEVAPINASNHSVTGPAKHIVCDTVASSGGWSPVIHLSS 432
Bdolosa[b-proteobacteria] W------RVASVDVASY--TNGQTGGRLQSLPCDLVAMSGGFSPVLHLFA 426
Bcenocepacia[b-proteobacteria] W------RVSSVDVASY--SNGQTGGKLQTLPCDLVAMSGGFSPVLHLFA 426
Bpseudomallei[b-proteobacteria L------RVASVDVVSY--ANGRSGGKIATLPCDLVAMSGGFSPVLHLFA 426
Bthailandensis[b-proteobacteri L------RVASVDVASY--ANGRSGGKIATLPCDLVAMSGGFSPVLHLFA 426
Bphymatum[b-proteobacteria] Q------RVTSVEVVAY--ANGKTGAKQADLQCDLVAMSGGFSPVLHLFA 426
Serythraea[actinobacteria] G------RISHVHVARWETPGDRMTNVRQVIDCDVLLVSGGWNPAVHLHS 375
Rhsp[actinobacteria] G------RISHAVVSHRTDTDHRFR---IPLACDVLLVSGGWNPAVHLFS 379
Asp[actinobacteria] GEGAADGRLDSVTVRSINDDGELTSG-IEEIACDLLAVSGGWSPLVHLHS 397
Rlitoralis[a-proteobacterie] L------RVKGLRVNPIK---EGRVSYARMLSCDAVLVCGGWTPSLHLFS 428
Rbacterium[a-proteobacteria] K------QVTGVMVADLD-----GKSTPDAIECDAVAMSGGWSPVVHLWS 422
Tsp[g-proteobacteria] G------LLQAVTVCALDAEGRAKPETAQRIDCDGLLMSVGYAPAAPILY 405
Rxylanophilus[actinobacteria] RRGM---RLAGAVICPLDIQNNPIPSRAFYIDCDGICMSVGWAANIALLA 414
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter QSGNKTQFKEEIDAFIPGESKQNEK-------TLG-AANGIYTLDETLKS 470
CPelagibacteru[a-proteobacteri QSGNKTTFNKDIDAFVPGLSKQNET-------TLG-AANGTFTLEETLKS 470
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri QSGGKVGYDADLSTFVPTSTKQHSL-------SIG-ACAGRLQLSECLAD 461
R_sp[a-proteobacterie] HQRGRPQWDADLAAFVPGTDLPVGM-------SVAGAAMGQLSTAQALSS 458
Ssp[a-proteobacterie] HQRGRPVWREDIHAFVPGSDLPAGQ-------SVVGAAMGEMSTHAALRT 458
Smeliloti[a-proteobacterie] HRGGKPVWSAEKAAFLAPGSL-KGL-------EVAGGAAATTGLAACLGE 458
Pmendocina HLGGRPIWREDILAFVPGEGFQKR--------HCAGAVNGVFGLGDALAD 474
Pentomophila HLGGRPVWREDILGFVPGDAPQKR--------VCVGGVNGVYALGDVIAD 474
Paeruginosa[g-proteobacteria] HLGGKPEWREEILAFVPGEGLQKR--------ICAGAVNGVFGLAKVLAD 473
Cpsychrerythraea[g-proteobacte HTGSRPVWNDDIAGFVPGDTVQKQ--------HSCGGLEGVYALSKVISD 474
Bdolosa[b-proteobacteria] QSGGKACWNDEKACFLPGKPVQAE--------ASVGAAAGEFGLARALRL 468
Bcenocepacia[b-proteobacteria] QSGGKACWNDEKACFLPGKPVQAE--------ASIGAAAGEFGLARALRL 468
Bpseudomallei[b-proteobacteria QSGGKAHWNDDKACFVPGKPVQAE--------ASVGAAAGEFELARALRL 468
Bthailandensis[b-proteobacteri QSGGKAHWNDDKACFVPGKPVQAE--------ASVGAAAGEFELSRALRL 468
Bphymatum[b-proteobacteria] QSGGKAHWNDTKACFVPGKGMQPE--------TSVGAAAGEFSLARGLRL 468
Serythraea[actinobacteria] QSRGTLRFAEQIGAFVPDRSARSV--------RSAGAAAGVFATADCLRT 417
Rhsp[actinobacteria] QARGKLRYDANLGAFVPGEDLDGV--------SVAGSANGVFDLDGCLRD 421
Asp[actinobacteria] QRQGKLRWDEDLAAFVPSTVVPNQ--------QTIGSGRGSFELADCLAE 439
Rlitoralis[a-proteobacterie] HTKGSLDWDADAKAYLPGNKTEDV--------HIAGAGRGLWGIAAALED 470
Rbacterium[a-proteobacteria] HCGGKLNWDDAEAMFKPDPARPPLGADGQGFVLTAGNASGAMGLAEALAD 472
Tsp[g-proteobacteria] QSGTRMVFAEIPGQFVPEQLPPGVF--------ACGRVNGVFDLDARVAD 447
Rxylanophilus[actinobacteria] QAGCELSYAENLGQLIPKISPEGLF--------AAGRVKGIYNIQDKLCD 456
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter SFEAGNELSKKITNNDN--KVSFPNVVEKKSTVHDKFWCVPLPKGKNY-- 516
CPelagibacteru[a-proteobacteri SFETGYELSKKITNNDN--KTSSPTVMEKKSTTHDKFWCVPLPKGKTY-- 516
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri GASICAQMDGEAQGRGH--PATPPKAEKLVIPP-------PHLGYRAG-- 500
R_sp[a-proteobacterie] GACGAATALEAIGITAS--AIDLPEAEDAPISLKP----FWHVSGGKS-- 500
Ssp[a-proteobacterie] GAETAREALSDLGFTAP--GVETPKAEDAPISLTP----FWHVADAK--- 499
Smeliloti[a-proteobacterie] GAARAEAIVRELGLPCPPVAVVKVESEEGIRSPAP----LWSIPGIKD-- 502
Pmendocina GFEAGAKAAAEVG--FKAVTGSLPKAEKRIEEASVALFQVPHDKGTSRA- 521
Pentomophila GFEGGVRAATEAG--FKASAGTLPKTLARKEEATVALFQVPHDKGTARA- 521
Paeruginosa[g-proteobacteria] GYQAGSRAALDAG--YKTTAGSLPKVQPRREEASVALFQVPHEKPTARA- 520
Cpsychrerythraea[g-proteobacte GFTTGAVAAEAAGKGDGRYAGNSPTTSDPQEDASMALFHIPHSKKTSRA- 523
Bdolosa[b-proteobacteria] ALDAGIEAAKAAGFTAAQRP-VAPQVAETVEDALQPLWLVGSREAAARG- 516
Bcenocepacia[b-proteobacteria] AVDAGVEAAKAAGFTAAQRP-AAPQVAEAVEGALQPLWLVGSREAAARG- 516
Bpseudomallei[b-proteobacteria ALDAGVAAAKSAGF-AAERP-PVPKLAEAVEDALLPLWLASGAEAAVRG- 515
Bthailandensis[b-proteobacteri AVDAGVAAAKSTGF-AAERP-PVPKLAEAVEDALLPLWLASGAEAAIRG- 515
Bphymatum[b-proteobacteria] AVDAGVEAVKSIGY-AVTRP-QVPQVAEVVESPLQPLWLVGSRAEAARG- 515
Serythraea[actinobacteria] GAEAGRDAAVAAG--FDAEAGPVPRAANPPVLAGRNVWLVPSPADSAG-- 463
Rhsp[actinobacteria] GQTAGQSIMRDLG--FTVPDHTIDPAPAPAIEQSTPLVLWRVKDVAGE-- 467
Asp[actinobacteria] GISAGASAAIAAG--FSAAVEPSVIGEPKASAPTRQLWLVPGQAGTPDDW 487
Rlitoralis[a-proteobacterie] GAKAGVEAVQALG---QTADTVTYQVTDDRTGTGITQKELPSDRSAGKA- 516
Rbacterium[a-proteobacteria] GHEAGRQAAKAAGG--TLTRKAAPKAPETERQPLKQVWIMPTSAGPDKR- 519
Tsp[g-proteobacteria] GAAAAGEALAHLGMQAGPTARP----GRSSERMSHPWPVFPHPKGKN--- 490
Rxylanophilus[actinobacteria] GRRAGILAAQYAGFSKKNTKIPEEPVDSSAVGRSHPYPIYDHPKGMA--- 503
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter -KRFLDFQNDVAVSDIEIALREGYRSIEHVKRYTTLGMATDQGKTSNLNG 565
CPelagibacteru[a-proteobacteri -KRFLDFQNDVAVSDVEIALKEGYRSIEHVKRYTTLGMATDQGKTSNLNG 565
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri -KRFIDIQDDVTVEDIELAARENFRSVEHLKRYTTLGMGTDQGKTSNVNG 549
R_sp[a-proteobacterie] -RAWVDLQNDVTVKDVKLAHQENFVSVEHLKRYTTLGMATDQGKTSNMLG 549
Ssp[a-proteobacterie] -RAWLDFQNDVTVKDVKLAHQENFTSVEHLKRYTTLGMATDQGKTSNVGA 548
Smeliloti[a-proteobacterie] -KAFVDFQNDVHLKDIGLAVREGYSHVELAKRYTTSGMATDQGKLSNVNA 551
Pmendocina PKQFVDQQNDVTAAGIELATREGFESVEHVKRYTALGFGTDQGKLGNING 571
Pentomophila PKQFVDQQNDVTAAAIELATREGFESVEHVKRYTALGFGTDQGKLGNING 571
Paeruginosa[g-proteobacteria] PKQFVDPQNDVTAAAIELACREGFESIEHVKRYTALGFGTDQGKLGNING 570
Cpsychrerythraea[g-proteobacte PKQFVDYQNDVTAAGIELANREGFESIEHVKRYTALGFGTDQGKLGNING 573
Bdolosa[b-proteobacteria] PKQFVDFQNDVSAADILLAAREGFDSVEHVKRYTAMGFGTDQGKLGNING 566
Bcenocepacia[b-proteobacteria] PKQFVDFQNDVAAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 566
Bpseudomallei[b-proteobacteria PKQFVDFQNDVGAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 565
Bthailandensis[b-proteobacteri PKQFVDFQNDVGAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 565
Bphymatum[b-proteobacteria] PKQFVDFQNDVSAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 565
Serythraea[actinobacteria] HTQYVDLARDATVADIRRAVGAGLHSVEHVKRYTTIGTAHDQGKTSGILS 513
Rhsp[actinobacteria] DTQFVDVQRDATVADLARAVGAGMTSMEHIKRYTTIGTAHDQGKTSGVIS 517
Asp[actinobacteria] HHHFVDFQRDQSVADVLRSTGAGMRSVEHIKRYTSISTANDQGKTSGVNA 537
Rlitoralis[a-proteobacterie] -KAFVDFQNDVTAKDIRLAVREGMKSIEHVKRYTTNGMATDQGKLSNMNG 565
Rbacterium[a-proteobacteria] MKMWLDYQNDVKVSDVQLAAREGYASVEHTKRYTTLGMATDQGKLSNING 569
Tsp[g-proteobacteria] ---FVDLDEDLQLKDLERAAAEGFDNIELLKRYSTVGMGPSQGKHANMNA 537
Rxylanophilus[actinobacteria] ---FVDFDEDVQLKDIKNSIQEGFDSVALVNRFATLGMGPSQGKHSNMNG 550
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter LQLVSKIEN------KVVPAVGHTTFRPPYTPVSIGAIVGREVGKHTKPT 609
CPelagibacteru[a-proteobacteri LQLVSNIEN------KIVPEVGHTTFRPPYTPVTIGAIVGREVGKHSKPT 609
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri LTIMGALRS------ESPGAVGTTTFRPPYTPIRLGLLSGRHIDRHFSAV 593
R_sp[a-proteobacterie] LAVMAELTG------KSIPETGTTIFRPPYTPVAMGTLAGRATGKHFHPT 593
Ssp[a-proteobacterie] LAVMAELTG------KPIPETGTTIFRPPYTPVSMGALAGRAVGKDFHPT 592
Smeliloti[a-proteobacterie] IGLIAKARG------VSPAEVGTTTFRPFYTPVSFGALTGAHTGHHFQPV 595
Pmendocina LAIAAKSLG------ISISEMGTTMFRPNYTPVTFGAIAGRHCGELFEPK 615
Pentomophila LAIAARSLG------IGIPEMGTTMFRPNYTPVTFGAVAGRHCGHLFEPV 615
Paeruginosa[g-proteobacteria] LAIAARAQG------KSIADTGTTMFRPNYTPVTFGAVAGRHCGHLFEPV 614
Cpsychrerythraea[g-proteobacte MAITAKSLG------KTIPETGTTIFRPMYTPTTFGALAGADVKHLFDPA 617
Bdolosa[b-proteobacteria] MAILAQALG------KSIPETGTTTFRPNYTPVSFGTFAGRELGDFLDPI 610
Bcenocepacia[b-proteobacteria] MAILAGALG------KTIPETGTTTFRPNYTPVSFGTFAGRETGDFLDPI 610
Bpseudomallei[b-proteobacteria MAILAQALG------KTIPETGTTTFRPNYTPVSFGAFAGRELGDFLDPI 609
Bthailandensis[b-proteobacteri MAILAQALG------KTIPETGTTTFRPNYTPVSFGAFAGRELGDFLDPI 609
Bphymatum[b-proteobacteria] MAILADALG------KTIPETGTTTFRPNYTPVTFGTFAGRELGDLLDPI 609
Serythraea[actinobacteria] TGIITEALG------RDIADVGTTTFRAPYAPVTFAALAGRDRGDLYDPV 557
Rhsp[actinobacteria] SGITAELLG------RPIETLGTTTFRPPYTPVAFAALAGRSRGALFDPE 561
Asp[actinobacteria] IGVIAAALRTAGEASRGIGDIGTTTYRAPFTPVAFAALAGRQRGELFDPA 587
Rlitoralis[a-proteobacterie] LTIASDALG------KEAPQVGLTTFRPPYTPTTFGAFAGYHKGKHFEVT 609
Rbacterium[a-proteobacteria] LAVLSDALG------QAIPQTGTTTFRPPYTPISMGAIAGEARGELFQPI 613
Tsp[g-proteobacteria] VRILARLNK------QSIGATGTTTARPFYHPVPIKHLAGRR----LRPE 577
Rxylanophilus[actinobacteria] VRIVARMMN------QSIDKAGSITSRPFYHPVPMGVLAGRS----FHPV 590
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter RKSPMHYWHEKNNAVFVDAGVWLRPRYYKQ-GNETLFEGSKREAKNVRTN 658
CPelagibacteru[a-proteobacteri RKSPMHTWHEKNNAVFVDAGVWLRPRYYKI-GEETLFEGSKREAKNVRTN 658
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri RVSPMHEWHVRNGAVMGPANLWLRPKAYLR-GNESYAQAWQRECRNVRQD 642
R_sp[a-proteobacterie] RKTPSHRWAEEQGAVFTEVGDWLRAQWFPKAGETHWRQSVDREVLQTRNS 643
Ssp[a-proteobacterie] RLTPSHKWAEEQGAVFVEVGNWLRAQWFPKAGETHWRQSVDREVLATRNS 642
Smeliloti[a-proteobacterie] RKSPLHDWAKKHGAVFVETGLWYRSSWFPRSGERTWRESVEREVLNVRKN 645
Pmendocina RYTALQKWHLENGAEFEDVGQWKRPWYFPKNGED-LHAAVARECLAVRNA 664
Pentomophila RFTALHAWHIKNGAEFEDVGQWKRPWYFPKPGED-IHTAVARECKAVRDS 664
Paeruginosa[g-proteobacteria] RFTALHAWHVKNGAEFEDVGQWKRPWYFPRRGED-MHAAVARECRAVREA 663
Cpsychrerythraea[g-proteobacte RFSAMHKWHLENGAEFEDVGQWKRPWYFPQPGET-MQQSLERECLATRNS 666
Bdolosa[b-proteobacteria] RKTCVHEWHVEHGAMFEDVGNWKRPWYFPKNGED-LHAAVKRECLAVRNS 659
Bcenocepacia[b-proteobacteria] RKTAVHEWHVEHGAMFEDVGNWKRPWYFPKNGED-LHAAVKRECLAVRNG 659
Bpseudomallei[b-proteobacteria RKTCVHEWHVEHGAMFEDVGNWKRPWYFPRNGED-LHAAVKRECLAVRNG 658
Bthailandensis[b-proteobacteri RKTCVHEWHVEHGAMFEDVGNWKRPWYFPRNGED-LHAAVKRECLAVRNG 658
Bphymatum[b-proteobacteria] RKTAVHEWHVENGAMFEDVGNWKRPWYFPLKGED-LHAAVKRECLAVRNS 658
Serythraea[actinobacteria] RVTAMHDWHVEQGAPFENVGQWKRPWYYPRPGED-METAVLRECQAVREG 606
Rhsp[actinobacteria] RVTALHDWHVGRGAVFEDVGQWKRPRYYPLPGED-MDAAVLRECAAVRRS 610
Asp[actinobacteria] RVTSIHPWHVAKGALFEDVGQWKRPWYYPQDGED-MDTAVLRECAAVRES 636
Rlitoralis[a-proteobacterie] RKTPIDSWAEENGAAFEPVALWRRAWYFPQDGED-MHKAVLRECKATRES 658
Rbacterium[a-proteobacteria] RRTPMHSAHDAAGAVWEPVGHWRRPFCFARTGET-DMEAVNREIVNTRDN 662
Tsp[g-proteobacteria] RRTPMHGWHRDHGAVFMPAGHWQRPKYYGPAS---EAEAIRAEVMAVREG 624
Rxylanophilus[actinobacteria] RRTPMHFRHEDFNAIFMRAGNWLRPEYYELSGKE-REDAIRAEVRSVRQH 639
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter VGVCDVTTLGKIDIKGPDAAELLNRVYTNAWLKLPVGKARYGVMLREDGI 708
CPelagibacteru[a-proteobacteri VGVCDVTTLGKIDIKGPDAAELLNRVYTNAWLKLPVGKARYGVMLREDGI 708
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri VGIVDVSTLGKIEVQGPDAGVFLDRVYANRISTLKVGKARYGVLLREDGI 692
R_sp[a-proteobacterie] VGVCDVTTLGKIDVQGKDAAAFLNKMYANAFAKLPVGKVRYGLMLREDGI 693
Ssp[a-proteobacterie] VGICDVTTLGKIDVQGTDAAEFLNKIYANGFAKLPVGKVRYGLMLREDGV 692
Smeliloti[a-proteobacterie] AGLCDVSMLGKIEITGSDAAEFLNRVYCNAFLKLPVGKARYGLMLREDGF 695
Pmendocina VGILDASTLGKIDIQGPDAREFLNRVYTNAWTKLDVGKARYGLMCKEDGM 714
Pentomophila VGLLDASTLGKIDIQGPDAREFLNRIYTNAWTKLDVGKARYGLMCKEDGM 714
Paeruginosa[g-proteobacteria] VGLLDASTLGKIDIQGPDAREFLNRVYTNAWTKLDVGKARYGLMCKEDGM 713
Cpsychrerythraea[g-proteobacte VGILDASTLGKIDIQGKDAREFLNRVYTNPWSKLGVGKCRYGVMCKEDGM 716
Bdolosa[b-proteobacteria] VGILDASTLGKIDIQGPDAVKLLNWMYTNPWNKLEVGKCRYGLMLDENGM 709
Bcenocepacia[b-proteobacteria] VGILDASTLGKIDIQGPDAVKLLNWMYTNPWNKLEVGKCRYGLMLDENGM 709
Bpseudomallei[b-proteobacteria VGMLDASTLGKIDIQGPDAVKLLNWVYTNPWNKLEVGKCRYGLMLDENGM 708
Bthailandensis[b-proteobacteri VGILDASTLGKIDIQGPDAVKLLNWVYTNPWNKLEVGKCRYGLMLDENGM 708
Bphymatum[b-proteobacteria] VGILDASTLGKIDIQGPDAAKLLNWMYTNPWSKLEVGKCRYGLMLDENGM 708
Serythraea[actinobacteria] VGIQDVSTLGKIDVQGPDAAEFLDLVYTNKMSTLKVGRIRYGLMCHADGM 656
Rhsp[actinobacteria] IGILDGSTLGKIDVQGPDAGVLLDMIYTNMMSTLKVGMVRYGVMCGVDGM 660
Asp[actinobacteria] VGFMDATTLGKIEIRGKDAGEFLNRIYTNAFKKLAPGSARYGVMCMADGM 686
Rlitoralis[a-proteobacterie] VGMFDASTLGKIEVSGPDAVEFMNRMYTNPWTKLGVGRCRYGLLLGEDGF 708
Rbacterium[a-proteobacteria] VGMLDASTLGKILVTGPDAGKFLDMLYTNVMSSLPVGKCRYGLMCTENGF 712
Tsp[g-proteobacteria] VGLIDVSTLGKVEVFGPDAARFMDQLYTLKLSTVKQGMTRYALMVDEAGV 674
Rxylanophilus[actinobacteria] VGLIDVGTLGKLEIHGPDALELIERICTGHFARLETGMTRYALMTDEAGI 689
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter VMDDGTTTRISENHYHMTTTTAQAANVLSHLEYYLQLVWPDLNVNVVSST 758
CPelagibacteru[a-proteobacteri VMDDGTTTRISENHYHMTTTTAQAANVLSHLEYYLQLVWPELNVNVVSTT 758
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri VFDDGTIARWGERLFILSTTTANAAAVMSHFEFLLATAWPTLRVRVTSVT 742
R_sp[a-proteobacterie] AYDDGTAARFAEDHFVVTTTTANAVLVYRNMEFARQCLFPDMDVQLISTT 743
Ssp[a-proteobacterie] AYDDGTAARLAEDHFVVTTTTANAVLVYRNMEFARQCLWPDLDVQLISTT 742
Smeliloti[a-proteobacterie] IYDDGTTSRLEENRFFMTTTTAYAAGVMNHLEFCAQVLWPQLDVRLASIT 745
Pmendocina VFDDGVTACLADNHFVMTTTTGGAGRVMEWLEIYHQTEWPELKVYFTSVT 764
Pentomophila VFDDGVTACVGDNHFIMTTTTGGAARVLQWLELYHQTEWPDMKVYFTSVT 764
Paeruginosa[g-proteobacteria] VFDDGVTACLADNHFVMTTTTGGAARVLEWLELYHQTEWPELKVYFTSVT 763
Cpsychrerythraea[g-proteobacte VFDDGVTVCLDDNRFIMTTTTGGAAGVLQWLELWHQTEWPELEVYFSTVT 766
Bdolosa[b-proteobacteria] VFDDGVTVRLAEQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLASVT 759
Bcenocepacia[b-proteobacteria] VFDDGVTVRLADQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLASVT 759
Bpseudomallei[b-proteobacteria VFDDGVTVRLGDQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLSSVT 758
Bthailandensis[b-proteobacteri VFDDGVTVRLGEQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLSSVT 758
Bphymatum[b-proteobacteria] VFDDGVTVRLADQHFMMTTTTGGAARVLTWMERWLQTEWPDMKVRLASVT 758
Serythraea[actinobacteria] VFDDGTVMRTGENRYLISTTSGGAAGVLQWLEDWLQTEWPHLRVHLTSVT 706
Rhsp[actinobacteria] VIDDGTVMRLDDDRFQVFTTTGGAAKILDWMEEWLQTEWPHLRVRLTSVT 710
Asp[actinobacteria] IFDDGVTLRLDEDRFFMTTTTGGAAKVLDWLEEWLQTEWPELDVHCTSVT 736
Rlitoralis[a-proteobacterie] IRDDGVIGRIRDDLFHVTTTTGGAASVLNMMEDYLQTEWPDLKVWLTSTT 758
Rbacterium[a-proteobacteria] VTDDGVVARIGEQTWLCHTTTGGADRIHGHMEDWLQCEWWDWKVYTANLT 762
Tsp[g-proteobacteria] VIDDGVCARWGEEHFYVSTTTTGAEAIFRQMQRMIGEWN--LKVDVVNRT 722
Rxylanophilus[actinobacteria] IIDDGVCAKLNDDHFYLTATTSGVDDLYREMSRWIQIWG--LNVEVTNYT 737
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter EQWAGAAIAGPKSRDLLQNLFP-NSDVSN----EGLPFMGYMEGDLFGV- 802
CPelagibacteru[a-proteobacteri EQWAGAAIAGPKSRDLLQKLFP-NIDASN----EGLPFMGYLEADLFGV- 802
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri DHYAQIALAGPKSREVLERLQI-SADVTD----SALPHMAVCETVWNGL- 786
R_sp[a-proteobacterie] EAWAQFAVAGPNARKLLQKVVDPEFDLSN----EGFPFMACGEVTVAGG- 788
Ssp[a-proteobacterie] EAWAQYAVAGPNSRKLLQKIVDPEFDISN----AAFPFMGCREITVCGG- 787
Smeliloti[a-proteobacterie] DQWAQMAIAGPKARMILQKIVD--EDISD----AAFPFLAAKEVSLFGGA 789
Pmendocina DHWATMTLSGPNSRKLLAEVT--DIDLDK----DAFPFMSWKEG-KVG-G 806
Pentomophila DHWATMTLSGPNSRKLLADVS--DIDLDK----EGFPFMSWKEG-LVG-G 806
Paeruginosa[g-proteobacteria] DHYATLTLSGPNSRKLLAEVT--DIDLDK----DAFPFMTWKEG-KVA-G 805
Cpsychrerythraea[g-proteobacte DHWSTMTISGPNSRKVLEKIC--DIDVSN----DSFKYMDWRAA-TVA-G 808
Bdolosa[b-proteobacteria] DHWATFAVVGPKSRKVVQKVCQ-DIDFGN----DAFPFMSYRNG-TVA-G 802
Bcenocepacia[b-proteobacteria] DHWATFAVVGPKSRKVVQKVCQ-DIDFGN----EAFPFMSYRNG-TVA-G 802
Bpseudomallei[b-proteobacteria DHWATFAVVGPKSRRVVQKVCK-DIDFAN----DAFPFMSYRDG-TVA-G 801
Bthailandensis[b-proteobacteri DHWATFAVVGPKSRKVVQKVCK-DIDFAN----DALPFMSYRDG-TVA-G 801
Bphymatum[b-proteobacteria] DHWATFAVVGPKSRKVVQKVCS-DIDFAN----EAFPFMSYRNG-TVA-G 801
Serythraea[actinobacteria] EQWATIALVGPRSREVLARVAS-EMDLDN----DDFPFMAWQDG-SVA-G 749
Rhsp[actinobacteria] EQWATFPVVGPRSRDVIGEVFP-DLDVTN----DAFGFMAWRDT-SLG-G 753
Asp[actinobacteria] EQWSTIAVVGPKSRAVLAKVAP-ELAAGGGLEAEAFPFMTFRET-TLASG 784
Rlitoralis[a-proteobacterie] EEWATIALNGPNARKLLQPFVE-GADISA----DAMPHMALVEC-TVA-G 801
Rbacterium[a-proteobacteria] EQYAQVAVAGPKARKVLEALG--GMDVSK----EAMPFMTWADG-TLA-G 804
Tsp[g-proteobacteria] SQLASMNIAGPLTRDVLQPLTDVDLSQAA------FPFLGARQGRVAG-- 764
Rxylanophilus[actinobacteria] ETFAAMNVAGPSARAVMKQLTELDLSENK------FPYLAIREGEVAG-- 779
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter -KARIFRISFSGELAYEVNVESDYGNFMWEKIMEIGEEFKIQPYGTEALS 851
CPelagibacteru[a-proteobacteri -HARIFRISFSGELAYEVNVESDNGNFMWEKIMEVGQEFKIQPYGTEALS 851
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri -KLLIYRVSFSGERAYELAIAAAYGQRLWDQLLAVGAPFSIMPYGTEAMG 835
R_sp[a-proteobacterie] CRARLFRISFSGELAYEIAVPTRYGDALVRRLMEAGEEFGVVPYGTEALG 838
Ssp[a-proteobacterie] LRARLFRISFSGELAYEIAVPTRYGDALMREMMTAGAEFDVTPYGTEALG 837
Smeliloti[a-proteobacterie] LHGCLFRISFSGELAYELAVPAGYGESIADALLEAGKDHGIMPYGVETLS 839
Pmendocina VPARVFRISFTGELSYEVNVQADYALGVWEQIIEAGKKHGLTPYGTETMH 856
Pentomophila VPARVFRISFTGELSYEINVQANYAMGVLEQIVEAGKKYNLTPYGTETMH 856
Paeruginosa[g-proteobacteria] VPARVFRISFTGELSYEVNVQADYAMGVLEALAEHGAKYGLTPYGTETMH 855
Cpsychrerythraea[g-proteobacte VKARIFRISFTGELSFEINVQANYGMHAWKAVMAAGEEFNITPYGTETMH 858
Bdolosa[b-proteobacteria] VKARVMRISFSGELAYEVNVPANAGRAVWEALMAAGAEFDITPYGTETMH 852
Bcenocepacia[b-proteobacteria] AKARVMRISFSGELAYEVNVPANAGRAVWEALMAAGAEFDITPYGTETMH 852
Bpseudomallei[b-proteobacteria VKSRVMRISFSGELAYEVNVPANAGRAVWEALMDAGAEFDITPYGTETMH 851
Bthailandensis[b-proteobacteri VKSRVMRISFSGELAYEVNVPANAGRAVWEALMEAGAEFDITPYGTETMH 851
Bphymatum[b-proteobacteria] VKARVMRISFSGELAYEVNVPANMGRAVWEALMAAGAEFDITPYGTETMH 851
Serythraea[actinobacteria] QRARVCRISFSGELAFEINVPWWHGREVWDALIDAGAPFGITPYGTETMH 799
Rhsp[actinobacteria] VHVRVARISFSGELAFEVNVDGWHAPAVWARLIAAGEKFDITPYGTETMH 803
Asp[actinobacteria] VQARICRISFSGELAYEINVPSWYGLNTWEAVAAAGAEFNITPYGTETMH 834
Rlitoralis[a-proteobacterie] FPARLFRVSFTGELGFEINVPARHGRALWEKLHEAGQKFDICTYGTETMH 851
Rbacterium[a-proteobacteria] IPARVYRISFTGELSYEIAVPANRGAELWAKVAEAGAAHGIQPYGTEAMH 854
Tsp[g-proteobacteria] VPAWLFRVGFVGELGFEIHVPAAQALHVWEALMEAGASRGIRPFGVEAQR 814
Rxylanophilus[actinobacteria] VPARIMRVGFVGELGYEVHVPATYGLFVWDRIIEAGREYGIKPFGVEAQR 829
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter TLRIEMG-HVAGSELDGRTIPYDNSLEGLLSK-KK-DFIGKRSLTREAFT 898
CPelagibacteru[a-proteobacteri TLRIEMG-HIAGSELDGRTIPYDNSLEGLVSK-KK-DFIGKRSLEREAFI 898
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri ALRIEKG-HPAGPELDGRTTAADLGLGGLVKK-EG-AFVGKALLGREGLQ 882
R_sp[a-proteobacterie] VMRIEKG-HAAGNELNGTTSALNLGMGRMVSK-KK-DCIGNTLSEREGMN 885
Ssp[a-proteobacterie] VMRIEKG-HAAGNELNGTTTALNLGLDRMVST-KK-DFIGNVLSRREGMN 884
Smeliloti[a-proteobacterie] VLRIEKG-HVTHNEINGTIVPADLGFGKMVSAGKP-DFVGKAMLQREGLT 887
Pmendocina VLRAEKGFIIVGQDTDGSVTPDDLGMGWCVGRTKPFSWIGWRGMNREDCL 906
Pentomophila VLRAEKGFIIVGQDTDGSMTPDDLNMSWCVGRNKPFSWIGLRGMNREDTV 906
Paeruginosa[g-proteobacteria] VLRAEKGFIIVGQDTDASVTPDDLNMGWAVGRSKPFSWIGWRGMNRADCL 905
Cpsychrerythraea[g-proteobacte ILRAEKGFIIVGQDTDGSVTPQDLDMDWVVGKKKDFSFIGKRSWTRFDNK 908
Bdolosa[b-proteobacteria] VLRAEKGYIIVGQDTDGSVTPYDLGMGGLVAKSK--DFLGKRSLSRSDTA 900
Bcenocepacia[b-proteobacteria] VLRAEKGYIIVGQDTDGSITPFDLGMGGVVAKSK--DFLGKRSLSRSDTA 900
Bpseudomallei[b-proteobacteria VLRAEKGYIIVGQDTDGSITPFDLGMGGLVAKSK--DFLGRRSLTRADTA 899
Bthailandensis[b-proteobacteri VLRAEKGYIIVGQDTDGSITPFDLGMGGLVAKSK--DFLGRRSLTRADTA 899
Bphymatum[b-proteobacteria] VLRAEKGYIIVGQDTDGSVTPHDLGMGGLVAKTK--DFLGRRSLARSDTT 899
Serythraea[actinobacteria] VLRAEKGFPIVGQDTDGTVTPHDLGMSWAVSKKKD-DFLGMRSFSRADTS 848
Rhsp[actinobacteria] VLRAEKGYPIIGQDTDGTVTPQDLGMSWAVSKKKR-DFIGKRSFTRAENQ 852
Asp[actinobacteria] VLRAEKGYPIVGQDTDGTVTPQDAGMEWVVSKAK--EFIGKRSYARADAK 882
Rlitoralis[a-proteobacterie] VLRAEKGFIIVGQDTDGTVTPQDAGIGWAIGKMKP-DFVGKRSLDRPDIA 900
Rbacterium[a-proteobacteria] IMRAEKGFVMIGDETDGTVIPQDLNMGWIISKKKT-DYLGKRAQERSHMA 903
Tsp[g-proteobacteria] QLRLEKGHLIVGQDTDGTSSPFDANMAWAVKFDKP-FFQGKRSLQILKER 863
Rxylanophilus[actinobacteria] RLRLEKGHIIVGQDTDGLTNPWEANLGWAVKLDKP-FFIGQRTLKILRKK 878
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter AED-----RQKVVGVVPLDKKTSIPEGSHLVKDS----KAPLPNPKLGYI 939
CPelagibacteru[a-proteobacteri AED-----RQKVVGVVPIDKKTSIPEGSHLVKDA----MAPTPNPKLGYI 939
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri AAD-----RPTLVGLR-SKSGAAIQSGSMLVLR------AEVGAQELGWV 920
R_sp[a-proteobacterie] EED-----ALKLVGFRPVKSDETISAGAHLMNAS----GAVNAKVDQGYV 926
Ssp[a-proteobacterie] AKD-----ALNLVGVRPVDPSHSLPAGGHLMRRS----GPVDATQDQGYV 925
Smeliloti[a-proteobacterie] APD-----RPQLVGVVPLDPQQSFRSGSHILAKG----AAATLENDEGYV 928
Pmendocina KEN-----RKQLIGLKPLDPNKVLPEGAQLVFDP-KQP---IPMTMVGHV 947
Pentomophila REN-----RKQLVGLKPVDPNVWLPEGAQLVFDP-KQP---IPMDMVGHV 947
Paeruginosa[g-proteobacteria] RED-----RKQLVGLRPSNPQEVLPEGAQLVFDT-QQA---IPMKMVGHV 946
Cpsychrerythraea[g-proteobacte RDD-----RKQMVGLKPKDPTFVLPEGAQIVFEK-NQS---IPMKMVGHV 949
Bdolosa[b-proteobacteria] KEG-----RKQFVGLLTDDEQFVLPEGAQIVAKD-TQVSTVDPTPMIGHV 944
Bcenocepacia[b-proteobacteria] KEG-----RKQFVGLLTEDEQFVLPEGAQIIAKD-TQVSATDPTPMIGHV 944
Bpseudomallei[b-proteobacteria KSG-----RKQFVGLLTDDAQSVLPEGGQIVELD-AAARADGTTPMLGHV 943
Bthailandensis[b-proteobacteri KSG-----RKQFVGLLTDDAQYVLPEGGQIVELD-AAARADGTTPMLGHV 943
Bphymatum[b-proteobacteria] KDN-----RKQFVGLLSDDPQFVIPEGSQIVARP-FQG---DTAPMLGHV 940
Serythraea[actinobacteria] RTD-----RKHLVGLLPADEDLVLEEGAQLVEHS---ELPQPPVPMLGHV 890
Rhsp[actinobacteria] NPL-----RKEFVGLLPLDKQTVLPEGAQIIEEISDGVLPPPPVPMLGHV 897
Asp[actinobacteria] RED-----RKHLVSVLPVDGTLRLPEGTQLVEKGIPTNPAYGPVPMQGFV 927
Rlitoralis[a-proteobacterie] APG-----RKQLVGLLTDDSKTVLVEGAQIVANP-KQP---KPMKMIGHV 941
Rbacterium[a-proteobacteria] SPD-----RWRLVGLETLDG-SVIPDGAYAVGEGFNAN---GQRNMIGRV 944
Tsp[g-proteobacteria] AAN------RLVGFRLPGSHPGPIPRECHLVIHD---------DDIAGRV 898
Rxylanophilus[actinobacteria] MDANLAQSRVLVGFKLVSNER-PWPKESHLIIEE---------DRIIGRV 918
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter SASCWSVEYDNPFSLAILKNGKNMIGEKLYVMSPLKN-KIIPVEIVSSHY 988
CPelagibacteru[a-proteobacteri SASCWSVEYDNPFSLAILKDGKNMIGKKLFAMSPLKN-KTIPVEIVSSHY 988
ma_sequence --------------------------------------------------
Bparapertussis[b-proteobacteri ASATYSPTLGQHIALGFLVNGANALGRSVLAWSALTS-SQVEVEVVNPCF 969
R_sp[a-proteobacterie] TSAAYSPVLESSIGIGFLKNGDARKGEIIRAVNPLAG-QEIQVEVVSAHF 975
Ssp[a-proteobacterie] TSAAYSPTLKSAIGLGFVKSGFERMGEQLRLVNPLEG-QEILVEIVSPHF 974
Smeliloti[a-proteobacterie] TSSAYSPHVGSTIALALVRNGRNRHGEEVLVWSGLHG-ESTPARLCNPVF 977
Pmendocina TSSYMSAAMGYSFAMALVRGGLSRIGERVFAPLADGS--VIEAEIVSPVF 995
Pentomophila TSSYAANSLGYSFAMGVVKGGLKRLGERVYSPQADGS--VIEAEIVSSVF 995
Paeruginosa[g-proteobacteria] TSSYMSASLGHGFALAVVKGGLKRMGQKVYAPLADGR--FIEAEICSSVF 994
Cpsychrerythraea[g-proteobacte TSSYYSACMGYSFALAVVKGGISRKGESVYLPLSDGT--TVEAEICSPVF 997
Bdolosa[b-proteobacteria] TSSYYSPILQRSIALAVVKGGLNKMGESVVIPLADGK--RITAKISSPVF 992
Bcenocepacia[b-proteobacteria] TSSYYSPILKRSIALAVVKGGLNKMGESVVIPLANGR--RITAKISSPVF 992
Bpseudomallei[b-proteobacteria TSSYYSPILNRSIALAVVKGGLSRMGERVAVSLANGR--RVAATISSPVF 991
Bthailandensis[b-proteobacteri TSSYYSPILNRSIALAVVKGGLSRMGERVAVSLANGR--RVAATISSPVF 991
Bphymatum[b-proteobacteria] TSSYYSPILNRSIALAVVKGGLNKMGQSVTIPLSSGK--QIAAKIASPVF 988
Serythraea[actinobacteria] TSSYRSAVLRRGFALALVKGGRDRIGETIYSTAGDG---LAAVTITEPVF 937
Rhsp[actinobacteria] TSSYLSAELGRPFGLALVKGGRARLGDTLHVPVDGN---LVAVEVTSSVL 944
Asp[actinobacteria] TSSYHSAALGRSFGLALIKNGRNRIGETLVAAAGDQ---LVDVVVAETVL 974
Rlitoralis[a-proteobacterie] TSSYWSETLGRSIAMAVVEGGFDRMDETLHIPTEEGG--TVPAKVTGTVF 989
Rbacterium[a-proteobacteria] TSTYYSPTIRKGIAMGLIQHGPDRMGEVVDFATLDGTGTVIKAKIVETCF 994
Tsp[g-proteobacteria] TSIGYSPSLKAWVGLAMVDKT-LADAAQLSIRVEGAV--IIQADVVPTPF 945
Rxylanophilus[actinobacteria] TSTAYSESLDQVIGLAFLPTERSARGTRFQIRVEGGS--MVEAEVVPTPF 966
Ppacifica[d-proteobacteria] --------------------------------------------------
CPelagibactersp[a-proteobacter VDPKGERVRS-------- 998
CPelagibacteru[a-proteobacteri VDPKGERVRS-------- 998
ma_sequence ------------------
Bparapertussis[b-proteobacteri VDIERERLLG-------- 979
R_sp[a-proteobacterie] VDPEGERLRA-------- 985
Ssp[a-proteobacterie] VDPEGEKLRA-------- 984
Smeliloti[a-proteobacterie] FDPQNERLHV-------- 987
Pmendocina YDPKGDRQNV-------- 1005
Pentomophila FDPKGERQNV-------- 1005
Paeruginosa[g-proteobacteria] YDPKGERQNVD------- 1005
Cpsychrerythraea[g-proteobacte YDPKGDRQNV-------- 1007
Bdolosa[b-proteobacteria] YDTEGVRQHVE------- 1003
Bcenocepacia[b-proteobacteria] YDTEGVRQHVE------- 1003
Bpseudomallei[b-proteobacteria YDTEGVRQHVE------- 1002
Bthailandensis[b-proteobacteri YDTEGVRQHVE------- 1002
Bphymatum[b-proteobacteria] YDTEGVRQHVE------- 999
Serythraea[actinobacteria] YDKEGARRDG-------- 947
Rhsp[actinobacteria] VDPEGARRDG-------- 954
Asp[actinobacteria] FDPEGTRKDG-------- 984
Rlitoralis[a-proteobacterie] YDPAGDRLKVE------- 1000
Rbacterium[a-proteobacteria] YDKEGAKADV-------- 1004
Tsp[g-proteobacteria] YDPEGLRQKPETAGEVNA 963
Rxylanophilus[actinobacteria] YDPDNMRQRVS------- 977
Ppacifica[d-proteobacteria] ------------------
BLAST
PROTOCOL:
a) BLASTp against NR, "max target sequences: 500"
b) BLASTp against SwissProt (SP)
c) BLASTx against NR
---------------------------------------------------------------------------------------------------
RESULTS ANALYSIS:
a) These results allowed us to determine the start codon of our ORF. The homologs all begin at the
same position (8), aligned with the methionine. We therefore removed 7 codons from the beginning of
the ORF, which moves its start from nucleotide 177 to nucleotide 198.
There are many homologs with very good E-values.
Among the hits with the best E-values, one function predominates: "sarcosine
oxidase".
b) The results do not allow us to identify good homologs because the E-values are not
significant: the smallest is 1e-67, and the next one is already 0.013.
c) A shift of the reading frame (frameshift) is observed, from frame +3 to frame +2.
This could be a sequencing error.
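The start codon adjustment described in a) can be checked by hand. Below is a minimal sketch, assuming the 1-based nucleotide numbering of the read and using a short excerpt (positions 171 to 206) copied from the genomic sequence on this page. The helper names `codon_at` and `reading_frame` are hypothetical and only serve the illustration. It also shows that the retained start lies in forward frame +3, the frame in which the BLASTx alignment of c) begins.

```python
# 1-based nucleotides 171-206 of read JCVI_READ_1092343625359,
# copied from the genomic sequence on this page.
EXCERPT_START = 171
EXCERPT = "TCTTAAAGCAAACGAAGATATTTCATAATGACACAA"


def codon_at(position):
    """Return the codon starting at a 1-based read position (hypothetical helper)."""
    offset = position - EXCERPT_START
    return EXCERPT[offset:offset + 3]


def reading_frame(position):
    """Forward frame (+1, +2 or +3) of a codon starting at a 1-based position."""
    return (position - 1) % 3 + 1


old_start = 177          # first ORF start considered
codons_removed = 7       # codons trimmed after comparison with the homologs
new_start = old_start + 3 * codons_removed

print(new_start, codon_at(new_start), reading_frame(new_start))
# prints: 198 ATG 3
```

Removing 7 codons corresponds to 21 nucleotides, so the start moves from 177 to 198, where the read carries an ATG. This agrees with the translation coordinates [198 - 911] given for the direct strand.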
---------------------------------------------------------------------------------------------------
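The observation in a) that one function predominates among the best hits can be checked by tallying the descriptions in the BLAST hit table. The following is a minimal sketch, not the procedure used by the authors: it assumes hit lines in the plain-text format of the raw results (identifier, truncated description, bit score, E-value), uses five lines copied from that table as sample input, and applies an E-value cutoff of 1e-50 that is chosen here purely for illustration.

```python
from collections import Counter

# Five hit lines copied from the BLASTp-vs-NR table in the raw results.
SAMPLE_HITS = """\
gb|EDZ60822.1| sarcosine oxidase alpha subunit [Candidatus Pe... 429 1e-118
ref|YP_266690.1| sarcosine oxidase alpha chain [Candidatus Pe... 422 1e-116
gb|ABZ06303.1| putative glycine cleavage T-protein (aminometh... 370 7e-101
ref|ZP_01754673.1| sarcosine oxidase, alpha subunit family pr... 266 8e-70
ref|ZP_02297488.1| Uncharacterized NAD(FAD)-dependent dehydro... 257 5e-67
"""

# Keywords used to group descriptions (illustrative choice).
KEYWORDS = ("sarcosine oxidase", "glycine cleavage")


def parse_hit(line):
    """Split one hit line into (identifier, description, bit score, E-value)."""
    head, bits, evalue = line.rsplit(None, 2)
    identifier, description = head.split(None, 1)
    return identifier, description, int(bits), float(evalue)


def tally_functions(text, max_evalue=1e-50):
    """Count hits per keyword, keeping only hits at or below max_evalue."""
    counts = Counter()
    for line in text.splitlines():
        if not line.strip():
            continue
        _, description, _, evalue = parse_hit(line)
        if evalue > max_evalue:
            continue
        label = next((k for k in KEYWORDS if k in description.lower()), "other")
        counts[label] += 1
    return counts


counts = tally_functions(SAMPLE_HITS)
print(counts.most_common())
```

On the five sample lines, three fall under "sarcosine oxidase", one under "glycine cleavage" and one under "other". Running the same tally on the full table would give the actual proportions.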
RAW RESULTS:
a)
Score E
Sequences producing significant alignments: (Bits) Value
gb|EDZ60822.1| sarcosine oxidase alpha subunit [Candidatus Pe... 429 1e-118
ref|YP_266690.1| sarcosine oxidase alpha chain [Candidatus Pe... 422 1e-116
ref|ZP_01264926.1| sarcosine oxidase alpha chain [Candidatus ... 422 1e-116
gb|ABZ06303.1| putative glycine cleavage T-protein (aminometh... 370 7e-101
gb|ABZ05929.1| putative glycine cleavage T-protein (aminometh... 340 6e-92
gb|ABZ06659.1| putative glycine cleavage T-protein (aminometh... 337 7e-91
ref|ZP_01754673.1| sarcosine oxidase, alpha subunit family pr... 266 8e-70
gb|EDZ42222.1| sarcosine oxidase, alpha subunit family [Rhodo... 265 2e-69
gb|EDZ45195.1| sarcosine oxidase, alpha subunit family [Rhodo... 263 9e-69
ref|YP_166984.1| sarcosine oxidase alpha subunit family prote... 263 1e-68
ref|YP_611611.1| sarcosine oxidase alpha subunit family prote... 263 1e-68
ref|YP_614139.1| sarcosine oxidase alpha subunit family prote... 261 3e-68
ref|ZP_02147355.1| sarcosine oxidase, alpha subunit family pr... 260 6e-68
ref|YP_266475.1| sarcosine oxidase alpha chain [Candidatus Pe... 259 9e-68
ref|ZP_02149843.1| sarcosine oxidase, alpha subunit family pr... 259 1e-67
ref|ZP_01754466.1| sarcosine oxidase, alpha subunit family pr... 258 2e-67
ref|ZP_02297488.1| Uncharacterized NAD(FAD)-dependent dehydro... 257 5e-67
ref|ZP_01056477.1| sarcosine oxidase, alpha subunit family pr... 257 6e-67
gb|EDZ61064.1| sarcosine oxidase, alpha subunit [Candidatus P... 256 1e-66
ref|ZP_02141516.1| sarcosine oxidase, alpha subunit [Roseobac... 256 1e-66
ref|YP_001328944.1| sarcosine oxidase alpha subunit family pr... 255 2e-66
ref|NP_384189.1| putative sarcosine oxidase alpha subunit tra... 255 2e-66
ref|ZP_01054876.1| sarcosine oxidase, alpha subunit family pr... 254 6e-66
ref|YP_682013.1| sarcosine oxidase, alpha subunit [Roseobacte... 253 8e-66
gb|EDY87835.1| sarcosine oxidase, alpha subunit [Octadecabact... 250 6e-65
ref|YP_001533452.1| sarcosine oxidase alpha subunit family pr... 249 1e-64
ref|YP_002362923.1| sarcosine oxidase, alpha subunit family [... 249 1e-64
ref|ZP_02154807.1| sarcosine oxidase, alpha subunit family pr... 249 2e-64
ref|ZP_01439029.1| sarcosine oxidase, alpha subunit family pr... 247 6e-64
gb|EEB84114.1| sarcosine oxidase, alpha subunit family [Roseo... 247 7e-64
ref|ZP_01546296.1| sarcosine oxidase, alpha subunit [Stappia ... 245 2e-63
ref|NP_106776.1| sarcosine oxidase alpha subunit [Mesorhizobi... 245 2e-63
gb|ABZ05963.1| hypothetical protein ALOHA_HF4000001L24ctg1g32... 245 2e-63
gb|EEB80013.1| sarcosine oxidase, alpha subunit family [marin... 244 4e-63
ref|ZP_01002095.1| sarcosine oxidase, alpha subunit [Loktanel... 244 4e-63
ref|ZP_02168605.1| sarcosine oxidase alpha subunit [Hoeflea p... 244 4e-63
emb|CAD31286.1| PUTATIVE SARCOSINE OXIDASE ALPHA SUBUNIT PROT... 243 1e-62
gb|EDZ45988.1| sarcosine oxidase, alpha subunit family [Rhodo... 243 1e-62
ref|YP_001261448.1| glycine cleavage T protein (aminomethyl t... 242 2e-62
ref|NP_881143.1| sarcosine oxidase alpha subunit [Bordetella ... 242 2e-62
ref|NP_885663.1| sarcosine oxidase alpha subunit [Bordetella ... 241 4e-62
ref|YP_553716.1| sarcosine oxidase, alpha subunit, heterotetr... 241 4e-62
ref|YP_001062947.1| sarcosine oxidase, alpha subunit, heterot... 239 9e-62
ref|YP_001075894.1| sarcosine oxidase, alpha subunit [Burkhol... 239 9e-62
ref|YP_111378.1| sarcosine oxidase alpha subunit [Burkholderi... 239 9e-62
ref|ZP_02485951.1| sarcosine oxidase, alpha subunit [Burkhold... 239 9e-62
emb|CAD31640.1| PROBABLE SARCOSINE OXIDASE ALPHA SUBUNIT TRAN... 239 1e-61
ref|ZP_02407205.1| sarcosine oxidase, alpha subunit [Burkhold... 239 1e-61
ref|YP_771596.1| putative sarcosine oxidase alpha subunit [Rh... 239 1e-61
ref|ZP_03456801.1| sarcosine oxidase, alpha subunit [Burkhold... 239 1e-61
ref|ZP_02459962.1| putative sarcosine oxidase alpha subunit [... 239 2e-61
ref|YP_002277908.1| sarcosine oxidase, alpha subunit family [... 238 2e-61
ref|NP_356432.1| sarcosine oxidase alpha subunit [Agrobacteri... 238 4e-61
ref|ZP_00960211.1| sarcosine oxidase, alpha subunit family pr... 237 5e-61
ref|YP_743995.1| sarcosine oxidase alpha subunit [Granulibact... 237 6e-61
ref|YP_001985949.1| sarcosine oxidase protein, alpha subunit ... 237 7e-61
ref|YP_472588.1| sarcosine oxidase alpha subunit protein [Rhi... 236 9e-61
ref|ZP_00961139.1| sarcosine oxidase, alpha subunit family pr... 236 1e-60
ref|ZP_02146052.1| sarcosine oxidase, alpha subunit family pr... 236 1e-60
ref|ZP_02150359.1| sarcosine oxidase, alpha subunit family pr... 236 1e-60
ref|ZP_02366005.1| sarcosine oxidase, alpha subunit [Burkhold... 236 1e-60
gb|EDY77120.1| hypothetical protein OA307_2230 [Octadecabacte... 236 2e-60
ref|ZP_02358969.1| sarcosine oxidase, alpha subunit [Burkhold... 235 2e-60
ref|ZP_01036314.1| sarcosine oxidase, alpha subunit family pr... 235 2e-60
ref|ZP_02142150.1| sarcosine oxidase, alpha subunit family pr... 235 2e-60
ref|ZP_01033968.1| sarcosine oxidase, alpha subunit family pr... 234 3e-60
ref|NP_104289.1| sarcosine oxidase alpha subunit [Mesorhizobi... 234 3e-60
ref|ZP_00955469.1| sarcosine oxidase, alpha subunit family pr... 234 3e-60
gb|EEB84343.1| sarcosine oxidase, alpha subunit family [Roseo... 234 4e-60
ref|YP_001592099.1| sarcosine oxidase alpha subunit family pr... 234 4e-60
ref|NP_697265.1| sarcosine oxidase, alpha subunit [Brucella s... 234 4e-60
ref|ZP_00962904.1| sarcosine oxidase, alpha subunit family pr... 234 5e-60
ref|ZP_02141429.1| sarcosine oxidase, alpha subunit [Roseobac... 234 5e-60
ref|ZP_01879093.1| sarcosine oxidase, alpha subunit family pr... 234 5e-60
ref|ZP_02054752.1| sarcosine oxidase, alpha subunit family [M... 234 6e-60
ref|ZP_01443151.1| sarcosine oxidase alpha subunit [Roseovari... 233 7e-60
ref|YP_001109958.1| sarcosine oxidase alpha subunit family pr... 233 1e-59
ref|YP_001924278.1| sarcosine oxidase, alpha subunit family [... 233 1e-59
ref|NP_107653.1| sarcosine oxidase alpha subunit [Mesorhizobi... 233 1e-59
ref|YP_682094.1| sarcosine oxidase, alpha subunit [Roseobacte... 233 1e-59
ref|YP_439195.1| sarcosine oxidase, alpha subunit [Burkholder... 232 1e-59
ref|YP_002100542.1| hypothetical protein BDAG_03838 [Burkhold... 232 1e-59
gb|EDZ41083.1| sarcosine oxidase, alpha subunit family [Rhodo... 232 2e-59
ref|ZP_02188467.1| sarcosine oxidase, alpha subunit family pr... 232 2e-59
ref|ZP_02466697.1| sarcosine oxidase, alpha subunit [Burkhold... 232 2e-59
ref|YP_001258259.1| sarcosine oxidase alpha subunit [Brucella... 232 2e-59
ref|YP_001583482.1| sarcosine oxidase alpha subunit family pr... 232 2e-59
ref|YP_001533601.1| sarcosine oxidase alpha subunit family pr... 232 2e-59
ref|NP_519224.1| sarcosine oxidase subunit alpha [Ralstonia s... 231 3e-59
ref|ZP_01441755.1| sarcosine oxidase, alpha subunit family pr... 231 3e-59
gb|EEB72626.1| Glycine cleavage T-protein (aminomethyl transf... 231 4e-59
ref|NP_540637.1| sarcosine oxidase alpha subunit [Brucella me... 230 6e-59
ref|YP_001639122.1| sarcosine oxidase alpha subunit family pr... 230 6e-59
ref|YP_001238046.1| sarcosine oxidase, alpha subunit [Bradyrh... 230 7e-59
ref|YP_352745.1| putative sarcosine oxidase, alpha subunit [R... 230 7e-59
ref|YP_166827.1| sarcosine oxidase alpha subunit family prote... 230 8e-59
ref|YP_001368863.1| sarcosine oxidase alpha subunit family pr... 229 1e-58
gb|EEB71634.1| sarcosine oxidase, alpha subunit family [Ruege... 229 1e-58
ref|ZP_02059400.1| sarcosine oxidase, alpha subunit family [M... 229 1e-58
ref|NP_356575.1| sarcosine oxidase alpha subunit [Agrobacteri... 229 1e-58
ref|NP_521609.1| sarcosine oxidase subunit alpha [Ralstonia s... 229 2e-58
ref|ZP_01000874.1| sarcosine oxidase, alpha subunit family pr... 229 2e-58
ref|YP_001043229.1| sarcosine oxidase alpha subunit family pr... 228 2e-58
ref|YP_001641183.1| sarcosine oxidase alpha subunit family pr... 228 2e-58
ref|ZP_01879735.1| sarcosine oxidase, alpha subunit family pr... 228 3e-58
ref|YP_001115910.1| sarcosine oxidase alpha subunit family pr... 228 3e-58
ref|ZP_01223941.1| sarcosine oxidase, alpha subunit [marine g... 228 3e-58
ref|ZP_00631887.1| Sarcosine oxidase, alpha subunit, heterote... 228 4e-58
ref|YP_001168070.1| sarcosine oxidase alpha subunit family pr... 228 4e-58
ref|ZP_02376522.1| sarcosine oxidase, alpha subunit family pr... 228 4e-58
ref|YP_371246.1| sarcosine oxidase, alpha subunit, heterotetr... 227 6e-58
ref|ZP_01449178.1| sarcosine oxidase, alpha subunit family pr... 227 7e-58
gb|EEA96709.1| sarcosine oxidase, alpha subunit family [Pseud... 226 8e-58
ref|ZP_02118479.1| sarcosine oxidase alpha subunit [Methyloba... 226 9e-58
ref|YP_001926650.1| sarcosine oxidase, alpha subunit family [... 226 9e-58
ref|ZP_01447755.1| sarcosine oxidase, alpha subunit family pr... 226 9e-58
ref|YP_743901.1| sarcosine oxidase alpha subunit [Granulibact... 226 1e-57
ref|YP_623086.1| sarcosine oxidase alpha subunit family prote... 226 1e-57
ref|YP_001524410.1| sarcosine oxidase alpha subunit [Azorhizo... 226 1e-57
ref|YP_001778750.1| sarcosine oxidase alpha subunit family pr... 226 1e-57
ref|YP_001811729.1| sarcosine oxidase alpha subunit family pr... 225 2e-57
ref|YP_299020.1| sarcosine oxidase, alpha subunit, heterotetr... 225 2e-57
ref|YP_776428.1| sarcosine oxidase alpha subunit family prote... 225 2e-57
ref|ZP_02906041.1| sarcosine oxidase, alpha subunit family [B... 225 2e-57
ref|ZP_02888607.1| sarcosine oxidase, alpha subunit family [B... 225 2e-57
ref|YP_001859533.1| sarcosine oxidase alpha subunit family pr... 225 2e-57
ref|ZP_01444560.1| sarcosine oxidase, alpha subunit family pr... 224 3e-57
ref|YP_002095945.1| hypothetical protein BCPG_04816 [Burkhold... 224 5e-57
ref|YP_002234989.1| putative sarcosine oxidase alpha subunit ... 224 5e-57
ref|NP_356342.1| sarcosine oxidase alpha subunit [Agrobacteri... 224 6e-57
ref|YP_680436.1| sarcosine oxidase, alpha subunit [Roseobacte... 224 6e-57
ref|YP_001755163.1| sarcosine oxidase alpha subunit family pr... 223 8e-57
ref|ZP_02886813.1| sarcosine oxidase, alpha subunit family [B... 223 1e-56
ref|ZP_01157212.1| sarcosine oxidase, alpha subunit family pr... 223 1e-56
ref|YP_001415775.1| sarcosine oxidase alpha subunit family pr... 222 1e-56
ref|YP_001207761.1| sarcosine oxidase, alpha subunit [Bradyrh... 222 2e-56
ref|YP_001419578.1| sarcosine oxidase alpha subunit family pr... 222 2e-56
ref|YP_471083.1| sarcosine oxidase alpha subunit protein [Rhi... 221 4e-56
ref|ZP_02292175.1| sarcosine oxidase, alpha subunit family [R... 221 5e-56
ref|ZP_01751766.1| sarcosine oxidase, alpha subunit family pr... 221 5e-56
ref|YP_001888784.1| sarcosine oxidase, alpha subunit family [... 220 6e-56
ref|ZP_02370511.1| sarcosine oxidase, alpha subunit [Burkhold... 220 7e-56
ref|ZP_00997485.1| sarcosine oxidase, alpha subunit family pr... 220 9e-56
ref|YP_769698.1| putative sarcosine oxidase alpha subunit [Rh... 219 1e-55
ref|YP_001234130.1| sarcosine oxidase alpha subunit family pr... 219 1e-55
ref|ZP_01155888.1| hypothetical protein OG2516_13134 [Oceanic... 219 2e-55
ref|YP_511389.1| sarcosine oxidase alpha subunit family prote... 218 2e-55
ref|ZP_01078126.1| sarcosine oxidase, alpha subunit [Marinomo... 218 3e-55
ref|ZP_01226689.1| sarcosine oxidase, alpha subunit [Aurantim... 218 3e-55
ref|YP_002282849.1| sarcosine oxidase, alpha subunit family [... 218 5e-55
ref|ZP_02154237.1| sarcosine oxidase, alpha subunit family pr... 217 6e-55
ref|YP_167568.1| sarcosine oxidase alpha subunit family prote... 217 8e-55
ref|ZP_01740505.1| sarcosine oxidase, alpha subunit family pr... 216 8e-55
ref|YP_484764.1| sarcosine oxidase alpha subunit family prote... 216 1e-54
ref|ZP_02165983.1| putative sarcosine oxidase alpha subunit t... 216 1e-54
gb|EDY75574.1| sarcosine oxidase, alpha subunit family [Octad... 215 2e-54
gb|EDY90090.1| sarcosine oxidase, alpha subunit [Octadecabact... 215 3e-54
ref|ZP_01901256.1| sarcosine oxidase, alpha subunit family pr... 215 3e-54
ref|ZP_01748953.1| sarcosine oxidase, alpha subunit family pr... 215 3e-54
ref|ZP_01002738.1| sarcosine oxidase, alpha subunit family [L... 214 3e-54
ref|YP_001524086.1| sarcosine oxidase alpha subunit [Azorhizo... 214 3e-54
ref|YP_673158.1| sarcosine oxidase alpha subunit family prote... 214 4e-54
ref|ZP_02123205.1| sarcosine oxidase, alpha subunit family [M... 214 4e-54
ref|ZP_01440252.1| sarcosine oxidase, alpha subunit [Fulvimar... 214 4e-54
ref|ZP_01741311.1| sarcosine oxidase, alpha subunit family pr... 214 4e-54
ref|ZP_01155726.1| putative sarcosine oxidase, alpha subunit ... 213 1e-53
ref|ZP_01879218.1| sarcosine oxidase, alpha subunit family pr... 213 1e-53
ref|YP_610570.1| sarcosine oxidase (alpha subunit) oxidoreduc... 213 1e-53
ref|YP_262784.1| sarcosine oxidase, alpha subunit [Pseudomona... 213 1e-53
ref|YP_510128.1| sarcosine oxidase alpha subunit family prote... 212 2e-53
ref|YP_001683989.1| sarcosine oxidase alpha subunit family pr... 212 2e-53
ref|ZP_01004712.1| sarcosine oxidase, alpha subunit family [L... 211 3e-53
ref|NP_386959.1| putative sarcosine oxidase alpha subunit tra... 211 3e-53
ref|YP_001979985.1| sarcosine oxidase protein, alpha subunit ... 211 4e-53
ref|YP_001751727.1| sarcosine oxidase alpha subunit family pr... 211 5e-53
ref|NP_790307.1| sarcosine oxidase, alpha subunit [Pseudomona... 210 7e-53
ref|ZP_03395917.1| sarcosine oxidase, alpha subunit [Pseudomo... 210 7e-53
gb|EEB72508.1| sarcosine oxidase, alpha subunit family [Ruege... 210 9e-53
ref|YP_237780.1| sarcosine oxidase, alpha subunit, heterotetr... 210 9e-53
ref|YP_001666597.1| sarcosine oxidase alpha subunit family pr... 209 1e-52
ref|ZP_01012571.1| sarcosine oxidase, alpha subunit family pr... 209 2e-52
ref|YP_001265704.1| sarcosine oxidase alpha subunit family pr... 209 2e-52
ref|YP_276853.1| sarcosine oxidase, alpha subunit [Pseudomona... 209 2e-52
ref|ZP_03268938.1| sarcosine oxidase, alpha subunit, heterote... 209 2e-52
ref|NP_742492.1| sarcosine oxidase, alpha subunit family [Pse... 209 2e-52
ref|YP_350933.1| sarcosine oxidase, alpha subunit, heterotetr... 208 3e-52
ref|ZP_01616384.1| sarcosine oxidase, alpha subunit [marine g... 207 4e-52
ref|ZP_01745277.1| sarcosine oxidase, alpha subunit family pr... 207 4e-52
ref|ZP_02150577.1| sarcosine oxidase, alpha subunit family pr... 207 5e-52
ref|ZP_02147374.1| sarcosine oxidase, alpha subunit family pr... 207 5e-52
ref|YP_001328415.1| sarcosine oxidase alpha subunit family pr... 207 6e-52
ref|ZP_00630198.1| Sarcosine oxidase, alpha subunit, heterote... 207 8e-52
gb|EDY77809.1| sarcosine oxidase, alpha subunit family [Octad... 206 2e-51
ref|ZP_01038293.1| sarcosine oxidase, alpha subunit family pr... 206 2e-51
gb|ABZ06778.1| putative glycine cleavage T-protein (aminometh... 206 2e-51
ref|YP_001341606.1| sarcosine oxidase alpha subunit family pr... 205 2e-51
gb|EDY88256.1| sarcosine oxidase, alpha subunit [Octadecabact... 205 2e-51
ref|ZP_01057070.1| sarcosine oxidase, alpha subunit family pr... 204 4e-51
ref|YP_612966.1| sarcosine oxidase alpha subunit family prote... 203 1e-50
ref|ZP_00961076.1| sarcosine oxidase, alpha subunit family pr... 201 3e-50
ref|ZP_02186998.1| sarcosine oxidase alpha subunit [alpha pro... 201 3e-50
ref|ZP_01902467.1| sarcosine oxidase, alpha subunit family pr... 201 6e-50
ref|YP_573056.1| sarcosine oxidase alpha subunit family prote... 200 6e-50
ref|ZP_01753658.1| sarcosine oxidase alpha subunit [Roseobact... 199 1e-49
ref|NP_102901.1| sarcosine oxidase alpha subunit [Mesorhizobi... 199 2e-49
gb|EDZ40585.1| Glycine cleavage T-protein (aminomethyl transf... 198 3e-49
ref|YP_001189608.1| sarcosine oxidase alpha subunit family pr... 196 1e-48
ref|ZP_01754731.1| sarcosine oxidase, alpha subunit family pr... 194 7e-48
ref|YP_002083910.1| sarcosine oxidase alpha subunit [Pseudomo... 193 1e-47
ref|NP_254105.1| sarcosine oxidase alpha subunit [Pseudomonas... 193 1e-47
ref|YP_270692.1| sarcosine oxidase, alpha subunit [Colwellia ... 192 1e-47
gb|EDZ47679.1| sarcosine oxidase, alpha subunit family [Rhodo... 192 2e-47
ref|YP_001351517.1| sarcosine oxidase alpha subunit [Pseudomo... 191 3e-47
ref|YP_002088982.1| sarcosine oxidase alpha subunit [Pseudomo... 191 3e-47
ref|ZP_01368438.1| hypothetical protein PaerPA_01005598 [Pseu... 191 4e-47
ref|YP_998771.1| sarcosine oxidase alpha subunit family prote... 190 7e-47
ref|ZP_01737315.1| sarcosine oxidase alpha subunit [Marinobac... 182 1e-44
ref|NP_105928.1| sarcosine oxidase alpha subunit [Mesorhizobi... 173 1e-41
ref|YP_275145.1| sarcosine oxidase, alpha subunit family prot... 155 2e-36
ref|ZP_01224997.1| Aminomethyltransferase [marine gamma prote... 154 4e-36
ref|YP_235303.1| aminomethyltransferase [Pseudomonas syringae... 154 4e-36
ref|ZP_03399343.1| sarcosine oxidase, alpha subunit [Pseudomo... 153 1e-35
gb|EEB79908.1| tRNA uridine 5-carboxymethylaminomethyl modifi... 152 2e-35
ref|NP_792264.1| sarcosine oxidase, alpha subunit [Pseudomona... 151 4e-35
ref|ZP_03277559.1| glycine cleavage T protein (aminomethyl tr... 150 9e-35
ref|YP_001188956.1| aminomethyltransferase [Pseudomonas mendo... 150 9e-35
ref|YP_047137.1| sarcosine oxidase (alpha subunit) oxidoreduc... 140 1e-31
ref|YP_645232.1| aminomethyltransferase [Rubrobacter xylanoph... 134 5e-30
ref|YP_391618.1| aminomethyltransferase [Thiomicrospira cruno... 134 5e-30
ref|YP_001668339.1| glycine cleavage T protein (aminomethyl t... 133 1e-29
ref|YP_001106686.1| sarcosine oxidase (alpha subunit) oxidore... 132 2e-29
ref|ZP_01075747.1| sarcosine oxidase, alpha subunit family pr... 130 1e-28
ref|ZP_01626855.1| sarcosine oxidase, alpha subunit family pr... 125 4e-27
ref|YP_544565.1| aminomethyltransferase [Methylobacillus flag... 121 5e-26
ref|YP_001862026.1| glycine cleavage T protein (aminomethyl t... 113 1e-23
ref|ZP_00439391.1| COG0446: Uncharacterized NAD(FAD)-dependen... 107 8e-22
ref|YP_338415.1| sarcosine oxidase, alpha subunit, truncation... 107 1e-21
ref|NP_069111.1| sarcosine oxidase, subunit alpha (soxA) [Arc... 95.5 3e-18
ref|YP_949507.1| sarcosine oxidase alpha subunit [Arthrobacte... 86.3 2e-15
ref|YP_833177.1| sarcosine oxidase alpha subunit family prote... 85.9 3e-15
ref|ZP_02837443.1| sarcosine oxidase, alpha subunit family [A... 84.7 5e-15
ref|YP_001107723.1| sarcosine oxidase (alpha subunit) oxidore... 84.3 7e-15
gb|AAN65213.1|AF329398_3 sarcosine oxidase alpha subunit [Str... 83.6 1e-14
ref|YP_701792.1| sarcosine oxidase [Rhodococcus sp. RHA1] >gb... 80.1 1e-13
ref|YP_002206035.1| sarcosine oxidase alpha subunit [Streptom... 79.3 2e-13
pdb|2GAG|A Chain A, Heteroteterameric Sarcosine: Structure Of... 77.8 6e-13
pdb|2GAH|A Chain A, Heterotetrameric Sarcosine: Structure Of ... 77.4 8e-13
dbj|BAD97818.1| subunit alpha of sarocosine oxidase [Coryneba... 77.4 9e-13
pdb|1VRQ|A Chain A, Crystal Structure Of Heterotetrameric Sar... 77.4 9e-13
gb|AAC62216.1| sarcosine oxidase subunit A [Sinorhizobium mel... 76.6 1e-12
sp|Q46337.1|SOXA_CORS1 RecName: Full=Sarcosine oxidase subuni... 67.4 9e-10
gb|AAK16489.1|AF329478_4 sarcosine oxidase subunit A [Arthrob... 66.6 1e-09
ref|ZP_00379371.1| COG0446: Uncharacterized NAD(FAD)-dependen... 66.6 1e-09
ref|ZP_01467622.1| Dye-L-proDH alpha [Stigmatella aurantiaca ... 63.9 9e-09
ref|ZP_01911446.1| Ferredoxin / FAD-dependent pyridine nucleo... 61.6 5e-08
ref|YP_001855999.1| sarcosine oxidase alpha subunit [Kocuria ... 60.5 1e-07
ref|YP_632142.1| pyridine nucleotide-disulphide oxidoreductas... 57.8 6e-07
ref|ZP_02323957.1| FAD-dependent pyridine nucleotide-disulphi... 54.7 5e-06
ref|YP_002133740.1| FAD-dependent pyridine nucleotide-disulph... 54.7 5e-06
ref|YP_465685.1| ferredoxin / FAD-dependent pyridine nucleoti... 53.9 1e-05
ref|YP_001378580.1| FAD-dependent pyridine nucleotide-disulph... 53.1 2e-05
ref|YP_182532.1| proline dehydrogenase, alpha subunit [Thermo... 52.4 3e-05
dbj|BAD13510.1| Dye-L-proDH alpha [Thermococcus profundus] 50.4 1e-04
ref|NP_126003.1| sarcosine oxidase, subunit alpha [Pyrococcus... 50.4 1e-04
gb|EDY40709.1| tRNA uridine 5-carboxymethylaminomethyl modifi... 49.3 2e-04
ref|ZP_02419852.1| hypothetical protein ANACAC_02446 [Anaeros... 49.3 2e-04
ref|ZP_03234904.1| putative sarcosine oxidase, alpha subunit ... 48.9 3e-04
ref|YP_895341.1| sarcosine oxidase, alpha subunit [Bacillus t... 48.9 4e-04
ref|YP_084152.1| sarcosine oxidase, alpha subunit [Bacillus c... 48.5 4e-04
ref|ZP_03329249.1| ferredoxin [Thermotogales bacterium TBF 19... 48.5 5e-04
ref|ZP_00238363.1| sarcosine oxidase, subunit alpha [Bacillus... 48.1 5e-04
ref|ZP_03232892.1| sarcosine oxidase alpha subunit [Bacillus ... 48.1 6e-04
ref|YP_028906.1| sarcosine oxidase alpha subunit, N-terminal ... 47.4 9e-04
ref|NP_832589.1| sarcosine oxidase alpha subunit [Bacillus ce... 47.4 0.001
ref|ZP_01666854.1| proline dehydrogenase, alpha subunit [Ther... 47.4 0.001
ref|NP_579524.1| sarcosine oxidase subunit alpha [Pyrococcus ... 47.0 0.001
ref|ZP_03295986.1| hypothetical protein COLINT_01703 [Collins... 46.6 0.002
ref|YP_001749019.1| fumarate reductase/succinate dehydrogenas... 46.6 0.002
gb|EDY35306.1| tRNA uridine 5-carboxymethylaminomethyl modifi... 45.8 0.003
gb|EDY35216.1| tRNA uridine 5-carboxymethylaminomethyl modifi... 45.8 0.003
ref|YP_001645471.1| sarcosine oxidase, alpha subunit [Bacillu... 45.4 0.003
ref|ZP_02327522.1| hypothetical protein Plarl_07710 [Paenibac... 45.4 0.003
ref|ZP_01550200.1| FAD dependent oxidoreductase [Stappia aggr... 44.7 0.007
ref|ZP_00744209.1| Sarcosine oxidase alpha subunit [Bacillus ... 44.7 0.007
gb|EEA99631.1| amine oxidase, flavin-containing domain-contai... 44.3 0.007
ref|YP_001136924.1| hypothetical protein cgR_0061 [Corynebact... 44.3 0.007
ref|ZP_01443595.1| putative dehydrogenase [Roseovarius sp. HT... 44.3 0.008
ref|YP_883390.1| 3-ketosteroid-delta-1-dehydrogenase [Mycobac... 43.9 0.010
ref|NP_744749.1| fumarate reductase/succinate dehydrogenase f... 43.9 0.012
ref|ZP_02043101.1| hypothetical protein RUMGNA_03911 [Ruminoc... 43.5 0.012
ref|YP_001718218.1| hypothetical protein Daud_2097 [Candidatu... 43.5 0.013
ref|NP_782611.1| dihydrolipoamide dehydrogenase [Clostridium ... 43.1 0.016
gb|EEB73433.1| proline dehydrogenase, alpha subunit [Thermoco... 43.1 0.017
emb|CAO80836.1| putative dye-linked L-proline dehydrogenase (... 43.1 0.017
ref|YP_001863451.1| fumarate reductase/succinate dehydrogenas... 43.1 0.018
ref|YP_001613094.1| putative NADH dehydrogenase [Sorangium ce... 42.7 0.023
ref|ZP_03132998.1| putative secreted protein-putative xanthan... 42.7 0.024
ref|YP_001240605.1| putative 3-oxosteroid 1-dehydrogenase [Br... 42.7 0.024
ref|ZP_03127008.1| conserved hypothetical protein [Chthonioba... 42.7 0.025
ref|YP_001417593.1| putative succinate dehydrogenase [Xanthob... 42.7 0.025
ref|YP_001268432.1| fumarate reductase/succinate dehydrogenas... 42.7 0.025
ref|ZP_01696380.1| FAD-dependent pyridine nucleotide-disulphi... 42.7 0.027
ref|ZP_02432477.1| hypothetical protein CLOSCI_02724 [Clostri... 42.7 0.027
dbj|BAD77802.1| dye-linked L-proline dehydrogenase alpha2 sub... 42.4 0.030
ref|NP_143587.1| D-nopaline dehydrogenase [Pyrococcus horikos... 42.4 0.030
ref|YP_259557.1| putative FAD-binding dehydrogenase [Pseudomo... 42.4 0.032
ref|ZP_01129418.1| putative oxidoreductase [marine actinobact... 42.0 0.037
ref|YP_825257.1| hypothetical protein Acid_4005 [Solibacter u... 42.0 0.042
ref|YP_982927.1| putative FAD-binding dehydrogenase [Polaromo... 42.0 0.043
ref|XP_794903.2| PREDICTED: similar to amine oxidase (flavin-... 42.0 0.045
ref|YP_982933.1| putative succinate dehydrogenase [Polaromona... 41.6 0.051
ref|YP_001862205.1| fumarate reductase/succinate dehydrogenas... 41.6 0.053
ref|ZP_03297321.1| hypothetical protein COLSTE_01215 [Collins... 41.6 0.056
ref|YP_955987.1| 3-ketosteroid-delta-1-dehydrogenase [Mycobac... 41.6 0.059
ref|YP_002307650.1| proline dehydrogenase, alpha subunit [The... 41.2 0.060
ref|ZP_01167758.1| probable pyridine nucleotide-disulphide ox... 41.2 0.061
ref|ZP_01968924.1| hypothetical protein RUMTOR_02505 [Ruminoc... 41.2 0.062
ref|YP_001114495.1| hypothetical protein Dred_3168 [Desulfoto... 41.2 0.070
ref|ZP_02207022.1| hypothetical protein COPEUT_01824 [Coproco... 41.2 0.073
ref|ZP_01723678.1| Sarcosine oxidase alpha subunit [Bacillus ... 41.2 0.077
ref|ZP_03263743.1| fumarate reductase/succinate dehydrogenase... 40.8 0.080
ref|YP_001132817.1| 3-ketosteroid-delta-1-dehydrogenase [Myco... 40.8 0.080
ref|ZP_02011382.1| FAD dependent oxidoreductase [Opitutaceae ... 40.8 0.088
ref|ZP_01771821.1| Hypothetical protein COLAER_00810 [Collins... 40.8 0.090
ref|YP_001581965.1| geranylgeranyl reductase [Nitrosopumilus ... 40.8 0.091
ref|YP_001698497.1| hypothetical protein Bsph_2834 [Lysinibac... 40.8 0.094
ref|YP_705544.1| putrescine oxidase [Rhodococcus sp. RHA1] >g... 40.8 0.10
ref|YP_520533.1| hypothetical protein DSY4300 [Desulfitobacte... 40.4 0.10
ref|ZP_03270170.1| fumarate reductase/succinate dehydrogenase... 40.4 0.11
ref|ZP_01372601.1| NADH:flavin oxidoreductase/NADH oxidase [D... 40.4 0.11
ref|NP_864155.1| hypothetical protein RB941 [Rhodopirellula b... 40.4 0.11
ref|ZP_01725266.1| hypothetical protein BB14905_15385 [Bacill... 40.4 0.11
ref|ZP_01735924.1| soluble pyridine nucleotide transhydrogena... 40.4 0.11
ref|ZP_03293066.1| hypothetical protein CLOHIR_01014 [Clostri... 40.4 0.12
ref|ZP_02329462.1| hypothetical protein Plarl_17756 [Paenibac... 40.4 0.13
ref|NP_266414.1| hypothetical protein L56208 [Lactococcus lac... 40.4 0.13
ref|YP_945873.1| putrescine oxidase [Arthrobacter aurescens T... 40.4 0.13
ref|YP_001031621.1| putative flavoprotein [Lactococcus lactis... 40.0 0.14
ref|NP_929497.1| hypothetical protein plu2240 [Photorhabdus l... 40.0 0.14
ref|YP_808296.1| flavoprotein [Lactococcus lactis subsp. crem... 40.0 0.14
gb|EEB75826.1| hypothetical protein CDSM653_205 [Carboxydibra... 40.0 0.15
gb|EEB75816.1| hypothetical protein CDSM653_195 [Carboxydibra... 40.0 0.15
ref|YP_001746779.1| invasion protein IbeA [Escherichia coli S... 40.0 0.16
ref|YP_543969.1| invasion protein IbeA [Escherichia coli UTI8... 40.0 0.16
emb|CAH55802.1| invasion protein IbeA [Escherichia coli] 40.0 0.16
gb|AAF98391.2| invasion protein IbeA [Escherichia coli] 40.0 0.16
ref|NP_773445.1| putative dehydrogenase [Bradyrhizobium japon... 40.0 0.17
sp|Q04616.3|3O1D_RHOOP RecName: Full=3-oxosteroid 1-dehydroge... 39.7 0.19
ref|YP_001408579.1| Tat pathway signal sequence domain-contai... 39.7 0.22
ref|ZP_02013163.1| FAD dependent oxidoreductase [Opitutaceae ... 39.3 0.23
ref|YP_350639.1| putative FAD-binding dehydrogenase [Pseudomo... 39.3 0.24
ref|ZP_02190258.1| putative dehydrogenase [alpha proteobacter... 39.3 0.24
emb|CAQ90090.1| conserved hypothetical protein; putative expo... 39.3 0.25
ref|ZP_03487970.1| hypothetical protein EUBIFOR_00535 [Eubact... 39.3 0.25
gb|AAF19054.1|AF096929_2 3-ketosteroid dehydrogenase [Rhodoco... 39.3 0.27
ref|ZP_00416411.1| conserved hypothetical protein [Azotobacte... 39.3 0.28
ref|ZP_03266240.1| fumarate reductase/succinate dehydrogenase... 39.3 0.29
ref|ZP_01855464.1| probable xanthan lyase [Planctomyces maris... 39.3 0.29
ref|YP_002240100.1| FAD-dependent oxidoreductase [Klebsiella ... 39.3 0.29
ref|ZP_02928926.1| probable xanthan lyase [Verrucomicrobium s... 39.3 0.29
ref|ZP_03124937.1| enoate reductase [Clostridium difficile QC... 38.9 0.34
ref|ZP_02419851.1| hypothetical protein ANACAC_02445 [Anaeros... 38.9 0.35
ref|YP_982015.1| dihydrolipoamide dehydrogenase [Polaromonas ... 38.9 0.35
sp|P35903.1|ACHC_ACHFU RecName: Full=Achacin; Flags: Precurso... 38.9 0.36
gb|ABY74497.1| putrescine oxidase [Rhodococcus erythropolis] 38.9 0.38
ref|XP_001009119.1| amine oxidase, flavin-containing family p... 38.9 0.38
ref|NP_891223.1| hypothetical protein BB4691 [Bordetella bron... 38.5 0.40
ref|YP_001695873.1| sarcosine oxidase alpha subunit [Lysiniba... 38.5 0.41
ref|YP_001334066.1| hypothetical protein KPN_00384 [Klebsiell... 38.5 0.43
ref|ZP_01772393.1| Hypothetical protein COLAER_01399 [Collins... 38.5 0.45
ref|XP_381934.1| hypothetical protein FG01758.1 [Gibberella z... 38.5 0.45
ref|YP_176361.1| flavoprotein [Bacillus clausii KSM-K16] >dbj... 38.5 0.46
ref|YP_002093785.1| Pyruvate dehydrogenase complex, dehydroge... 38.5 0.47
ref|ZP_01313445.1| Succinate dehydrogenase [Desulfuromonas ac... 38.5 0.48
ref|YP_001626631.1| putrescine oxidase [Renibacterium salmoni... 38.5 0.49
ref|YP_002031424.1| pyruvate dehydrogenase complex E3 compone... 38.5 0.49
ref|ZP_02388030.1| pyruvate dehydrogenase, E3 component, dihy... 38.5 0.50
ref|ZP_02374190.1| pyruvate dehydrogenase, E3 component, dihy... 38.5 0.50
ref|YP_001758239.1| dihydrolipoamide dehydrogenase [Methyloba... 38.5 0.50
ref|YP_369679.1| dihydrolipoamide dehydrogenase [Burkholderia... 38.5 0.51
ref|YP_625780.1| dihydrolipoamide dehydrogenase [Burkholderia... 38.5 0.51
ref|YP_002231334.1| putative dihydrolipoamide dehydrogenase [... 38.1 0.51
ref|ZP_01764402.1| pyruvate dehydrogenase complex E3 componen... 38.1 0.51
ref|ZP_02490652.1| pyruvate dehydrogenase complex E3 componen... 38.1 0.51
ref|YP_002097966.1| Pyruvate/2-oxoglutarate dehydrogenase com... 38.1 0.51
ref|YP_442396.1| pyruvate dehydrogenase, E3 component, dihydr... 38.1 0.51
ref|YP_001066918.1| pyruvate dehydrogenase complex E3 compone... 38.1 0.51
ref|YP_001059636.1| pyruvate dehydrogenase complex E3 compone... 38.1 0.51
ref|YP_108895.1| putative dihydrolipoamide dehydrogenase [Bur... 38.1 0.51
ref|YP_001765436.1| dihydrolipoamide dehydrogenase [Burkholde... 38.1 0.52
ref|YP_103339.1| pyruvate dehydrogenase, E3 component, dihydr... 38.1 0.53
ref|ZP_02403599.1| pyruvate dehydrogenase complex E3 componen... 38.1 0.53
ref|YP_001120052.1| dihydrolipoamide dehydrogenase [Burkholde... 38.1 0.53
ref|ZP_02094457.1| hypothetical protein PEPMIC_01223 [Peptost... 38.1 0.54
ref|ZP_02207021.1| hypothetical protein COPEUT_01823 [Coproco... 38.1 0.54
ref|YP_001790680.1| dihydrolipoamide dehydrogenase [Leptothri... 38.1 0.56
ref|ZP_02122076.1| fumarate reductase/succinate dehydrogenase... 38.1 0.56
ref|YP_001117832.1| dihydrolipoamide dehydrogenase [Burkholde... 38.1 0.56
ref|XP_001367001.1| PREDICTED: similar to amine oxidase (flav... 38.1 0.59
ref|ZP_03268782.1| fumarate reductase/succinate dehydrogenase... 38.1 0.60
ref|ZP_02170720.1| geranylgeranyl reductase [Bacillus selenit... 38.1 0.60
ref|ZP_01756481.1| soluble pyridine nucleotide transhydrogena... 38.1 0.60
ref|YP_001512146.1| fumarate reductase/succinate dehydrogenas... 38.1 0.61
ref|ZP_02429421.1| hypothetical protein CLORAM_02844 [Clostri... 38.1 0.64
ref|ZP_02079187.1| hypothetical protein CLOLEP_00625 [Clostri... 38.1 0.64
ref|YP_559464.1| dihydrolipoamide dehydrogenase [Burkholderia... 38.1 0.66
ref|ZP_02887334.1| dihydrolipoamide dehydrogenase [Burkholder... 37.7 0.67
ref|XP_001367053.1| PREDICTED: similar to amine oxidase (flav... 37.7 0.68
gb|EDX89347.1| FAD dependent oxidoreductase, putative [Alcani... 37.7 0.68
ref|ZP_02885878.1| dihydrolipoamide dehydrogenase [Burkholder... 37.7 0.69
ref|YP_549483.1| dihydrolipoamide dehydrogenase [Polaromonas ... 37.7 0.70
ref|ZP_01090336.1| hypothetical protein DSM3645_21392 [Blasto... 37.7 0.71
ref|NP_770309.1| putative succinate dehydrogenase [Bradyrhizo... 37.7 0.72
ref|YP_576242.1| hypothetical protein Nham_0924 [Nitrobacter ... 37.7 0.73
ref|YP_001918886.1| dihydrolipoamide dehydrogenase [Natranaer... 37.7 0.74
ref|YP_065982.1| opine/octopine dehydrogenase, subunit A [Des... 37.7 0.74
ref|ZP_02327052.1| hypothetical protein Plarl_05305 [Paenibac... 37.7 0.76
ref|YP_001328600.1| dihydrolipoamide dehydrogenase [Sinorhizo... 37.7 0.78
ref|ZP_03488969.1| hypothetical protein EUBIFOR_01555 [Eubact... 37.7 0.82
ref|ZP_01189900.1| Dihydrolipoamide dehydrogenase [Halothermo... 37.7 0.82
ref|YP_001192073.1| pyridine nucleotide-disulphide oxidoreduc... 37.7 0.83
ref|NP_387154.1| dihydrolipoamide dehydrogenase [Sinorhizobiu... 37.7 0.83
ref|YP_575887.1| dihydrolipoamide dehydrogenase [Nitrobacter ... 37.7 0.84
ref|YP_624765.1| dihydrolipoamide dehydrogenase [Burkholderia... 37.7 0.84
ref|ZP_02992268.1| HI0933 family protein [Exiguobacterium sp.... 37.7 0.86
ref|ZP_01894333.1| Pyruvate/2-oxoglutarate dehydrogenase comp... 37.4 0.91
ref|ZP_02887350.1| FAD-dependent pyridine nucleotide-disulphi... 37.4 0.93
ref|ZP_00418574.1| putative 3-oxosteroid 1-dehydrogenase [Azo... 37.4 0.93
ref|YP_001541235.1| ribulose-1,5-biphosphate synthetase [Cald... 37.4 0.95
gb|AAD30450.1|AF121894_1 lipoamide dehydrogenase [Ascaris suum] 37.4 0.98
ref|ZP_02861345.1| hypothetical protein ANASTE_00546 [Anaerof... 37.4 0.98
ref|NP_816133.1| UDP-galactopyranose mutase [Enterococcus fae... 37.4 0.98
ref|YP_001021317.1| dihydrolipoamide dehydrogenase [Methylibi... 37.4 0.99
ref|YP_153586.1| glutathione reductase [Anaplasma marginale s... 37.4 0.99
ref|NP_693222.1| hypothetical protein OB2301 [Oceanobacillus ... 37.4 0.99
ref|XP_002128583.1| PREDICTED: similar to dihydrolipoamide de... 37.4 1.0
ref|NP_377756.1| hypothetical protein ST1775 [Sulfolobus toko... 37.4 1.1
ref|YP_925259.1| fumarate reductase/succinate dehydrogenase f... 37.4 1.1
ref|YP_743364.1| NADPH-glutathione reductase [Alkalilimnicola... 37.4 1.1
ref|YP_001860180.1| fumarate reductase/succinate dehydrogenas... 37.4 1.1
ref|ZP_01728707.1| hypothetical protein CY0110_29874 [Cyanoth... 37.4 1.1
ref|ZP_01802594.1| hypothetical protein CdifQ_04003580 [Clost... 37.4 1.1
ref|YP_925245.1| fumarate reductase/succinate dehydrogenase f... 37.0 1.1
ref|ZP_01733462.1| putative transmembrane CBS domain transpor... 37.0 1.2
ref|YP_001808742.1| dihydrolipoamide dehydrogenase [Burkholde... 37.0 1.2
ref|YP_778381.1| dihydrolipoamide dehydrogenase [Burkholderia... 37.0 1.2
ref|ZP_02894230.1| dihydrolipoamide dehydrogenase [Burkholder... 37.0 1.3
ref|YP_611681.1| soluble pyridine nucleotide transhydrogenase... 37.0 1.3
ref|ZP_02929239.1| putative secreted protein, putative xantha... 37.0 1.3
ref|ZP_02693005.1| hypothetical protein Epulo_07663 [Epulopis... 37.0 1.3
ref|ZP_02909094.1| dihydrolipoamide dehydrogenase [Burkholder... 37.0 1.3
ref|YP_829541.1| putrescine oxidase [Arthrobacter sp. FB24] >... 37.0 1.3
emb|CAQ42594.1| Flavin containing amine oxidoreductase,putati... 37.0 1.3
ref|ZP_02363411.1| pyruvate dehydrogenase complex E3 componen... 37.0 1.3
ref|ZP_02356284.1| dihydrolipoamide dehydrogenase [Burkholder... 37.0 1.4
gb|AAN32984.1| BarJ [Lyngbya majuscula] 37.0 1.4
ref|ZP_02379120.1| dihydrolipoamide dehydrogenase [Burkholder... 37.0 1.4
ref|NP_377847.1| lipoamide dehydrogenase [Sulfolobus tokodaii... 37.0 1.4
ref|YP_774062.1| dihydrolipoamide dehydrogenase [Burkholderia... 37.0 1.4
ref|XP_644354.1| hypothetical protein [Dictyostelium discoide... 37.0 1.4
ref|ZP_02329931.1| hypothetical protein Plarl_20162 [Paenibac... 37.0 1.4
ref|YP_001323884.1| geranylgeranyl reductase [Methanococcus v... 37.0 1.4
ref|YP_001896208.1| dihydrolipoamide dehydrogenase [Burkholde... 37.0 1.4
ref|YP_458491.1| 2-oxoglutarate dehydrogenase, E3 component, ... 37.0 1.4
ref|YP_001857696.1| dihydrolipoamide dehydrogenase [Burkholde... 37.0 1.4
ref|XP_782447.2| PREDICTED: similar to Dihydrolipoyl dehydrog... 37.0 1.4
ref|YP_391272.1| dihydrolipoamide dehydrogenase [Thiomicrospi... 36.6 1.5
ref|YP_001313783.1| BFD/(2Fe-2S)-binding domain-containing pr... 36.6 1.6
ref|ZP_02062853.1| dihydrolipoyl dehydrogenase [Rickettsiella... 36.6 1.7
ref|YP_036862.1| dihydrolipoamide dehydrogenase [Bacillus thu... 36.6 1.7
ref|YP_959191.1| soluble pyridine nucleotide transhydrogenase... 36.6 1.7
ref|YP_001395159.1| BfmBC [Clostridium kluyveri DSM 555] >gb|... 36.6 1.7
ref|ZP_03146647.1| FAD dependent oxidoreductase [Geobacillus ... 36.6 1.7
ref|YP_364325.1| putative pyridine nucleotide-disulphide oxid... 36.6 1.8
ref|ZP_02464060.1| dihydrolipoamide dehydrogenase [Burkholder... 36.6 1.8
ref|YP_001579322.1| dihydrolipoamide dehydrogenase [Burkholde... 36.6 1.8
ref|YP_572935.1| 2,4-dienoyl-CoA reductase [Chromohalobacter ... 36.6 1.8
ref|XP_002068171.1| GK12667 [Drosophila willistoni] >gb|EDW79... 36.6 1.8
ref|ZP_02012504.1| invasion protein IbeA [Opitutaceae bacteri... 36.6 1.9
ref|YP_198391.1| dihydrolipoamide dehydrogenase E3 component ... 36.6 1.9
ref|YP_148689.1| hypothetical protein GK2836 [Geobacillus kau... 36.2 2.0
ref|YP_456335.1| dihydrolipoamide dehydrogenase [Aster yellow... 36.2 2.0
ref|ZP_01811232.1| amine oxidase [candidate division TM7 geno... 36.2 2.0
ref|ZP_01447460.1| hypothetical protein OM2255_09786 [alpha p... 36.2 2.0
gb|EEB79175.1| oxidoreductase, FAD/FMN-binding family [marine... 36.2 2.1
ref|ZP_02931089.1| probable xanthan lyase [Verrucomicrobium s... 36.2 2.1
ref|ZP_02168476.1| dihydrolipoamide dehydrogenase [Hoeflea ph... 36.2 2.1
ref|ZP_01727730.1| Adrenodoxin reductase [Cyanothece sp. CCY0... 36.2 2.1
ref|XP_644355.1| hypothetical protein [Dictyostelium discoide... 36.2 2.1
ref|ZP_03270752.1| dihydrolipoamide dehydrogenase [Burkholder... 36.2 2.1
ref|XP_001956477.1| GF24574 [Drosophila ananassae] >gb|EDV392... 36.2 2.1
ref|YP_001667931.1| soluble pyridine nucleotide transhydrogen... 36.2 2.2
ref|YP_001321631.1| fumarate reductase/succinate dehydrogenas... 36.2 2.2
ref|NP_744300.1| soluble pyridine nucleotide transhydrogenase... 36.2 2.2
ref|NP_578974.1| d-nopaline dehydrogenase [Pyrococcus furiosu... 36.2 2.3
ref|ZP_01575419.1| HI0933-like protein [Clostridium celluloly... 36.2 2.3
ref|XP_001630345.1| predicted protein [Nematostella vectensis... 36.2 2.3
ref|YP_886656.1| geranylgeranyl reductase [Mycobacterium smeg... 36.2 2.4
ref|ZP_01901268.1| soluble pyridine nucleotide transhydrogena... 36.2 2.4
ref|YP_002351977.1| FAD-dependent pyridine nucleotide-disulph... 36.2 2.4
>gb|EDZ60822.1| sarcosine oxidase alpha subunit [Candidatus Pelagibacter sp.
HTCC7211]
Length=998
Score = 429 bits (1102), Expect = 1e-118, Method: Compositional matrix adjust.
Identities = 204/233 (87%), Positives = 214/233 (91%), Gaps = 0/233 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
MTQSFRL GLINRD+K+SFKFN Y+GYEGDTLASALIANGVHL+GRSFKYHRPRGF
Sbjct 1 MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLVGRSFKYHRPRGF 60
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF
Sbjct 61 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 120
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG AS KHDKERYEHKYEYCDLLI GS PS
Sbjct 121 LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASTKHDKERYEHKYEYCDLLIAGSGPS 180
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK 240
GLASAY+AAKNGA+VILAEDKSRFGGTLLTSDVNIGNQ+ K + LK+
Sbjct 181 GLASAYAAAKNGARVILAEDKSRFGGTLLTSDVNIGNQTGKEWADGIISELKE 233
>ref|YP_266690.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1062]
gb|AAZ22086.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1062]
Length=998
GENE ID: 3517319 soxA2 | sarcosine oxidase alpha chain
[Candidatus Pelagibacter ubique HTCC1062] (10 or fewer PubMed links)
Score = 423 bits (1088), Expect = 6e-117, Method: Compositional matrix adjust.
Identities = 199/233 (85%), Positives = 213/233 (91%), Gaps = 0/233 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
MTQ++RL+ GLINRDKKISFKFNG YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct 1 MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF 60
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct 61 FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF 120
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct 121 LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS 180
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK 240
GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K + LK+
Sbjct 181 GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGKEWAENIISELKE 233
>ref|ZP_01264926.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1002]
gb|EAS85413.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1002]
Length=998
Score = 423 bits (1087), Expect = 6e-117, Method: Compositional matrix adjust.
Identities = 199/233 (85%), Positives = 213/233 (91%), Gaps = 0/233 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
MTQ++RL+ GLINRDKKISFKFNG YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct 1 MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF 60
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct 61 FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF 120
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct 121 LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS 180
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK 240
GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K + LK+
Sbjct 181 GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGKEWAENIISELKE 233
>gb|ABZ06303.1| putative glycine cleavage T-protein (aminomethyl transferase)
[uncultured marine microorganism HF4000_008G09]
Length=998
Score = 370 bits (950), Expect = 5e-101, Method: Compositional matrix adjust.
Identities = 175/233 (75%), Positives = 193/233 (82%), Gaps = 0/233 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
MTQ FRL GL+NR+K ISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct 1 MTQKFRLPNLGLVNRNKTISFHFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF 60
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
FGAGVDEP A VQLY +TEPN+ ATE EL EGL AKS NCWPSV FDVGAINNF F
Sbjct 61 FGAGVDEPNAKVQLYEGDKTEPNVNATELELVEGLVAKSQNCWPSVEFDVGAINNFFSRF 120
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG AS K D RYEHKYEYCD+L+ GS PS
Sbjct 121 FPAGFYYKTFMWPKSFWYKVYEPLIRKAAGLGVASPKPDTSRYEHKYEYCDVLVVGSGPS 180
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK 240
GL+SAY+AAKNGA+VILAEDK RFGG+LLT DVNIGNQ+ K + + LK+
Sbjct 181 GLSSAYAAAKNGARVILAEDKPRFGGSLLTDDVNIGNQTGKEWAEDVIKELKQ 233
>gb|ABZ05929.1| putative glycine cleavage T-protein (aminomethyl transferase)
[uncultured marine microorganism HF4000_001B09]
Length=998
Score = 340 bits (872), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 167/221 (75%), Positives = 186/221 (84%), Gaps = 0/221 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
M+Q +RL+ G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct 1 MSQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF 60
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
FGAGVDEP A VQLY+ +TEPN ATE EL EGL KS NCWPSV+FD GAINN + F
Sbjct 61 FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEGLIVKSQNCWPSVSFDFGAINNLFQKF 120
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct 121 FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS 180
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 228
GLASA +AAKNGA+VILAEDKSRFGG+LL +V IGN+ K
Sbjct 181 GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK 221
>gb|ABZ06659.1| putative glycine cleavage T-protein (aminomethyl transferase)
[uncultured marine microorganism HF4000_133I24]
Length=998
Score = 337 bits (863), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 166/221 (75%), Positives = 184/221 (83%), Gaps = 0/221 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
M Q +RL+ G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct 1 MPQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF 60
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
FGAGVDEP A VQLY+ +TEPN ATE EL E L KS NCWPSV+FD GAINN + F
Sbjct 61 FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEDLIVKSQNCWPSVSFDFGAINNLFQKF 120
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct 121 FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS 180
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 228
GLASA +AAKNGA+VILAEDKSRFGG+LL +V IGN+ K
Sbjct 181 GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK 221
>ref|ZP_01754673.1| sarcosine oxidase, alpha subunit family protein [Roseobacter
sp. SK209-2-6]
gb|EBA16865.1| sarcosine oxidase, alpha subunit family protein [Roseobacter
sp. SK209-2-6]
Length=985
Score = 270 bits (690), Expect = 7e-71, Method: Composition-based stats.
Identities = 128/219 (58%), Positives = 162/219 (73%), Gaps = 1/219 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
MT+ RL+ GG INR K++SF F+G Y GYEGDTLASAL+ANG L+GRSFKYHRPRG
Sbjct 1 MTEVNRLD-GGQINRAKEVSFTFDGHRYKGYEGDTLASALLANGERLMGRSFKYHRPRGV 59
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
AG +EP A+V+L + G EPN +AT ELF+GLEA N WPS+ FD A+N+ F
Sbjct 60 LTAGSEEPNALVELRKGGRQEPNTRATVIELFDGLEAAPQNAWPSLRFDAMAVNDRFSNF 119
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
L AGFYYKTFMWPK+FW KIYEP IRKAAGLG+ S + D + Y+ + +CDLLI GS PS
Sbjct 120 LTAGFYYKTFMWPKAFWEKIYEPIIRKAAGLGSISFEEDPDLYDKGFLHCDLLIIGSGPS 179
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS 226
GLA+A +A ++GA+VILA++ R GG L + + +G+QS
Sbjct 180 GLAAALTAGRSGARVILADEDFRMGGRLNSETLALGDQS 218
>gb|EDZ42222.1| sarcosine oxidase, alpha subunit family [Rhodobacterales bacterium
HTCC2083]
Length=979
Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats.
Identities = 123/219 (56%), Positives = 156/219 (71%), Gaps = 1/219 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
MTQ R+E GG I+R+ + FKF+GK+Y G+ GDTLASAL+ANGV L+GRSFKYHRPRG
Sbjct 1 MTQVNRVE-GGQIDRNTPLKFKFDGKSYTGHAGDTLASALLANGVRLMGRSFKYHRPRGP 59
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
AG +EP AIV L EPN +AT ELF+GL A+S NCWPSV FD A+N+ F
Sbjct 60 LSAGSEEPNAIVTLRDGARAEPNTRATTAELFDGLSARSQNCWPSVKFDALAVNDAASDF 119
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
L AGFYYKTFMWP FW K+YEP IRKAAGLG S++ D + Y+ + +CDLLI G+ PS
Sbjct 120 LAAGFYYKTFMWPAPFWEKVYEPIIRKAAGLGALSMQEDPDEYDKGFRHCDLLIVGAGPS 179
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS 226
GL +A +A + G +VILA++ GG LL+ + +G+ S
Sbjct 180 GLMAALTAGRAGKEVILADEDFAMGGRLLSEQIEVGSTS 218
>ref|YP_611611.1| sarcosine oxidase alpha subunit family protein [Silicibacter
sp. TM1040]
gb|ABF62349.1| sarcosine oxidase alpha subunit family [Silicibacter sp. TM1040]
Length=984
GENE ID: 4075276 TM1040_3377 | sarcosine oxidase alpha subunit family protein
[Silicibacter sp. TM1040]
Score = 263 bits (673), Expect = 7e-69, Method: Composition-based stats.
Identities = 123/219 (56%), Positives = 162/219 (73%), Gaps = 1/219 (0%)
Query 8 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 67
MTQ R+ +GGLI+R +++F F+GKNY GY GDTLASAL+ANGV L+GRSFKYHRPRG
Sbjct 1 MTQVNRI-SGGLIDRSTELNFTFDGKNYQGYAGDTLASALLANGVRLMGRSFKYHRPRGV 59
Query 68 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 127
AG +EP A+V+L G EPN +AT E++EGL A S N WPS+ DV AIN+ F
Sbjct 60 LAAGSEEPNALVELRSGGRQEPNTRATVAEIYEGLSANSQNRWPSLKHDVMAINDRFSAF 119
Query 128 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 187
L AGFYYKTFMWP++FW K+YEP IRKAAGLG+ S + D + Y+ Y +CDLL+ G+ P+
Sbjct 120 LSAGFYYKTFMWPRAFWEKLYEPVIRKAAGLGSLSGEGDPDAYDKGYLHCDLLVIGAGPA 179
Query 188 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS 226
GL++A +A + GA+VILA++ + GG LL+ ++ NQS
Sbjct 180 GLSAALTAGRGGAQVILADEDFQLGGRLLSDAQSLCNQS 218
>ref|YP_166984.1| sarcosine oxidase alpha subunit family protein [Silicibacter
pomeroyi DSS-3]
gb|AAV95026.1| sarcosine oxidase, alpha subunit family [Silicibacter pomeroyi
DSS-3]
Length=977
GENE ID: 3193191 SPO1746 | sarcosine oxidase alpha subunit family protein
[Silicibacter pomeroyi DSS-3]
Score = 263 bits (673), Expect = 8e-69, Method: Composition-based stats.
Identities = 118/210 (56%), Positives = 156/210 (74%), Gaps = 0/210 (0%)
Query 13 RLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGAGV 72
R++ GLI+RD+ +SF F+G Y GY+GDTLASAL+AN V L+GRSFKYHRPRG AG
Sbjct 2 RVQGKGLIDRDRPVSFTFDGVGYSGYQGDTLASALLANEVRLVGRSFKYHRPRGILTAGS 61
Query 73 DEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLPAGF 132
+EP A+V + R G +PN++AT QE++EG+EA+S N WPS++FD+ AIN+ FL AGF
Sbjct 62 EEPNALVTIGRGGRQDPNVRATVQEIYEGMEAQSQNRWPSLSFDLMAINDLAAPFLGAGF 121
Query 133 YYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASA 192
YYKTFMWP+SFW K+YEP IR+AAGLG S + + +RYE + +CDLL+ G+ P+GL +A
Sbjct 122 YYKTFMWPRSFWEKLYEPVIRRAAGLGALSGQDNADRYERAFAFCDLLVIGAGPAGLMAA 181
Query 193 YSAAKNGAKVILAEDKSRFGGTLLTSDVNI 222
A + GA VILAE+ +R GG LL I
Sbjct 182 LVAGRAGADVILAEEDARMGGRLLAETYEI 211
------------------------------------------------------------------------------------------------
b)
Score E
Sequences producing significant alignments: (Bits) Value
sp|O87386.2|SOXA_RHIME RecName: Full=Sarcosine oxidase subuni... 255 1e-67
sp|Q46337.1|SOXA_CORS1 RecName: Full=Sarcosine oxidase subuni... 67.4 6e-11
sp|Q04616.3|3O1D_RHOOP RecName: Full=3-oxosteroid 1-dehydroge... 39.7 0.013
sp|P35903.1|ACHC_ACHFU RecName: Full=Achacin; Flags: Precursor 38.9 0.025
sp|Q556K4.1|AOFC_DICDI RecName: Full=Probable flavin-containi... 37.0 0.094
sp|A6US00.1|GGR_METVS RecName: Full=Digeranylgeranylglyceroph... 37.0 0.094
sp|A1U1Y5.1|STHA_MARAV RecName: Full=Soluble pyridine nucleot... 36.6 0.11
sp|Q556K3.1|AOFB_DICDI RecName: Full=Probable flavin-containi... 36.2 0.14
sp|B0KH90.1|STHA_PSEPG RecName: Full=Soluble pyridine nucleot... 36.2 0.15
sp|Q88KY8.3|STHA_PSEPK RecName: Full=Soluble pyridine nucleot... 36.2 0.15
sp|Q25861.1|TRXR_PLAF5 RecName: Full=Thioredoxin reductase; S... 36.2 0.17
sp|Q2NER9.1|GGR3_METST RecName: Full=Digeranylgeranylglycerop... 36.2 0.17
sp|P32382.1|NADO_THEBR RecName: Full=NADH oxidase 36.2 0.17
sp|A6VJ23.1|GGR_METM7 RecName: Full=Digeranylgeranylglyceroph... 35.4 0.23
sp|Q6M083.1|GGR_METMP RecName: Full=Digeranylgeranylglyceroph... 35.4 0.24
sp|A9A6R1.1|GGR_METM6 RecName: Full=Digeranylgeranylglyceroph... 35.4 0.25
sp|A4FZB4.1|GGR_METM5 RecName: Full=Digeranylgeranylglyceroph... 35.4 0.26
sp|Q6CZB1.1|STHA_ERWCT RecName: Full=Soluble pyridine nucleot... 35.0 0.30
sp|P32370.1|BAIH_EUBSP RecName: Full=NADH-dependent flavin ox... 35.0 0.31
sp|Q97ZY5.1|THI4_SULSO Putative thiazole biosynthetic enzyme 35.0 0.33
sp|O32434.1|PPOX_PROFF RecName: Full=Protoporphyrinogen oxida... 35.0 0.33
sp|Q9WZP4|THI4_THEMA Putative thiazole biosynthetic enzyme 35.0 0.34
sp|Q2NFF7.1|GGR2_METST RecName: Full=Digeranylgeranylglycerop... 34.7 0.41
sp|O18480.1|DLDH_MANSE RecName: Full=Dihydrolipoyl dehydrogen... 34.7 0.44
sp|P78965.2|GSHR_SCHPO RecName: Full=Glutathione reductase; S... 34.7 0.48
sp|Q8K9T7.1|DLDH_BUCAP RecName: Full=Dihydrolipoyl dehydrogen... 34.7 0.49
sp|P48639.1|GSHR_BURCE RecName: Full=Glutathione reductase; S... 34.3 0.60
sp|Q1QX78.1|STHA_CHRSD RecName: Full=Soluble pyridine nucleot... 34.3 0.64
sp|A6VW16.1|STHA_MARMS RecName: Full=Soluble pyridine nucleot... 33.9 0.74
sp|Q9V0J8|THI4_PYRAB Putative thiazole biosynthetic enzyme 33.9 0.78
sp|P54805.1|YNH2_METBA RecName: Full=Uncharacterized protein ... 33.9 0.79
sp|O59082.2|THI4_PYRHO RecName: Full=Putative thiazole biosyn... 33.9 0.82
sp|A5UNX8.1|GGR_METS3 RecName: Full=Digeranylgeranylglyceroph... 33.9 0.85
sp|Q15ZF7.1|PEPQ_PSEA6 RecName: Full=Xaa-Pro dipeptidase; Sho... 33.5 0.92
sp|Q3K9F5.1|STHA_PSEPF RecName: Full=Soluble pyridine nucleot... 33.5 0.93
sp|Q4KFA6.1|STHA_PSEF5 RecName: Full=Soluble pyridine nucleot... 33.5 0.94
sp|O05139.3|STHA_PSEFL RecName: Full=Soluble pyridine nucleot... 33.5 0.98
sp|Q4ZV77.2|STHA_PSEU2 RecName: Full=Soluble pyridine nucleot... 33.5 0.99
sp|Q48KI8.1|STHA_PSE14 RecName: Full=Soluble pyridine nucleot... 33.5 0.99
sp|Q1I7F0.1|STHA_PSEE4 RecName: Full=Soluble pyridine nucleot... 33.5 1.0
sp|Q9XBQ9.1|STHA_AZOVI RecName: Full=Soluble pyridine nucleot... 33.5 1.1
sp|A4YIV7.1|THI4_METS5 RecName: Full=Putative thiazole biosyn... 33.1 1.3
sp|O07668.1|MRAY_ENTHR RecName: Full=Phospho-N-acetylmuramoyl... 33.1 1.4
sp|Q884I6.3|STHA_PSESM RecName: Full=Soluble pyridine nucleot... 33.1 1.4
sp|Q04829.2|DLDH_HALVO RecName: Full=Dihydrolipoyl dehydrogen... 32.7 1.8
sp|O29786.2|GGR_ARCFU RecName: Full=Digeranylgeranylglyceroph... 32.3 2.1
sp|P80647.1|DLDH_HYMDI RecName: Full=Dihydrolipoyl dehydrogen... 32.3 2.1
sp|Q94IG7.1|PPOCM_SPIOL RecName: Full=Protoporphyrinogen oxid... 32.3 2.2
sp|P19643.3|AOFB_RAT RecName: Full=Amine oxidase [flavin-cont... 32.3 2.2
sp|A4XSQ1.1|STHA_PSEMY RecName: Full=Soluble pyridine nucleot... 32.3 2.2
sp|A1RW13.2|THI4_PYRIL RecName: Full=Putative thiazole biosyn... 32.3 2.3
sp|P0A0E4.1|MERA_STAES RecName: Full=Mercuric reductase; AltN... 32.3 2.4
sp|Q5JD25|THI4_PYRKO Putative thiazole biosynthetic enzyme 32.3 2.4
sp|Q8U0Q5|THI4_PYRFU Putative thiazole biosynthetic enzyme 32.0 2.5
sp|Q9Y9Z0.2|THI4_AERPE RecName: Full=Putative thiazole biosyn... 32.0 2.8
sp|A7ZEX8.1|MNMC_CAMC1 RecName: Full=tRNA 5-methylaminomethyl... 32.0 3.0
sp|P40974.1|PUO_MICRU RecName: Full=Putrescine oxidase 32.0 3.0
sp|Q55629.1|Y782_SYNY3 RecName: Full=Uncharacterized protein ... 32.0 3.0
sp|O26377.1|GGR1_METTH RecName: Full=Digeranylgeranylglycerop... 32.0 3.2
sp|Q12YW2.1|GGR1_METBU RecName: Full=Digeranylgeranylglycerop... 31.6 3.5
sp|Q17043.1|APLY_APLKU RecName: Full=Aplysianin-A; Flags: Pre... 31.6 3.8
sp|Q975R0.1|THI4_SULTO Putative thiazole biosynthetic enzyme 31.6 3.8
sp|Q0TA96.1|STHA_ECOL5 RecName: Full=Soluble pyridine nucleot... 31.6 3.9
sp|Q8R2T8.2|TF3C5_MOUSE RecName: Full=General transcription f... 31.6 4.0
sp|Q8FB93.3|STHA_ECOL6 RecName: Full=Soluble pyridine nucleot... 31.6 4.0
sp|P27306.5|STHA_ECOLI RecName: Full=Soluble pyridine nucleot... 31.6 4.0
sp|O00087.2|DLDH_SCHPO RecName: Full=Dihydrolipoyl dehydrogen... 31.6 4.0
sp|Q83MI1.1|STHA_SHIFL RecName: Full=Soluble pyridine nucleot... 31.6 4.0
sp|A8A770.1|STHA_ECOHS RecName: Full=Soluble pyridine nucleot... 31.6 4.0
sp|Q8X727.3|STHA_ECO57 RecName: Full=Soluble pyridine nucleot... 31.6 4.0
sp|P0AB60.1|YCIM_ECO57 RecName: Full=Uncharacterized protein ... 31.6 4.0
sp|P26829.1|DHNA_BACYN RecName: Full=NADH dehydrogenase; AltN... 31.6 4.1
sp|Q2NFZ1.1|GGR1_METST RecName: Full=Digeranylgeranylglycerop... 31.2 4.2
sp|Q5R4B1.1|DLDH_PONAB RecName: Full=Dihydrolipoyl dehydrogen... 31.2 4.3
sp|Q60HG3.1|DLDH_MACFA RecName: Full=Dihydrolipoyl dehydrogen... 31.2 4.3
sp|P49819.1|DLDH_CANFA RecName: Full=Dihydrolipoyl dehydrogen... 31.2 4.3
sp|Q21988.3|AMX1_CAEEL RecName: Full=Amine oxidase family mem... 31.2 4.4
sp|Q2NQZ3.1|STHA_SODGM RecName: Full=Soluble pyridine nucleot... 31.2 4.9
sp|A4WG49.1|STHA_ENT38 RecName: Full=Soluble pyridine nucleot... 31.2 5.1
sp|Q8VHE9.1|RETST_RAT RecName: Full=All-trans-retinol 13,14-r... 30.8 5.5
sp|Q64FW2.2|RETST_MOUSE RecName: Full=All-trans-retinol 13,14... 30.8 5.6
sp|Q8CIZ7.1|DLDH_CRIGR RecName: Full=Dihydrolipoyl dehydrogen... 30.8 6.1
sp|Q6P6R2.1|DLDH_RAT RecName: Full=Dihydrolipoyl dehydrogenas... 30.8 6.1
sp|P57303.1|DLDH_BUCAI RecName: Full=Dihydrolipoyl dehydrogen... 30.8 6.1
sp|P09623.1|DLDH_PIG RecName: Full=Dihydrolipoyl dehydrogenas... 30.8 6.2
sp|Q4JAF8.1|THI4_SULAC Putative thiazole biosynthetic enzyme 30.8 6.3
sp|P09622.1|DLDH_HUMAN RecName: Full=Dihydrolipoyl dehydrogen... 30.8 6.3
sp|Q54IT3.1|AOFA_DICDI RecName: Full=Probable flavin-containi... 30.8 6.3
sp|A4WKY7.2|THI4_PYRAR Putative thiazole biosynthetic enzyme 30.8 6.4
sp|O08749.2|DLDH_MOUSE RecName: Full=Dihydrolipoyl dehydrogen... 30.8 6.5
sp|A4VMU6.1|STHA_PSEU5 RecName: Full=Soluble pyridine nucleot... 30.8 6.7
sp|Q465Z7.1|GGR_METBF RecName: Full=Digeranylgeranylglyceroph... 30.8 6.9
sp|Q6LXJ8|THI4_METMP Putative thiazole biosynthetic enzyme 30.8 6.9
sp|Q5BLE8.1|RETST_DANRE RecName: Full=Putative all-trans-reti... 30.4 7.3
sp|Q58053.1|Y636_METJA RecName: Full=Uncharacterized protein ... 30.4 7.4
sp|Q0KF58.1|METX_RALEH Homoserine O-acetyltransferase (Homose... 30.4 7.7
sp|Q12WF0.1|GGR2_METBU RecName: Full=Digeranylgeranylglycerop... 30.4 7.7
sp|Q8PU50.2|GGR_METMA RecName: Full=Digeranylgeranylglyceroph... 30.4 7.8
sp|Q8TQQ6.1|GGR_METAC RecName: Full=Digeranylgeranylglyceroph... 30.4 7.9
sp|A7MID0.1|TDH_ENTS8 RecName: Full=L-threonine 3-dehydrogenase 30.4 8.5
sp|Q8BUY8.2|GASP2_MOUSE RecName: Full=G-protein coupled recep... 30.4 8.7
sp|Q9HUY1.1|DLDH3_PSEAE RecName: Full=Dihydrolipoyl dehydroge... 30.0 9.6
>sp|O87386.2|SOXA_RHIME RecName: Full=Sarcosine oxidase subunit alpha; Short=Sarcosine
oxidase subunit
Length=987
Score = 255 bits (652), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 119/217 (54%), Positives = 155/217 (71%), Gaps = 0/217 (0%)
Query 3 QSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFG 62
S+RL GL++R+ +SF F+G+ G EGDTLASAL+ANG L+GRSFKYHRPRG
Sbjct 2 SSYRLPKRGLVDRNVPLSFTFDGRPMQGLEGDTLASALLANGRMLVGRSFKYHRPRGILT 61
Query 63 AGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLP 122
AG EP A+V + R G EPN +AT QEL+EGLEA+S N WPS+ FD+GA+N L FL
Sbjct 62 AGAAEPNALVTVGRGGRAEPNTRATMQELYEGLEARSQNRWPSLAFDIGALNGLLSPFLG 121
Query 123 AGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGL 182
AGFYYKTFMWP W K+YEP IR+AAGLG AS + D + YE + +CDLL+ G+ P+GL
Sbjct 122 AGFYYKTFMWPAPLWEKLYEPVIRRAAGLGKASYEADPDAYEKSWAHCDLLVIGAGPTGL 181
Query 183 ASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS 219
A+A +A + GA+VIL ++ S GG+LL+ I ++
Sbjct 182 AAALTAGRAGARVILVDEGSLPGGSLLSDTATIDGKA 218
>sp|Q46337.1|SOXA_CORS1 RecName: Full=Sarcosine oxidase subunit alpha; Short=Sarcosine
oxidase subunit
Length=967
Score = 67.4 bits (163), Expect = 6e-11, Method: Composition-based stats.
Identities = 56/196 (28%), Positives = 89/196 (45%), Gaps = 45/196 (22%)
Query 13 INRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGAGVDEPYAIV 72
I+R + + +GK + GDT+ASA++ANG G S RPRG F AGV+EP A+V
Sbjct 19 IDRGEALVLTVDGKQLEAFRGDTVASAMLANGQRACGNSMYLDRPRGIFSAGVEEPNALV 78
Query 73 QLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLPAGFYYKTFMW 132
+ EQ++ E + A + V N L
Sbjct 79 TVEAR---------HEQDINESMLAATT---------VPVTANLSATLL----------- 109
Query 133 PKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASAYSAAKNG 192
GLG D Y+H + + D+L+ G+ P+GLA+A A+++G
Sbjct 110 ----------------RGLGVLDPSTDPAYYDHVHVHTDVLVVGAGPAGLAAAREASRSG 153
Query 193 AKVILAEDKSRFGGTL 208
A+V+L ++++ GG+L
Sbjct 154 ARVLLLDERAEAGGSL 169
>sp|Q04616.3|3O1D_RHOOP RecName: Full=3-oxosteroid 1-dehydrogenase
Length=507
Score = 39.7 bits (91), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 0/46 (0%)
Query 170 CDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNI 215
CDLL+ GS L AY+AA G I+ E RFGGT S +I
Sbjct 8 CDLLVVGSGGGALTGAYTAAAQGLTTIVLEKTDRFGGTSAYSGASI 53
>sp|P35903.1|ACHC_ACHFU RecName: Full=Achacin; Flags: Precursor
Length=531
Score = 38.9 bits (89), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query 170 CDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDV-NIGNQSVKSGQIVLF 228
D+ + G+ PSG SAY G V L E +R GG L T+ + N+ + +++SG + F
Sbjct 38 VDVAVVGAGPSGTYSAYKLRNKGQTVELFEYSNRIGGRLFTTHLPNVPDLNLESGGMRYF 97
Query 229 QNLKKCLMLL 238
+N K +L
Sbjct 98 KNHHKIFGVL 107
>sp|Q556K4.1|AOFC_DICDI RecName: Full=Probable flavin-containing monoamine oxidase C
Length=467
Score = 37.0 bits (84), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query 171 DLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSG 223
D +I G SGL +AY K+ K+++ E ++RFGG T + IG+ V +G
Sbjct 6 DTIIIGGGMSGLKTAYDLKKSNFKILVLEARNRFGGR--TDSIKIGDGWVDAG 56
>sp|A6US00.1|GGR_METVS RecName: Full=Digeranylgeranylglycerophospholipid reductase;
Short=DGGGPL reductase; AltName: Full=2,3-di-O-geranylgeranylglyceryl
phosphate reductase; AltName: Full=Geranylgeranyl
reductase; Short=GGR
Length=390
Score = 37.0 bits (84), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 0/38 (0%)
Query 168 EYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFG 205
E D+++ G+ P+G S+Y+A+KNGAK +L E G
Sbjct 6 ESYDVVVVGAGPAGSMSSYNASKNGAKTLLIEKAQEIG 43
>sp|A1U1Y5.1|STHA_MARAV RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH;
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
Length=463
GENE ID: 4654234 Maqu_1923 | soluble pyridine nucleotide transhydrogenase
[Marinobacter aquaeolei VT8]
Score = 36.6 bits (83), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 3/43 (6%)
Query 164 EHKYEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGG 206
EH Y D+++ G+ PSG +A +AAK+ +V + EDK GG
Sbjct 3 EHHY---DVVVIGAGPSGEGAAMNAAKHNRRVAIIEDKPTVGG 42
>sp|Q556K3.1|AOFB_DICDI RecName: Full=Probable flavin-containing monoamine oxidase B
Length=471
Score = 36.2 bits (82), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query 167 YEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSG 223
Y Y D +I G SGL +AY K+ K+++ E ++RFGG T V +G+ V +G
Sbjct 7 YNY-DTIIIGGGLSGLNTAYDLKKSNFKILVLEARNRFGGR--TDSVKVGDGWVDAG 60
>sp|B0KH90.1|STHA_PSEPG RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH;
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
Length=464
GENE ID: 5869472 PputGB1_1692 | soluble pyridine nucleotide transhydrogenase
[Pseudomonas putida GB-1]
Score = 36.2 bits (82), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query 167 YEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGG 206
Y Y D+++ GS P+G +A +AAK G KV + +D+ + GG
Sbjct 4 YNY-DVVVLGSGPAGEGAAMNAAKAGRKVAMVDDRRQVGG 42
>sp|Q88KY8.3|STHA_PSEPK RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH;
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
sp|A5W6F5.1|STHA_PSEP1 RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH;
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
Length=464
GENE ID: 1045007 sthA | soluble pyridine nucleotide transhydrogenase
[Pseudomonas putida KT2440]
Score = 36.2 bits (82), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query 167 YEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGG 206
Y Y D+++ GS P+G +A +AAK G KV + +D+ + GG
Sbjct 4 YNY-DVVVLGSGPAGEGAAMNAAKAGRKVAMVDDRRQVGG 42
---------------------------------------------------------------------------------------------------
c)
Score E
Sequences producing significant alignments: (Bits) Value
gb|EDZ60822.1| sarcosine oxidase alpha subunit [Candidatus Pe... 422 5e-121
ref|ZP_01264926.1| sarcosine oxidase alpha chain [Candidatus ... 415 6e-118
ref|YP_266690.1| sarcosine oxidase alpha chain [Candidatus Pe... 415 6e-118
gb|ABZ06303.1| putative glycine cleavage T-protein (aminometh... 363 2e-101
gb|ABZ05929.1| putative glycine cleavage T-protein (aminometh... 352 2e-95
gb|ABZ06659.1| putative glycine cleavage T-protein (aminometh... 349 2e-94
ref|ZP_01754673.1| sarcosine oxidase, alpha subunit family pr... 258 5e-67
ref|YP_266475.1| sarcosine oxidase alpha chain [Candidatus Pe... 255 3e-66
ref|ZP_01546296.1| sarcosine oxidase, alpha subunit [Stappia ... 254 9e-66
ref|NP_106776.1| sarcosine oxidase alpha subunit [Mesorhizobi... 253 2e-65
ref|NP_356432.1| sarcosine oxidase alpha subunit [Agrobacteri... 253 2e-65
gb|EDZ42222.1| sarcosine oxidase, alpha subunit family [Rhodo... 252 3e-65
gb|EDZ61064.1| sarcosine oxidase, alpha subunit [Candidatus P... 251 4e-65
ref|YP_001261448.1| glycine cleavage T protein (aminomethyl t... 251 4e-65
emb|CAD31286.1| PUTATIVE SARCOSINE OXIDASE ALPHA SUBUNIT PROT... 251 4e-65
gb|EDZ45195.1| sarcosine oxidase, alpha subunit family [Rhodo... 251 6e-65
ref|YP_611611.1| sarcosine oxidase alpha subunit family prote... 251 8e-65
ref|YP_001592099.1| sarcosine oxidase alpha subunit family pr... 250 1e-64
ref|NP_697265.1| sarcosine oxidase, alpha subunit [Brucella s... 250 1e-64
emb|CAD31640.1| PROBABLE SARCOSINE OXIDASE ALPHA SUBUNIT TRAN... 250 1e-64
ref|YP_166984.1| sarcosine oxidase alpha subunit family prote... 249 2e-64
ref|YP_002362923.1| sarcosine oxidase, alpha subunit family [... 249 3e-64
ref|YP_001258259.1| sarcosine oxidase alpha subunit [Brucella... 249 3e-64
ref|ZP_01754466.1| sarcosine oxidase, alpha subunit family pr... 249 3e-64
ref|YP_002277908.1| sarcosine oxidase, alpha subunit family [... 248 6e-64
ref|NP_881143.1| sarcosine oxidase alpha subunit [Bordetella ... 248 6e-64
ref|YP_771596.1| putative sarcosine oxidase alpha subunit [Rh... 247 8e-64
ref|ZP_02147355.1| sarcosine oxidase, alpha subunit family pr... 247 1e-63
ref|NP_885663.1| sarcosine oxidase alpha subunit [Bordetella ... 247 1e-63
ref|YP_614139.1| sarcosine oxidase alpha subunit family prote... 247 1e-63
gb|ABZ05963.1| hypothetical protein ALOHA_HF4000001L24ctg1g32... 246 1e-63
ref|ZP_02297488.1| Uncharacterized NAD(FAD)-dependent dehydro... 246 1e-63
ref|NP_540637.1| sarcosine oxidase alpha subunit [Brucella me... 246 1e-63
ref|ZP_02168605.1| sarcosine oxidase alpha subunit [Hoeflea p... 246 2e-63
ref|YP_001328944.1| sarcosine oxidase alpha subunit family pr... 246 2e-63
ref|YP_472588.1| sarcosine oxidase alpha subunit protein [Rhi... 246 2e-63
ref|ZP_02149843.1| sarcosine oxidase, alpha subunit family pr... 246 2e-63
ref|YP_001368863.1| sarcosine oxidase alpha subunit family pr... 246 2e-63
ref|YP_743995.1| sarcosine oxidase alpha subunit [Granulibact... 246 2e-63
ref|ZP_01056477.1| sarcosine oxidase, alpha subunit family pr... 245 3e-63
ref|YP_001985949.1| sarcosine oxidase protein, alpha subunit ... 245 4e-63
ref|ZP_02188467.1| sarcosine oxidase, alpha subunit family pr... 245 4e-63
ref|ZP_02141516.1| sarcosine oxidase, alpha subunit [Roseobac... 244 5e-63
ref|NP_384189.1| putative sarcosine oxidase alpha subunit tra... 244 7e-63
ref|NP_107653.1| sarcosine oxidase alpha subunit [Mesorhizobi... 242 3e-62
ref|ZP_01054876.1| sarcosine oxidase, alpha subunit family pr... 242 3e-62
ref|YP_682013.1| sarcosine oxidase, alpha subunit [Roseobacte... 241 5e-62
ref|ZP_02118479.1| sarcosine oxidase alpha subunit [Methyloba... 241 8e-62
ref|NP_104289.1| sarcosine oxidase alpha subunit [Mesorhizobi... 241 8e-62
ref|ZP_01033968.1| sarcosine oxidase, alpha subunit family pr... 241 8e-62
ref|YP_001524410.1| sarcosine oxidase alpha subunit [Azorhizo... 240 1e-61
ref|YP_743901.1| sarcosine oxidase alpha subunit [Granulibact... 240 1e-61
gb|EDY87835.1| sarcosine oxidase, alpha subunit [Octadecabact... 239 2e-61
gb|EEA96709.1| sarcosine oxidase, alpha subunit family [Pseud... 239 2e-61
ref|YP_001533452.1| sarcosine oxidase alpha subunit family pr... 239 3e-61
ref|ZP_02059400.1| sarcosine oxidase, alpha subunit family [M... 238 4e-61
ref|ZP_01439029.1| sarcosine oxidase, alpha subunit family pr... 238 4e-61
ref|YP_166827.1| sarcosine oxidase alpha subunit family prote... 238 4e-61
ref|ZP_01002095.1| sarcosine oxidase, alpha subunit [Loktanel... 238 7e-61
gb|EEB71634.1| sarcosine oxidase, alpha subunit family [Ruege... 237 9e-61
ref|YP_002100542.1| hypothetical protein BDAG_03838 [Burkhold... 237 1e-60
ref|ZP_02485951.1| sarcosine oxidase, alpha subunit [Burkhold... 237 1e-60
ref|ZP_02466697.1| sarcosine oxidase, alpha subunit [Burkhold... 237 1e-60
ref|ZP_02459962.1| putative sarcosine oxidase alpha subunit [... 237 1e-60
ref|ZP_02407205.1| sarcosine oxidase, alpha subunit [Burkhold... 237 1e-60
ref|YP_439195.1| sarcosine oxidase, alpha subunit [Burkholder... 237 1e-60
ref|YP_001062947.1| sarcosine oxidase, alpha subunit, heterot... 237 1e-60
ref|YP_001075894.1| sarcosine oxidase, alpha subunit [Burkhol... 237 1e-60
ref|YP_111378.1| sarcosine oxidase alpha subunit [Burkholderi... 237 1e-60
ref|YP_001583482.1| sarcosine oxidase alpha subunit family pr... 236 1e-60
gb|EEB80013.1| sarcosine oxidase, alpha subunit family [marin... 236 1e-60
ref|ZP_02154807.1| sarcosine oxidase, alpha subunit family pr... 236 1e-60
ref|YP_001419578.1| sarcosine oxidase alpha subunit family pr... 236 1e-60
ref|ZP_03456801.1| sarcosine oxidase, alpha subunit [Burkhold... 236 2e-60
ref|YP_001641183.1| sarcosine oxidase alpha subunit family pr... 236 2e-60
ref|NP_519224.1| sarcosine oxidase subunit alpha [Ralstonia s... 236 2e-60
ref|YP_001755163.1| sarcosine oxidase alpha subunit family pr... 236 3e-60
ref|ZP_02370511.1| sarcosine oxidase, alpha subunit [Burkhold... 236 3e-60
gb|EEB84114.1| sarcosine oxidase, alpha subunit family [Roseo... 235 3e-60
ref|YP_001207761.1| sarcosine oxidase, alpha subunit [Bradyrh... 235 3e-60
ref|YP_610570.1| sarcosine oxidase (alpha subunit) oxidoreduc... 235 3e-60
ref|ZP_01449178.1| sarcosine oxidase, alpha subunit family pr... 226 4e-60
ref|YP_001109958.1| sarcosine oxidase alpha subunit family pr... 235 4e-60
gb|EDZ45988.1| sarcosine oxidase, alpha subunit family [Rhodo... 235 4e-60
ref|ZP_01441755.1| sarcosine oxidase, alpha subunit family pr... 235 4e-60
ref|YP_001926650.1| sarcosine oxidase, alpha subunit family [... 234 1e-59
ref|NP_356342.1| sarcosine oxidase alpha subunit [Agrobacteri... 234 1e-59
ref|NP_521609.1| sarcosine oxidase subunit alpha [Ralstonia s... 234 1e-59
ref|YP_001238046.1| sarcosine oxidase, alpha subunit [Bradyrh... 234 1e-59
ref|ZP_03395917.1| sarcosine oxidase, alpha subunit [Pseudomo... 229 1e-59
ref|NP_790307.1| sarcosine oxidase, alpha subunit [Pseudomona... 229 1e-59
ref|ZP_02366005.1| sarcosine oxidase, alpha subunit [Burkhold... 233 1e-59
ref|ZP_02358969.1| sarcosine oxidase, alpha subunit [Burkhold... 233 1e-59
ref|ZP_01879735.1| sarcosine oxidase, alpha subunit family pr... 233 1e-59
ref|YP_237780.1| sarcosine oxidase, alpha subunit, heterotetr... 228 2e-59
ref|ZP_02054752.1| sarcosine oxidase, alpha subunit family [M... 233 2e-59
ref|ZP_01447755.1| sarcosine oxidase, alpha subunit family pr... 233 2e-59
ref|ZP_02886813.1| sarcosine oxidase, alpha subunit family [B... 233 2e-59
ref|YP_001751727.1| sarcosine oxidase alpha subunit family pr... 233 2e-59
ref|YP_262784.1| sarcosine oxidase, alpha subunit [Pseudomona... 233 2e-59
>gb|EDZ60822.1| sarcosine oxidase alpha subunit [Candidatus Pelagibacter sp.
HTCC7211]
Length=998
Score = 422 bits (1084), Expect(2) = 5e-121
Identities = 202/221 (91%), Positives = 210/221 (95%), Gaps = 0/221 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
MTQSFRL GLINRD+K+SFKFN Y+GYEGDTLASALIANGVHL+GRSFKYHRPRGF
Sbjct 1 MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLVGRSFKYHRPRGF 60
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF
Sbjct 61 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 120
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG AS KHDKERYEHKYEYCDLLI GS PS
Sbjct 121 LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASTKHDKERYEHKYEYCDLLIAGSGPS 180
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 860
GLASAY+AAKNGA+VILAEDKSRFGGTLLTSDVNIGNQ+ K
Sbjct 181 GLASAYAAAKNGARVILAEDKSRFGGTLLTSDVNIGNQTGK 221
Score = 38.5 bits (88), Expect(2) = 5e-121
Identities = 16/20 (80%), Positives = 18/20 (90%), Gaps = 0/20 (0%)
Frame = +2
Query 857 KEWADSIVSELKEMSNVTIK 916
KEWAD I+SELKEM NVT+K
Sbjct 221 KEWADGIISELKEMPNVTVK 240
>ref|ZP_01264926.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1002]
gb|EAS85413.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1002]
Length=998
Score = 415 bits (1066), Expect(2) = 6e-118
Identities = 197/221 (89%), Positives = 209/221 (94%), Gaps = 0/221 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
MTQ++RL+ GLINRDKKISFKFNG YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct 1 MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF 60
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct 61 FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF 120
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct 121 LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS 180
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 860
GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K
Sbjct 181 GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGK 221
Score = 35.0 bits (79), Expect(2) = 6e-118
Identities = 14/20 (70%), Positives = 18/20 (90%), Gaps = 0/20 (0%)
Frame = +2
Query 857 KEWADSIVSELKEMSNVTIK 916
KEWA++I+SELKEM NV +K
Sbjct 221 KEWAENIISELKEMPNVIVK 240
>ref|YP_266690.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1062]
gb|AAZ22086.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1062]
Length=998
GENE ID: 3517319 soxA2 | sarcosine oxidase alpha chain
[Candidatus Pelagibacter ubique HTCC1062]
Score = 415 bits (1066), Expect(2) = 6e-118
Identities = 197/221 (89%), Positives = 209/221 (94%), Gaps = 0/221 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
MTQ++RL+ GLINRDKKISFKFNG YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct 1 MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF 60
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct 61 FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF 120
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct 121 LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS 180
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 860
GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K
Sbjct 181 GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGK 221
Score = 35.0 bits (79), Expect(2) = 6e-118
Identities = 14/20 (70%), Positives = 18/20 (90%), Gaps = 0/20 (0%)
Frame = +2
Query 857 KEWADSIVSELKEMSNVTIK 916
KEWA++I+SELKEM NV +K
Sbjct 221 KEWAENIISELKEMPNVIVK 240
>gb|ABZ06303.1| putative glycine cleavage T-protein (aminomethyl transferase)
[uncultured marine microorganism HF4000_008G09]
Length=998
Score = 363 bits (933), Expect(2) = 2e-101
Identities = 173/221 (78%), Positives = 188/221 (85%), Gaps = 0/221 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
MTQ FRL GL+NR+K ISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct 1 MTQKFRLPNLGLVNRNKTISFHFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF 60
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
FGAGVDEP A VQLY +TEPN+ ATE EL EGL AKS NCWPSV FDVGAINNF F
Sbjct 61 FGAGVDEPNAKVQLYEGDKTEPNVNATELELVEGLVAKSQNCWPSVEFDVGAINNFFSRF 120
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG AS K D RYEHKYEYCD+L+ GS PS
Sbjct 121 FPAGFYYKTFMWPKSFWYKVYEPLIRKAAGLGVASPKPDTSRYEHKYEYCDVLVVGSGPS 180
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 860
GL+SAY+AAKNGA+VILAEDK RFGG+LLT DVNIGNQ+ K
Sbjct 181 GLSSAYAAAKNGARVILAEDKPRFGGSLLTDDVNIGNQTGK 221
Score = 31.6 bits (70), Expect(2) = 2e-101
Identities = 11/20 (55%), Positives = 16/20 (80%), Gaps = 0/20 (0%)
Frame = +2
Query 857 KEWADSIVSELKEMSNVTIK 916
KEWA+ ++ ELK+M NV +K
Sbjct 221 KEWAEDVIKELKQMPNVIVK 240
>gb|ABZ05929.1| putative glycine cleavage T-protein (aminomethyl transferase)
[uncultured marine microorganism HF4000_001B09]
Length=998
Score = 352 bits (904), Expect = 2e-95
Identities = 167/221 (75%), Positives = 186/221 (84%), Gaps = 0/221 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
M+Q +RL+ G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct 1 MSQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF 60
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
FGAGVDEP A VQLY+ +TEPN ATE EL EGL KS NCWPSV+FD GAINN + F
Sbjct 61 FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEGLIVKSQNCWPSVSFDFGAINNLFQKF 120
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct 121 FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS 180
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 860
GLASA +AAKNGA+VILAEDKSRFGG+LL +V IGN+ K
Sbjct 181 GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK 221
>gb|ABZ06659.1| putative glycine cleavage T-protein (aminomethyl transferase)
[uncultured marine microorganism HF4000_133I24]
Length=998
Score = 349 bits (895), Expect = 2e-94
Identities = 166/221 (75%), Positives = 184/221 (83%), Gaps = 0/221 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
M Q +RL+ G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct 1 MPQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF 60
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
FGAGVDEP A VQLY+ +TEPN ATE EL E L KS NCWPSV+FD GAINN + F
Sbjct 61 FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEDLIVKSQNCWPSVSFDFGAINNLFQKF 120
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct 121 FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS 180
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK 860
GLASA +AAKNGA+VILAEDKSRFGG+LL +V IGN+ K
Sbjct 181 GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK 221
>ref|ZP_01754673.1| sarcosine oxidase, alpha subunit family protein [Roseobacter
sp. SK209-2-6]
gb|EBA16865.1| sarcosine oxidase, alpha subunit family protein [Roseobacter
sp. SK209-2-6]
Length=985
Score = 258 bits (659), Expect = 5e-67
Identities = 128/219 (58%), Positives = 162/219 (73%), Gaps = 1/219 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
MT+ RL+ GG INR K++SF F+G Y GYEGDTLASAL+ANG L+GRSFKYHRPRG
Sbjct 1 MTEVNRLD-GGQINRAKEVSFTFDGHRYKGYEGDTLASALLANGERLMGRSFKYHRPRGV 59
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
AG +EP A+V+L + G EPN +AT ELF+GLEA N WPS+ FD A+N+ F
Sbjct 60 LTAGSEEPNALVELRKGGRQEPNTRATVIELFDGLEAAPQNAWPSLRFDAMAVNDRFSNF 119
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
L AGFYYKTFMWPK+FW KIYEP IRKAAGLG+ S + D + Y+ + +CDLLI GS PS
Sbjct 120 LTAGFYYKTFMWPKAFWEKIYEPIIRKAAGLGSISFEEDPDLYDKGFLHCDLLIIGSGPS 179
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS 854
GLA+A +A ++GA+VILA++ R GG L + + +G+QS
Sbjct 180 GLAAALTAGRSGARVILADEDFRMGGRLNSETLALGDQS 218
>ref|YP_266475.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1062]
ref|ZP_01265173.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1002]
gb|AAZ21871.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1062]
gb|EAS84273.1| sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique
HTCC1002]
Length=1002
GENE ID: 3517368 soxA | sarcosine oxidase alpha chain
[Candidatus Pelagibacter ubique HTCC1062] (10 or fewer PubMed links)
Score = 255 bits (652), Expect = 3e-66
Identities = 124/226 (54%), Positives = 163/226 (72%), Gaps = 7/226 (3%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
M ++ R+ T I+ ++SFKFNGK+YFGY+GDTLASAL+ANG+HL+GRSFKYHRPRG
Sbjct 1 MLKNLRVTTSKYIDETSRVSFKFNGKSYFGYKGDTLASALLANGIHLVGRSFKYHRPRGI 60
Query 378 FGAGVDEPYAIVQLYRN-GETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKI 554
+G +EP AIVQ+ N TEPN++ATE E++ GLEA S NCWPSVNFD+G INNFL
Sbjct 61 MTSGSEEPNAIVQVNNNTALTEPNVRATELEIYHGLEANSQNCWPSVNFDIGGINNFLSP 120
Query 555 FLPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRP 734
LPAGFYYKTFMWP +FW K YE IR +AGLG + D + Y+HKY +CD+L+ G+
Sbjct 121 LLPAGFYYKTFMWPANFWEK-YEYVIRHSAGLGKSPTVPDPDIYDHKYIHCDVLVIGAGI 179
Query 735 SGLASAYSAAKNGAKVILAEDKSRFGGTLLTSD-----VNIGNQSV 857
SG+ +A +AAKN K +L ++K+ GG+ + + +N N SV
Sbjct 180 SGIIAAKTAAKNNLKTLLLDEKNEIGGSTIFQNSDHIKINDQNSSV 225
>ref|ZP_01546296.1| sarcosine oxidase, alpha subunit [Stappia aggregata IAM 12614]
gb|EAV44852.1| sarcosine oxidase, alpha subunit [Stappia aggregata IAM 12614]
Length=1000
Score = 254 bits (648), Expect = 9e-66
Identities = 117/210 (55%), Positives = 161/210 (76%), Gaps = 0/210 (0%)
Frame = +3
Query 198 MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF 377
M+Q FR E GG I+R ++++F F+G+ G++GDTLASAL+ANGVHL+GRSFKYHRPRG
Sbjct 1 MSQPFRTEKGGRIDRAEQLTFTFDGEEMQGHKGDTLASALLANGVHLVGRSFKYHRPRGI 60
Query 378 FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF 557
AG +EP A+V +YRNG+ PN++AT+ EL++GLEA S N +PS+ FD+GA+N+ L
Sbjct 61 LTAGSEEPNALVGVYRNGDQTPNLRATQVELYQGLEAISQNRFPSLGFDIGAVNDLLSPL 120
Query 558 LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS 737
PAGFYYKTFMWP +FW K+YEP IR AAGLG D + Y + Y +CD+L+ GS P+
Sbjct 121 FPAGFYYKTFMWPHAFWDKVYEPIIRSAAGLGKPPKNPDHDVYGNIYAHCDVLVVGSGPT 180
Query 738 GLASAYSAAKNGAKVILAEDKSRFGGTLLT 827
GLA+A +A + GAKV+L ++++ FGG+LL+
Sbjct 181 GLAAALAAGETGAKVMLVDEQAEFGGSLLS 210
>ref|NP_106776.1| sarcosine oxidase alpha subunit [Mesorhizobium loti MAFF303099]
dbj|BAB52562.1| sarcosine oxidase alpha subunit [Mesorhizobium loti MAFF303099]
Length=988
GENE ID: 1229431 mll6238 | sarcosine oxidase alpha subunit
[Mesorhizobium loti MAFF303099] (10 or fewer PubMed links)
Score = 253 bits (646), Expect = 2e-65
Identities = 116/216 (53%), Positives = 159/216 (73%), Gaps = 0/216 (0%)
Frame = +3
Query 207 SFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGA 386
S+RL +GGLI+R ++ F F+G++ G+ GDTLASAL+ANG L+GRSFKYHRPRG A
Sbjct 3 SYRLPSGGLIDRHSRLGFSFDGQSLTGHAGDTLASALLANGRQLVGRSFKYHRPRGILTA 62
Query 387 GVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLPA 566
G EP A++ + G TEPN +AT Q+L++GLEA+S N WPS+NFD+G++N L FL A
Sbjct 63 GAAEPNALMTIGSGGRTEPNTRATMQDLYDGLEARSQNRWPSLNFDIGSLNGLLSPFLAA 122
Query 567 GFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLA 746
GFYYKTFMWP FW +YEPFIR+AAGLG A+ + D +RYE + +CDLL+ G+ P+GLA
Sbjct 123 GFYYKTFMWPAKFWEGLYEPFIRRAAGLGKATYEADPDRYEKSWAHCDLLVIGAGPAGLA 182
Query 747 SAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS 854
+A + GA+VI+ ++ S GG+LL+ +G +S
Sbjct 183 AALIVGRAGARVIILDEHSLAGGSLLSETATVGGES 218
ORF finding
PROTOCOL:
a) SMS ORF Finder / direct strand / frames 1, 2 & 3 / min 60 AA / initiation 'any codon' / genetic code 'standard'
b) SMS ORF Finder / reverse strand / frames 1, 2 & 3 / min 60 AA / initiation 'any codon' / genetic code 'standard'
---------------------------------------------------------------------------------------------------
RESULTS ANALYSIS:
a) Two ORFs are found on the direct strand, one of which is much longer than the other. We therefore select the longer ORF, located in reading frame +3 (bases 177 to 914). The BLASTX results obtained in a later step reveal a frame shift near the 3' end of the read: the alignment switches from frame +3 to frame +2 around base 857.
b) No ORF is found in any of the 3 reading frames of the reverse strand.
---------------------------------------------------------------------------------------------------
RAW RESULTS:
a)
>ORF number 1 in reading frame 1 on the direct strand extends from base 1 to base 198.
GTTAAACGACCAGAATTAGGTGAAGAAATATCTGATCACGATTGGGATAATTTTGTTTAC
AATAGAAAAAGCTTGAGAGGAAAGCATTGGGAGTTATGGCAACATTTATCAGGTTGCAGA
CAATGGATTAAAGTTCAGAGAGATACAGCTACACACGAAATTTTTAAAACTCTTAAAGCA
AACGAAGATATTTCATAA

>Translation of ORF number 1 in reading frame 1 on the direct strand.
VKRPELGEEISDHDWDNFVYNRKSLRGKHWELWQHLSGCRQWIKVQRDTATHEIFKTLKA
NEDIS*

No ORFs were found in reading frame 2.

>ORF number 1 in reading frame 3 on the direct strand extends from base 177 to base 914.
AGCAAACGAAGATATTTCATAATGACACAAAGTTTTAGATTAGAAACTGGTGGATTAATA
AATAGAGATAAAAAAATTTCTTTTAAATTTAATGGTAAAAATTATTTTGGTTATGAGGGA
GACACTCTTGCTTCTGCATTAATTGCCAATGGAGTTCATTTAATTGGAAGAAGTTTCAAA
TATCATAGACCAAGAGGTTTTTTTGGTGCTGGGGTTGATGAGCCATATGCAATAGTTCAA
TTATACAGAAACGGTGAAACAGAGCCAAATATTAAAGCTACTGAACAAGAACTTTTTGAA
GGTCTTGAAGCAAAAAGTGTTAATTGTTGGCCGAGTGTGAATTTTGATGTTGGAGCTATA
AATAATTTTTTAAAGATATTTCTTCCTGCAGGCTTTTATTACAAGACTTTTATGTGGCCA
AAAAGTTTTTGGTATAAAATTTATGAACCATTCATCAGAAAAGCTGCTGGTTTAGGCACT
GCATCTATAAAACATGATAAAGAAAGATATGAACATAAATATGAATATTGTGATCTGCTA
ATCACAGGCTCACGTCCATCTGGATTAGCGAGTGCTTATTCAGCTGCAAAAAATGGTGCT
AAAGTAATTCTCGCAGAGGACAAATCACGATTTGGTGGAACTCTATTAACCAGTGATGTC
AATATAGGGAATCAATCAGTAAAGAGTGGGCAGATAGTATTGTTTCAGAACTTAAAGAAA
TGTCTAATGTTACTATAA

>Translation of ORF number 1 in reading frame 3 on the direct strand.
SKRRYFIMTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFK
YHRPRGFFGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAI
NNFLKIFLPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLL
ITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK
CLMLL*
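
ILLUSTRATIVE SKETCH:
The ORF search described in the protocol above can be approximated with a few lines of Python. This is a minimal sketch, not the SMS ORF Finder implementation. It assumes that with initiation 'any codon' an ORF is any stretch between two stop codons (or a sequence end) in a given frame, that the reported end coordinate includes the stop codon, and that the minimum length is counted in codons excluding the stop. The names `find_orfs` and `frame_of` are hypothetical helpers introduced here for illustration. `frame_of` shows the arithmetic behind the frame shift noted in the analysis: a BLASTX query start at base 198 lies in frame +3, while a start at base 857 lies in frame +2.

```python
# Standard genetic code stop codons (assumption: 'standard' code as in the protocol).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def frame_of(position):
    """Return the forward reading frame (1, 2 or 3) of a 1-based base position."""
    return (position - 1) % 3 + 1


def find_orfs(seq, min_aa=60):
    """Scan the three forward frames for stop-delimited ORFs.

    Returns a list of (frame, start, end) tuples with 1-based inclusive
    coordinates. When an ORF is closed by a stop codon, `end` includes it.
    Only ORFs with at least `min_aa` codons before the stop are kept.
    """
    seq = seq.upper()
    orfs = []
    for offset in range(3):
        orf_start = offset  # 0-based start of the currently open ORF
        pos = offset
        while pos + 3 <= len(seq):
            if seq[pos:pos + 3] in STOP_CODONS:
                if (pos - orf_start) // 3 >= min_aa:
                    orfs.append((offset + 1, orf_start + 1, pos + 3))
                orf_start = pos + 3  # next ORF opens right after the stop
            pos += 3
        # An ORF still open at the end of the read (no stop codon reached).
        if (pos - orf_start) // 3 >= min_aa:
            orfs.append((offset + 1, orf_start + 1, pos))
    return orfs


if __name__ == "__main__":
    # Toy sequence, not the GOS read; min_aa is lowered so short ORFs are reported.
    toy = "ATGAAATAACCCGGGTTTTGA"
    for frame, start, end in find_orfs(toy, min_aa=2):
        print(f"frame {frame}: bases {start}-{end}")
    print("frame of base 198:", frame_of(198))
    print("frame of base 857:", frame_of(857))
```

To scan the reverse strand as in step b), the same function could be applied to the reverse complement of the read. Coordinates would then refer to the reverse-complemented sequence.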

