GOS 706020

From Metagenes

Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary!


Sequence
CAMERA AccNum : JCVI_READ_1092343625359
Annotathon code: GOS_706020
Sample :
  • GPS: 10°7'53"S; 135°26'58"W
  • Tropical South Pacific: 201 miles from F. Polynesia - International
  • Open Ocean (-30m, 28.6°C, 0.1-0.8 microns)
Authors
Team : BioCell2008
Username : chalia
Annotated on : 2009-02-05 17:29:59
  • caputo aurélia
  • vicente charlotte

Synopsis

  • Taxonomy: Proteobacteria (NCBI info)
    Rank: phylum - Genetic Code: Bacterial and Plant Plastid - NCBI Identifier: 1224
    Kingdom: Bacteria - Phylum: Proteobacteria - Class: - Order:
    Bacteria; Proteobacteria;

Genomic Sequence

>JCVI_READ_1092343625359 GOS_706020 genomic DNA
GTTAAACGACCAGAATTAGGTGAAGAAATATCTGATCACGATTGGGATAATTTTGTTTACAATAGAAAAAGCTTGAGAGGAAAGCATTGGGAGTTATGGC
AACATTTATCAGGTTGCAGACAATGGATTAAAGTTCAGAGAGATACAGCTACACACGAAATTTTTAAAACTCTTAAAGCAAACGAAGATATTTCATAATG
ACACAAAGTTTTAGATTAGAAACTGGTGGATTAATAAATAGAGATAAAAAAATTTCTTTTAAATTTAATGGTAAAAATTATTTTGGTTATGAGGGAGACA
CTCTTGCTTCTGCATTAATTGCCAATGGAGTTCATTTAATTGGAAGAAGTTTCAAATATCATAGACCAAGAGGTTTTTTTGGTGCTGGGGTTGATGAGCC
ATATGCAATAGTTCAATTATACAGAAACGGTGAAACAGAGCCAAATATTAAAGCTACTGAACAAGAACTTTTTGAAGGTCTTGAAGCAAAAAGTGTTAAT
TGTTGGCCGAGTGTGAATTTTGATGTTGGAGCTATAAATAATTTTTTAAAGATATTTCTTCCTGCAGGCTTTTATTACAAGACTTTTATGTGGCCAAAAA
GTTTTTGGTATAAAATTTATGAACCATTCATCAGAAAAGCTGCTGGTTTAGGCACTGCATCTATAAAACATGATAAAGAAAGATATGAACATAAATATGA
ATATTGTGATCTGCTAATCACAGGCTCACGTCCATCTGGATTAGCGAGTGCTTATTCAGCTGCAAAAAATGGTGCTAAAGTAATTCTCGCAGAGGACAAA
TCACGATTTGGTGGAACTCTATTAACCAGTGATGTCAATATAGGGAATCAATCAGTAAAGAGTGGGCAGATAGTATTGTTTCAGAACTTAAAGAAATGTC
TAATGTTACTATAAAAATAGGTC

Translation

[198 - 911/923]   direct strand
>GOS_706020 Translation [198-911   direct strand]
MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSV
NCWPSVNFDVGAINNFLKIFLPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASAYSAAKNGAKVILAED
KSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKKCLMLL
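
The translation above can be reproduced with a short script. This is a minimal sketch, not the tool used by the annotators: `translate_orf` is a hypothetical helper, the coordinates are taken as 1-based and inclusive, and the codon table is the standard one (the bacterial code, NCBI table 11, assigns the same amino acids to all 64 codons and differs only in the permitted start codons). The test fragment is the first 33 bases of the ORF, copied from read positions 198-230.

```python
# Standard codon table in TCAG order (same amino acid assignments as NCBI table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_orf(dna, start, end):
    """Translate dna[start..end] (1-based, inclusive) on the direct strand."""
    orf = dna[start - 1:end].upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(orf[i:i + 3], "X") for i in range(0, len(orf), 3))


# First 33 bases of the ORF (read positions 198-230).
orf_start = "ATGACACAAAGTTTTAGATTAGAAACTGGTGGA"
print(translate_orf(orf_start, 1, len(orf_start)))  # MTQSFRLETGG
```

Applied to the full 923 bp read with `start=198` and `end=911`, the same function should give the 238-residue translation shown above.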

Phylogeny

PROTOCOL:

a) Phylogeny.fr / ProtPars method
b) Phylogeny.fr / ProtDist/DnaDist-Neighbor method
---------------------------------------------------------------------------------------------------
RESULTS ANALYSIS:

a)
We cannot assign a taxonomic group, because our sequence is not clearly related to any single group.
In addition, our tree is not rooted: our attempts to root it did not succeed.
 
b) We can conclude that our sequence belongs to the Proteobacteria, because in this tree it groups with
the proteobacteria, and in particular with the a-proteobacteria (alpha-proteobacteria). In addition, the gene phylogeny appears
to be consistent with the species phylogeny.
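
The grouping seen in the distance tree can be illustrated with a simple pairwise distance. This is only a sketch under stated assumptions: PHYLIP ProtDist applies an amino acid substitution model, whereas the code below uses the uncorrected p-distance (fraction of differing residues) as a simpler stand-in, and `p_distance` is a hypothetical helper. The rows are the first 50 alignment columns, copied from the ClustalW2 output on this page.

```python
def p_distance(row_a, row_b):
    """Fraction of differing residues over columns where neither row has a gap."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    differing = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return differing / compared


# First alignment block (50 columns) for the query and three homologs.
query = "--MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLI"  # ma_sequence
others = {
    "CPelagibactersp": "--MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLV",
    "Bparapertussis": "--MTQQYRLNHGGLVDRRRPLTFRFDGIQYQGYHGDTLASALLANGVHLV",
    "Serythraea": "-MSNEFRLA-EGGRIDRDRPLSFRFDGREYVGYEGDTLASALLANGVHQV",
}

distances = {name: p_distance(query, row) for name, row in others.items()}
closest = min(distances, key=distances.get)

for name, dist in sorted(distances.items(), key=lambda item: item[1]):
    print(f"{name}: {dist:.3f}")
print("closest:", closest)
```

On this short block the Candidatus Pelagibacter row is the nearest to ma_sequence, which is in line with the ProtDist tree. A real analysis would use the full alignment and a corrected distance.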
---------------------------------------------------------------------------------------------------
RAW RESULTS:

a) Parsimony

Protein parsimony algorithm, version 3.66



One most parsimonious tree found:




  +--------------------------------------------------------------------Ppacifica[d-proteobacteria]    
  |  
  |  +-----------------------------------------------------------------ma_sequence     
 23  |  
  |  |                                                              +--Rxylanophilus[actinobacteria]    
  |  |  +----------------------------------------------------------22  
  |  |  |                                                           +--Tsp[g-proteobacteria]    
  +--2  |  
     |  |     +--------------------------------------------------------Bparapertussis[b-proteobacteria]     
     |  |     |  
     |  |     |                 +--------------------------------------Rbacterium[a-proteobacteria]    
     |  |     |                 |  
     |  |     |                 |  +-----------------------------------Rlitoralis[a-proteobacteria]    
     +-21     |        +-------20  |  
        |     |        |        |  |                             +-----Asp[actinobacteria]    
        |     |        |        |  |  +-------------------------18  
        |     |        |        +-19  |                          |  +--Rhsp[actinobacteria]    
        |     |        |           |  |                          +-17  
        |     |        |           |  |                             +--Serythraea[actinobacteria]    
        |     |        |           |  |  
        |     |        |           +-16                    +-----------Bphymatum[b-proteobacteria]    
        +-----3        |              |                    |  
              |        |              |        +----------15        +--Bthailandensis[b-proteobacteria]    
              |        |              |        |           |  +----14  
              |        |              |        |           |  |     +--Bpseudomallei[b-proteobacteria]    
              |        |              |        |           +-13  
              |  +-----7              +-------11              |     +--Bcenocepacia[b-proteobacteria]    
              |  |     |                       |              +----12  
              |  |     |                       |                    +--Bdolosa[b-proteobacteria]    
              |  |     |                       |  
              |  |     |                       |              +--------Cpsychrerythraea[g-proteobacteria]    
              |  |     |                       +-------------10  
              |  |     |                                      |  +-----Paeruginosa[g-proteobacteria]    
              |  |     |                                      +--9  
              |  |     |                                         |  +--Pentomophila[g-proteobacteria]     
              +--4     |                                         +--8  
                 |     |                                            +--Pmendocina[g-proteobacteria]     
                 |     |  
                 |     |                                         +-----Smeliloti[a-proteobacteria]     
                 |     +-----------------------------------------6  
                 |                                               |  +--Ssp[a-proteobacteria]     
                 |                                               +--5  
                 |                                                  +--R_sp[a-proteobacteria]     
                 |  
                 |                                                  +--CPelagibacteru[a-proteobacteria]     
                 +--------------------------------------------------1  
                                                                    +--CPelagibactersp[a-proteobacteria]     

 

---------------------------------------------------------------------------------------------------

b) ProtDist

                   +-----------Tsp[g-proteobacteria]    
      +------------1 
      !            +-----------Rxylanophilus[actinobacteria]    
      !  
      !                     +-CPelagibactersp[a-proteobacteria]     
      !                   +-2 
      !        +----------3 +-CPelagibacteru[a-proteobacteria]     
      !        !          ! 
      !     +-22          +-ma_sequence     
      !     !  !  
      !     !  +-------------Bparapertussis[b-proteobacteria]     
      !  +-21  
  +--17  !  !         +----R_sp[a-proteobacteria]     
  !   !  !  !  +------4 
  !   !  !  +-20      +---Ssp[a-proteobacteria]     
  !   !  !     !  
  !   !  !     +-----------Smeliloti[a-proteobacteria]     
  !   !  !  
  !   !  !                  +--Pmendocina[g-proteobacteria]     
  !   !  !                +-9 
  !   !  !            +--10 +--Pentomophila[g-proteobacteria]     
  !   !  !            !   !  
  !   !  !        +--11   +--Paeruginosa[g-proteobacteria]    
  !   !  !        !   !  
  !   +-19        !   +-------Cpsychrerythraea[g-proteobacteria]    
  !      !        !  
  !      !     +-15         +Bdolosa[b-proteobacteria]    
  !      !     !  !       +-6 
  !      !     !  !       ! +Bcenocepacia[b-proteobacteria]    
  !      !     !  !     +-7 
  !      !     !  !     ! ! +Bpseudomallei[b-proteobacteria]    
  !      !     !  +-----8 +-5 
  !      !  +-16        !   +Bthailandensis[b-proteobacteria]    
  !      !  !  !        ! 
  !      !  !  !        +--Bphymatum[b-proteobacteria]    
  !      !  !  !  
  !      !  !  !      +------Serythraea[actinobacteria]    
  !      +-18  !   +-12  
  !         !  +--13  +---------Rhsp[actinobacteria]    
  !         !      !  
  !         !      +----------Asp[actinobacteria]    
  !         !  
  !         +------------Rlitoralis[a-proteobacteria]    
  !  
 14----------Rbacterium[a-proteobacteria]    
  !  
  +-----------------------------------------------Ppacifica[d-proteobacteria]    


Annotator commentaries

Our sequence comes from the Tropical South Pacific and is 923 base pairs (bp) long.

The ORF we chose is complete because it starts with a methionine and ends with a stop codon. We established that the ORF is complete using Blastp: the methionine aligns with the start of the homologous sequences. Our ORF is coding because it is fairly long (714 bp) and gives good Blastp results. However, we cannot determine the molecular weight, because it is impossible to tell whether the frameshift is a mutation or a sequencing error. We therefore do not know whether to take the molecular weight of the truncated protein or the sum of the molecular weights of the two "discontinuous" parts of the protein.
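
The length and completeness criteria described above can be checked mechanically. The sketch below assumes that the reported coordinates [198 - 911] are 1-based, inclusive, and exclude the stop codon; `orf_length` and `orf_summary` are hypothetical helpers, and the short demo sequence is invented for illustration only.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_length(start, end):
    """Length in bp of an ORF given 1-based inclusive coordinates."""
    return end - start + 1


def orf_summary(seq, start, end):
    """Check the start codon, the reading frame and the codon following the ORF."""
    seq = seq.upper()
    length = orf_length(start, end)
    return {
        "length_bp": length,
        "codons": length // 3,
        "in_frame": length % 3 == 0,
        "starts_with_atg": seq[start - 1:start + 2] == "ATG",
        # Assumption: the stop codon sits immediately after position `end`.
        "followed_by_stop": seq[end:end + 3] in STOP_CODONS,
    }


# Coordinates reported for this read: 714 bp, i.e. 238 codons.
print(orf_length(198, 911), orf_length(198, 911) // 3)

# Invented toy sequence: 2 flanking bases, a 9 bp ORF, a stop codon, 2 flanking bases.
toy = "CCATGAAATTTTAAGG"
print(orf_summary(toy, 3, 11))
```

Running `orf_summary` on the full read with `start=198` and `end=911` would test the same three criteria on the real data.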

Our hypothesis for the function of the protein is "sarcosine oxidase", because this is the function that predominates among the hits with the best E-values.

We found one protein domain, NAD(P)-binding, using InterPro.

According to InterProScan, the biological process of our fragment would be "protein biosynthesis", based on the following description: "The chemical reactions and pathways, including anabolism and catabolism, by which living organisms transform chemical substances. Metabolic processes typically transform small molecules, but also include macromolecular processes such as DNA repair and replication, and protein synthesis and degradation". We can also say that it is involved in several biological processes, such as protein degradation.

We can also suppose that the molecular function of the initial fragment is "enzyme regulator activity", based on what we found: "Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic." This function agrees with the one found with Blast, because sarcosine oxidase is an enzyme that, in the presence of water and oxygen, converts sarcosine into glycine, formaldehyde and hydrogen peroxide.

We assigned "soxA" as the gene name after checking the GenBank record of a homolog with an e-value of 1e-118. The same gene name is found for many other homologs.

For our tree, we had some difficulty choosing our study group and our outgroup. We chose the proteobacteria as the study group because they give very good results (1e-118), and the actinobacteria as the outgroup because they have acceptable e-values (5e-30). The most coherent tree is the one built with ProtDist, because it supports the idea that our sequence belongs to the proteobacteria. The parsimony tree does not allow us to determine a taxonomic group, because in it our sequence does not group with any taxonomic group.

Multiple Alignment

PROTOCOL:

ClustalW2
---------------------------------------------------------------------------------------------------
RESULTS ANALYSIS:

Our sequence is much shorter than the other sequences in our multiple alignment.
It shares similarities with well-conserved regions, for example from position +33 to +40.
We can therefore suppose that our ORF (= ma_sequence) fits correctly into the family of its homologs.
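
The conserved stretch mentioned above can be located by scanning the alignment column by column. This is a minimal sketch using four rows from the first block of the ClustalW2 output on this page; `conserved_columns` and `consecutive_runs` are hypothetical helpers, and a column counts as conserved only when every row carries the same residue and no gap.

```python
def conserved_columns(rows):
    """Return the 1-based columns where all rows share the same residue (no gaps)."""
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("aligned rows must have the same length")
    return [
        index + 1
        for index, column in enumerate(zip(*rows))
        if "-" not in column and len(set(column)) == 1
    ]


def consecutive_runs(columns):
    """Group sorted column numbers into (first, last) runs of adjacent columns."""
    runs = []
    for col in columns:
        if runs and col == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], col)
        else:
            runs.append((col, col))
    return runs


rows = [
    "--MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLI",  # ma_sequence
    "--MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLV",  # CPelagibactersp
    "--MTQQYRLNHGGLVDRRRPLTFRFDGIQYQGYHGDTLASALLANGVHLV",  # Bparapertussis
    "-MSNEFRLA-EGGRIDRDRPLSFRFDGREYVGYEGDTLASALLANGVHQV",  # Serythraea
]

runs = consecutive_runs(conserved_columns(rows))
first, last = max(runs, key=lambda run: run[1] - run[0])
print(first, last, rows[0][first - 1:last])
```

With these four rows the longest conserved run covers alignment columns 35 to 42 (motif GDTLASAL). Because ma_sequence starts with two gap columns, these columns correspond to residues 33 to 40 of the ORF.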
 
---------------------------------------------------------------------------------------------------
RAW RESULTS:

CLUSTAL 2.0.10 multiple sequence alignment


CPelagibactersp[a-proteobacter      --MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLV 48
CPelagibacteru[a-proteobacteri      --MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLI 48
ma_sequence                         --MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLI 48
Bparapertussis[b-proteobacteri      --MTQQYRLNHGGLVDRRRPLTFRFDGIQYQGYHGDTLASALLANGVHLV 48
R_sp[a-proteobacterie]              --MTEVNRLD-GGQINRAKEVSFTFDGHRYKGYEGDTLASALLANGERLM 47
Ssp[a-proteobacterie]               --MTQVNRIS-GGLIDRSTELNFTFDGKNYQGYAGDTLASALLANGVRLM 47
Smeliloti[a-proteobacterie]         --MSSYRLPK-RGLVDRNVPLSFTFDGRPMQGLEGDTLASALLANGRMLV 47
Pmendocina                          -MSQVNRLA-QGGRIDRSQPLTFSFNGQTYQGYAGDTLAAALLANGVDVI 48
Pentomophila                        -MSQTYRLA-SGGRIDRSKVLNFSFNGKTYQGYAGDTLAAALLANGVDIV 48
Paeruginosa[g-proteobacteria]       -MSQINRLS-SGGRIDRNRPLTFSFNGQHYQGYAGDTLAAALLANGVDIV 48
Cpsychrerythraea[g-proteobacte      -MSQVNRIAGSSKRINRNRTLTFSFNGKEYTGFEGDTVASALLANGVDVV 49
Bdolosa[b-proteobacteria]           -MSQKDRLG-TGGRINRAIPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bcenocepacia[b-proteobacteria]      -MSQKDRLG-TGGRINRAIPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bpseudomallei[b-proteobacteria      -MSQKDRLG-AGGRINRAQPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bthailandensis[b-proteobacteri      -MSQKDRLG-AGGRINRAQPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Bphymatum[b-proteobacteria]         -MSQKNRLG-AGGRINRAIPLTFTFNGRTYQGFQGDTLASALLANGVHFV 48
Serythraea[actinobacteria]          -MSNEFRLA-EGGRIDRDRPLSFRFDGREYVGYEGDTLASALLANGVHQV 48
Rhsp[actinobacteria]                -MNAPFRTR-QGGRLDRNTSYTFTFDGRELTGHPGDTLGSALLANGVHQI 48
Asp[actinobacteria]                 MTSQNARLA-AGGRIDRSISWRFTVDGEEFTGHPGDTLASALLANGRIAA 49
Rlitoralis[a-proteobacterie]        --MNEFRVE-GRGRVNADKPVKFTFDGEIYKGFEGDTVASALLANGVHLM 47
Rbacterium[a-proteobacteria]        ---MSHRLDGKGRLIDRSKKLRFTFNGKAMTGYAGDTLASALLGSGQSVM 47
Tsp[g-proteobacteria]               --MAKRLPARDGEWIDRSRTLRFSFEGREYSAFAGDTISSALLANGVRVL 48
Rxylanophilus[actinobacteria]       --MSSRLPYQEGEWIDRSKPLTFSFEGKRFTGFSGDTITSALWASGERVL 48
Ppacifica[d-proteobacteria]         ---------MLEPDRETGETVHIRFDETLIAARPEDTLATALIGAGELMT 41
                                                   :      : .:     .   **: :** . *    

CPelagibactersp[a-proteobacter      GRSFKYHRPRGFFGAGVDEPYAIVQLYRNGETE---PNIKATEQELFEGL 95
CPelagibacteru[a-proteobacteri      GRSFKYHRPRGFFGAGVDEPYAIVQLYRNNETE---PNVKATEQELFEGL 95
ma_sequence                         GRSFKYHRPRGFFGAGVDEPYAIVQLYRNGETE---PNIKATEQELFEGL 95
Bparapertussis[b-proteobacteri      GRSFKYHRPRGIYTAGVEEMNALVDVLKEGQAD---PNTRATVVELEDGI 95
R_sp[a-proteobacterie]              GRSFKYHRPRGVLTAGSEEPNALVELRKGGRQE---PNTRATVIELFDGL 94
Ssp[a-proteobacterie]               GRSFKYHRPRGVLAAGSEEPNALVELRSGGRQE---PNTRATVAEIYEGL 94
Smeliloti[a-proteobacterie]         GRSFKYHRPRGILTAGAAEPNALVTVGRGGRAE---PNTRATMQELYEGL 94
Pmendocina                          GRSFKYSRPRGIVAAGAEEPNAVLQIGSTEAAQ--IPNVRATQQALYANL 96
Pentomophila                        GRSFKYSRPRGIIAAGTEEPNAILQIGSSEATQ--IPNVRATQQALYAGL 96
Paeruginosa[g-proteobacteria]       GRSFKYSRARGIVAAGAEEPNAILQIGSREATQ--IPNVRATQQALYGGL 96
Cpsychrerythraea[g-proteobacte      GRSFKYSRPRGIITSDSQEPNAIFQIGSTQATT--IPNPRATQTDLYQGL 97
Bdolosa[b-proteobacteria]           ARSFKYHRPRGIVTAGVEEPNAVVQLETG-PYT--VPNARATEIELYQGL 95
Bcenocepacia[b-proteobacteria]      ARSFKYHRPRGIVTADVAEPNAVVQLETG-PYT--VPNARATEIELYQGL 95
Bpseudomallei[b-proteobacteria      ARSFKYHRPRGIVTAGVDEPNAVVQLETG-AYT--VPNARATEVELYQGL 95
Bthailandensis[b-proteobacteri      ARSFKYHRPRGIVTAGVDEPNAVVQLETG-AHT--VPNARATEIELYQGL 95
Bphymatum[b-proteobacteria]         ARSFKYHRPRGIVTADVAEPNAVVQLERG-AYT--VPNARATEIELYQGL 95
Serythraea[actinobacteria]          GTSIKHGRPRGIMAAGVEEPNALVQIEKPFP----EPMLTATTVPLRDGL 94
Rhsp[actinobacteria]                TTSIKLGRPRGITAAWAEDTGGLVQIEEPFP----EPMLLATTIELFDGL 94
Asp[actinobacteria]                 GNSLYEDRPRGIMSAGVEESNALVRVEARFPGHVAESMLPATTVTLVDGL 99
Rlitoralis[a-proteobacterie]        GRSFKYHRPRGVVTAGSEEPNALIGTTRGKGRF--EPNTRATIQEIYEGL 95
Rbacterium[a-proteobacteria]        GRSFKYHRPRGVVASGVEEPNALMNLGEGGRFE---PNQRATTTPLFDGL 94
Tsp[g-proteobacteria]               GRSFKYHRPRGVFSAANHDSNVLLQSDSD-------FNIRGDVTAVADGM 91
Rxylanophilus[actinobacteria]       GRSFKYHRPRGVLSFANHDVNVMVQNGAV-------PNIRADVTLIKSNQ 91
Ppacifica[d-proteobacteria]         SRSPKYRRPRGAYCLAGDCGTCLVRVDGR-------PNVRACMTPVREGM 84
                                      *    *.**           :.                .    :  . 

CPelagibactersp[a-proteobacter      EAKSVNCWPSVNFDVGAINNFLKI-FLPAGFYYKTFMWPKSFWYKVYEPF 144
CPelagibacteru[a-proteobacteri      EATSVNCWPSVNFDIGAINNLLKI-FLPAGFYYKTFMWPKSFWYKVYEPF 144
ma_sequence                         EAKSVNCWPSVNFDVGAINNFLKI-FLPAGFYYKTFMWPKSFWYKIYEPF 144
Bparapertussis[b-proteobacteri      EVSSQNRWPSLRFDVRSFHGMISR-LIPAGFYYKTFMWPAKFWPK-YEHM 143
R_sp[a-proteobacterie]              EAAPQNAWPSLRFDAMAVNDRFSN-FLTAGFYYKTFMWPKAFWEKIYEPI 143
Ssp[a-proteobacterie]               SANSQNRWPSLKHDVMAINDRFSA-FLSAGFYYKTFMWPRAFWEKLYEPV 143
Smeliloti[a-proteobacterie]         EARSQNRWPSLAFDIGALNGLLSP-FLGAGFYYKTFMWPAPLWEKLYEPV 143
Pmendocina                          TATSTNGWPSVNTDLMGILGKVGGGMMPPGFYYKTFMYPQNLWL-TYEKY 145
Pentomophila                        VATSTNGWPNVNNDMMGIIGKVGGNMMPPGFYYKTFMYPKSFWM-TYEKY 145
Paeruginosa[g-proteobacteria]       VATSTNGWPNVQNDLMGIFGKVGGKLMPPGFYYKTFMYPQSMWM-TYEKY 145
Cpsychrerythraea[g-proteobacte      TASSTNGWPNVDFDLMGTVGKLGGSMMPPGFYYKTFMFPQSLWM-SYEHL 146
Bdolosa[b-proteobacteria]           VATSVNAEPSLENDKYAINQKLSR-FLPAGFYYKTFMWPRRMWP-KYEEK 143
Bcenocepacia[b-proteobacteria]      VATSVNAEPTLENDKYAINQKFSR-FMPAGFYYKTFMWPRNMWP-KYEEK 143
Bpseudomallei[b-proteobacteria      VATSVNAKPSLEHDRMAVMQKLAR-FLPAGFYYKTFMWPRNLWP-KYEEK 143
Bthailandensis[b-proteobacteri      VATSVNAKPSLEHDRMAVMQKFAR-FLPAGFYYKTFMWPRNLWP-KYEEK 143
Bphymatum[b-proteobacteria]         VATSVNAEPNLEHDRMAINQKFAR-FMPAGFYYKTFMWPAKWWP-KYEEK 143
Serythraea[actinobacteria]          EATGLP-------------------------------------------- 100
Rhsp[actinobacteria]                VARGIP-------------------------------------------- 100
Asp[actinobacteria]                 KADLLN-------------------------------------------- 105
Rlitoralis[a-proteobacterie]        DTESQNKWPTLQFDLGAINDRLYM-LFSAGFYYKTFMWPRSFWDSVYEPL 144
Rbacterium[a-proteobacteria]        TATSQNHWPSLEFDIGAVNDLAAR-FLPAGFYYKTFMFPRFAWKHLFEPF 143
Tsp[g-proteobacteria]               RLSAINTQGGLDKDRGRFLDRLSP-LLPVGFYYKTFHRPKALFP-FWENQ 139
Rxylanophilus[actinobacteria]       NLRAVNTIGGLKLDLGQINNRLSR-FLPVGFYYKAFHKPARLFP-LWEKF 139
Ppacifica[d-proteobacteria]         RVSSQNTYRPRRLDPTAIVDKVFV----KGMDHHHLMVRPRIANQIMQEF 130
                                                                                      

CPelagibactersp[a-proteobacter      IRKAAGLGVASTKHDKERYEHKYEYCDLLIAGSGPSGLASAYAAAKNGAR 194
CPelagibacteru[a-proteobacteri      IRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPSGLASAYAAAKNGAR 194
ma_sequence                         IRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASAYSAAKNGAK 194
Bparapertussis[b-proteobacteri      IRHAAGLGRAPLVRDRDRYEKQHAYCDVLVVGAGPAGLAAARSACQAGLR 193
R_sp[a-proteobacterie]              IRKAAGLGSISFEEDPDLYDKGFLHCDLLIIGSGPSGLAAALTAGRSGAR 193
Ssp[a-proteobacterie]               IRKAAGLGSLSGEGDPDAYDKGYLHCDLLVIGAGPAGLSAALTAGRGGAQ 193
Smeliloti[a-proteobacterie]         IRRAAGLGKASYEADPDAYEKSWAHCDLLVIGAGPTGLAAALTAGRAGAR 193
Pmendocina                          IRKAAGLGRSPKENDPDIYDYMNQHCDVLVVGAGPAGLAAALAAGRSGAR 195
Pentomophila                        IRKAAGLGRAPLQNDPDSYDYMNQHCDVLIVGAGPAGLAAALAAARSGAR 195
Paeruginosa[g-proteobacteria]       IRKAAGLGRAPTEVDPDSYDWMNHHCDVLVVGGGPAGLAAALAAARSGAR 195
Cpsychrerythraea[g-proteobacte      IRKGAGLGASPQQNDPDSYDKMHHHCDVMIVGGGPAGLAAALSAAQTGAR 196
Bdolosa[b-proteobacteria]           IREAAGLGKAPDTLDADRYDKRYAHCDVLVVGGGPSGLAAAHAAATAGAR 193
Bcenocepacia[b-proteobacteria]      IREAAGLGKAPEVLDADRYDKCYAHCDVLVVGGGPSGLAAAHAAATAGAR 193
Bpseudomallei[b-proteobacteria      IREAAGLGKAPDTLDADRYDKCYAHCDVLVVGGGPTGLAAAHAAAVNGAR 193
Bthailandensis[b-proteobacteri      IREAAGLGKAPDTLDADRYDKCYAHCDVLVVGGGPAGLAAAHAAAVNGAR 193
Bphymatum[b-proteobacteria]         IREAAGLGKAPEVLDADRYDKCYAHCDVLVVGGGPTGLAAAHAAASSGAR 193
Serythraea[actinobacteria]          -----GQGRLAEEADPARYDTMHAHCDVLVVGAGPAGLSAALSAARSGAR 145
Rhsp[actinobacteria]                -----GQGRLAEIADSAKYDAKHVHTDLLVAGAGPAGLAAALTAARAGAR 145
Asp[actinobacteria]                 -----GLGRLDPEEDRAEYDKKFVHTDVLVIGGGPAGLAAAREAVRTGAR 150
Rlitoralis[a-proteobacterie]        IRKAAGLGKAPTEVDPDHYASRYLHCDVLIVGAGPSGIAAALTAGRAGSK 194
Rbacterium[a-proteobacteria]        IRQSAGLGQVPKEPDADRYEHVYHHTDVLVIGGGVAGLAAARAAAAGGAK 193
Tsp[g-proteobacteria]               IRKRAGLGRIDTQWPELRLPKRHGFCDLLVVGAGPSGLSAAIAAAESGAR 189
Rxylanophilus[actinobacteria]       IRKAAGLGYVNVNSKRRLWSKAYGFADVLVIGAGAAGLSAAISAAEAGAK 189
Ppacifica[d-proteobacteria]         ARNLTGFGELPEVVGERGCEHIAHELPVLIIGAGPAGRALAARLREAGID 180
                                         * *                   ::: *.  :* : *      *  

CPelagibactersp[a-proteobacter      VILAEDKSRFGGTLLT------SDVNIGNQTGKEWADGIISELKEMPNVT 238
CPelagibacteru[a-proteobacteri      VILAEDKPRFGGTLLT------SEVNIGNQTGKEWAENIISELKEMPNVI 238
ma_sequence                         VILAEDKSRFGGTLLT------SDVNIGNQS-----------VKSGQIVL 227
Bparapertussis[b-proteobacteri      VLLVDEKSRVGGTLPG------SNTEIEGVAGAKWATAVERELRESGHAS 237
R_sp[a-proteobacterie]              VILADEDFRMGGRLNS------ETLALGDQSGADWAAAAIAELADMPNVR 237
Ssp[a-proteobacterie]               VILADEDFQLGGRLLS------DAQSLCNQSNAEWVAATQAELIALPNVR 237
Smeliloti[a-proteobacterie]         VILVDEGSLPGGSLLS------DTATIDGKAAADFARDTSDELRSMPNVQ 237
Pmendocina                          VILADEQEEFGGSLLS------TREMLDDKPAADWAVKAIAELQKMPEVT 239
Pentomophila                        VILADEQEEFGGTLLD------SRETLDGKPAAEWVNAVVAELESLPEVT 239
Paeruginosa[g-proteobacteria]       VILADEQEEFGGSLLD------TRETLDGKPAAEWVADAVAELQGLPEVI 239
Cpsychrerythraea[g-proteobacte      VIISDEQNEFGGSLLC------STQQIDGQLPSQWVEKTVAQLSEMDNVM 240
Bdolosa[b-proteobacteria]           VMLVDDQRELGGSLLS------CRAEIDGKPALQWVEKIEAELRKLPDVT 237
Bcenocepacia[b-proteobacteria]      VILVDDQRELGGSLLS------CRAEIDAKPALQWVEKIEAELRKLPDVT 237
Bpseudomallei[b-proteobacteria      VILVDDQRELGGSLLA------CRAEIDGKPALQWVEKIEAELAKLPDMS 237
Bthailandensis[b-proteobacteri      VILVDDQRELGGSLLA------CRAEIDGKPALQWVEKIEAELSKLPDVK 237
Bphymatum[b-proteobacteria]         VILVDDQRELGGSLLS------CKTEIDGHAALSWVEKIEAELSRMPDVK 237
Serythraea[actinobacteria]          VIVADADAEFGGSLLG------IGERLDDAPATEWVRRAVAELATYPEVR 189
Rhsp[actinobacteria]                VVLVDEQSEAGGDLLG------STDLIDGAPALDWVAAAVAELATYPDVL 189
Asp[actinobacteria]                 VMLLDDQPELGGTLLSGSTAPDLAEAIEGKPSLEWVADVEAELVSAAECT 200
Rlitoralis[a-proteobacterie]        VVLVDENTEMGGTLLS-----EPAVSIEGQSAWDWLAAATNELDQLPNVR 239
Rbacterium[a-proteobacteria]        VMVLEQTAHWGGRAPVD------GGQIDGLDPETWVNNAVQELETAENVT 237
Tsp[g-proteobacteria]               VWLVDENARAGGSLN-------------DRQDGALRDQLLARLADLPNLT 226
Rxylanophilus[actinobacteria]       VVLVDENPRVGGSLTY--------AKTINNNGTSVLADLARKVESYPNIE 231
Ppacifica[d-proteobacteria]         HAIVDRLDRPQLRAAP-----ALGAEAPALAPVEDVLADTGVFGVYPGPK 225
                                      : :                                     .       

CPelagibactersp[a-proteobacter      VKNRSQVFGYYDHNMLVMSERISDHL-PSTKKFHPKQRLWYIRAKEVLIS 287
CPelagibacteru[a-proteobacteri      VKNRSQVFGYYDHNMLVMSEKLSDHL-PKTKKYNPKQRLWYIRAKEVLIS 287
ma_sequence                         FQNLKKCL------MLL--------------------------------- 238
Bparapertussis[b-proteobacteri      VMLRTTAFGYYDHDTVALAQQCDT----PTNPHGATQRLWYVHAKQVVLA 283
R_sp[a-proteobacterie]              LMSRTTIVGAFDHGTYGAVERVQDHV-AVPQEGKPRQIFWRIYSRRALLC 286
Ssp[a-proteobacterie]               VMPRTTVFGAYDHGVYGAVERNADHL-VAPEENKPRQTLWRIYSRRAVVA 286
Smeliloti[a-proteobacterie]         VLVRTTAFGWYDGNVFGAVERVQKHV-REPASHLPVERLWRIVAGKALLA 286
Pmendocina                          LLPRATVNGYHDHNFLTIHQRLTDHLGEVAPMGQPRQRMHRVRAGRVVLA 289
Pentomophila                        LLPRSTVNGYHDHNFLTIHERLTDHLGDRAPIGQVRQRVHRVRANRVVLA 289
Paeruginosa[g-proteobacteria]       LLPRSTVNGYHDHNFLTIHERRTDHLGEVAPLGQVRQRVHRVRAKRVVLA 289
Cpsychrerythraea[g-proteobacte      LLPRSTVFGYYDHNLVGINERRTDHLGEHQ-LQSTRQRVHKVRAKQVILA 289
Bdolosa[b-proteobacteria]           ILSRSTAFGYQDHNLVTITQRLTDHL-PVSMRKGTRELLWKVRAKRVILA 286
Bcenocepacia[b-proteobacteria]      ILSRSTAFGYQDHNLVTVTQRLTDHL-PVSMRKGTRELLWKVRAKRVILA 286
Bpseudomallei[b-proteobacteria      ILTRSTAFGYQDHNLVTVVQRLTDHL-PVSMRKGTREMIWKVRAKRVILA 286
Bthailandensis[b-proteobacteri      ILTRSTAFGYQDHNLVTVVQRLTDHL-PVSMRKGTREMIWKVRAKRVILA 286
Bphymatum[b-proteobacteria]         ILSRSTAFGYQDHNLVTVTQRLTDHQ-PVSMRKGTRELLWKIRAKRVILA 286
Serythraea[actinobacteria]          QLPSTTVFGHYDDNYLVAVENR----GEDAP---SRQRIWRVRAREVVLA 232
Rhsp[actinobacteria]                HLQRTTAFGNYDDGFVLALQRRTDHLGVEAPAALSRQRVWRIRARHILVA 239
Asp[actinobacteria]                 VLNRTTAFGAYDANYIVAVQNRTDHLSSPAAPGVSRQRIWHIRAKQVVVA 250
Rlitoralis[a-proteobacterie]        LMTRTTAMGYYHQNMIGMVQKLTDHM-ADIPDGAPRERMWRVRAHEVVLA 288
Rbacterium[a-proteobacteria]        LRLGTMGAGVYDHGYVLGYERVAD---ATPGDDRPRHRLWRIRAKQIVTA 284
Tsp[g-proteobacteria]               FLPDTVAAGWYADHYVPLVTPKG---------------LIRLRARAVIVA 261
Rxylanophilus[actinobacteria]       FWSDTVASAYFEDQWVPLVHSDGG--------------MTKMRAKSVVVA 267
Ppacifica[d-proteobacteria]         LGLEGEEEGP-DRALVAASEGGESSN---------HERLYAFRPRHLVFA 265
                                                                                      

CPelagibactersp[a-proteobacter      SGSIERPLVFGNNDTPGVMLSSAAKEYLKVYGVLVGKKPLVFTNNDSGYE 337
CPelagibacteru[a-proteobacteri      SGSIERPLVFGNNDTPGVMLSSAAKEYLKVYGVLVGKKPLIFTNNDSGYE 337
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      AGAIERPCVFANNDLPGVMLASAARTYCNEFGVAVGRRVLVLANNDSAYE 333
R_sp[a-proteobacterie]              AGAMERPIAFADNDRPGVMLASAVRSYLNRWAAAPAQEIAIFTNNDDGHR 336
Ssp[a-proteobacterie]               IGAIERPIAFENNDRPGVMLAGATRAYANRWAVTPARSVVVFANNDDAHQ 336
Smeliloti[a-proteobacterie]         TGAEERPLVFGGNDRPGVMMAGAMRAYLNRYGVAPGRTPAIFTTNDTGYT 336
Pmendocina                          TGAHERPLVYANNDVPGNMLADAVSTYVRRYGVAPGQKLVLSTNNDYAYR 339
Pentomophila                        AGAHERPLVYGNNDLPGNMLAGAVSTYVRRYGVAPGRKLVLSTNNDHAYR 339
Paeruginosa[g-proteobacteria]       AGAHERPLVYGNNDLPGNMLAGAVSTYVRRYGVAPGKKLVLATNNDYAYR 339
Cpsychrerythraea[g-proteobacte      TGAHERPLVYGNNDVPGCMLANAISTYINRYDVVPGKQLVLMTTNDNAYK 339
Bdolosa[b-proteobacteria]           TGAHERPIVFGNNDLPGVMLAGAVSTYVHRFGVLPGRNAVVFTNNDRAYQ 336
Bcenocepacia[b-proteobacteria]      TGAHERPIVFGNNDLPGVMLAGAVSTYVHRFGVLPGRNVVVFTNNDRAYQ 336
Bpseudomallei[b-proteobacteria      TGAHERPLVFGNNDLPGVMTASAVSAYIHRYGVLPGRVAVVATNNDRGYQ 336
Bthailandensis[b-proteobacteri      TGAHERPLVFGNNDLPGVMTASAVSTYIHRYGVLPGRVAVVATNNDRGYQ 336
Bphymatum[b-proteobacteria]         TGAHERPIVFGNNDLPGVMLASAVSTYIHRFGVMPGRNAVVFTNNDAGYR 336
Serythraea[actinobacteria]          TGSHERPLVFAGNDRPGTMLAGSARTYLHRYGVVPGRRAVVFTANDSAYA 282
Rhsp[actinobacteria]                AGAHERPVVFTDNDRPGIMLAHGARTFLHRYGVKVGEQAVVFTTNDSAYE 289
Asp[actinobacteria]                 PGAHERPLVFENNDRPGIMLASAVRSYLNRYAVAAGQRVVISTTNDSAYA 300
Rlitoralis[a-proteobacterie]        QGAIERPMVFDGNDCPGVMMAGAAQTFLNRFGVLVGRRPVVLTSHDSAWY 338
Rbacterium[a-proteobacteria]        TGAIERPLSFPGNDVPGVMLASAVRDYVVNWGVAPGRRTVIVTNNDDAYL 334
Tsp[g-proteobacteria]               GGVYEQPAVFRNNDLPGVMLASAALRLARRYGVAACESAVILAANSDAYR 311
Rxylanophilus[actinobacteria]       SGVMEQPAVFRNNDLPGIMLGSAAQRLIYRYAVKPFDRGIVLAANSDAYG 317
Ppacifica[d-proteobacteria]         TGCREPMIPFANNDLPGVVGARGLLAALRRAGSRLSGRCVVVGEGEAAEG 315
                                                                                      

CPelagibactersp[a-proteobacter      TAIEFKKNGVDPI-ILDTRK-DPHSEIIDEAKNLGINIKFSYVVVAAQGY 385
CPelagibacteru[a-proteobacteri      TAIEFKKNGVDPI-ILDTRK-EPKSEIIDEAKKLDIEIKFSYVVVAAKGY 385
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      AALDLKRAGIDIVGVVDQRE-AVSASLSETLASLRIPHHRGSTIKKATGR 382
R_sp[a-proteobacterie]              TAADLIAKGVSVPAVIDVR--------ADAPSVAGTELLAGAEVIGTSGR 378
Ssp[a-proteobacterie]               TAKDLIAKGIEVHAVVDTR--------SDAPGIEGTELLAGAQIIGTKGR 378
Smeliloti[a-proteobacterie]         LAQELEAAGVDVVAIVDSRP-A-----AGVDYRGKARLVREAVVCGTKGG 380
Pmendocina                          VVLDWLDAGRQVVAVADARS-NPRGSWVEEARRRGVRVLTGSAVVEARGS 388
Pentomophila                        CALDWHDAGLQVVAIADARH-NPRGSLVEEARAKGIRILTSSAVIEAKGS 388
Paeruginosa[g-proteobacteria]       VALDWQEAGLQVVAIADARA-NPRGEWVEEARQRGMRVITGSSVIEARGG 388
Cpsychrerythraea[g-proteobacte      TAIDWHQAGRKVVAIVDTRS-TSNGDLVNKVKKLGIDIIFGHGVIEVKGS 388
Bdolosa[b-proteobacteria]           TALDLKACG-AKVTVVDSRA-SSNGALPAAAKRQGVTVMSGAVVTAASGK 384
Bcenocepacia[b-proteobacteria]      TALDLKACG-AKVTVVDSRA-SSNGALPAAAKRQGVTVMSGAVVTAASGK 384
Bpseudomallei[b-proteobacteria      CALDLKACG-AKVTVVDARA-STRGALPAVAKRHGITVMSGAAVSAAAGK 384
Bthailandensis[b-proteobacteri      CALDLKACG-AKVTVVDARA-STRGALPAVAKRNGVTVMSGAVVSAAAGK 384
Bphymatum[b-proteobacteria]         CALDMKACG-ASVTVVDPRA-QGNGALQAAARRHGVKIMNNAAVMTAHGK 384
Serythraea[actinobacteria]          AAVDLHDAGVAIAAIIDVRD-VVSTRWASHCIERGIPIHPEAAVVSTSGT 331
Rhsp[actinobacteria]                AAIDLHDAGVRINAIVEARD-DAPARWQRECDARGITIRAASVVSGTRGN 338
Asp[actinobacteria]                 LASDLRAAGVKVAAVVDAR--PRLTEVAAAAVESGTRVLIGSAVANTSAS 348
Rlitoralis[a-proteobacterie]        SAFDMADAGAEVVAIVDTRP-EVAPSLVQQAMKRGIETLVGHTATGTKGR 387
Rbacterium[a-proteobacteria]        TALALKEAKLEVPAIIDVRA-TLVGPLADRARKAGIKLMHGKAVVGVKGK 383
Tsp[g-proteobacteria]               NALELKALGIPVKAIVDLDAPETRGDLHDQVRAAGIAVHGRSTVYSAEGE 361
Rxylanophilus[actinobacteria]       LVLDLLSAGVEVAAVVDLRHEGEDSALAEVVQESGVKIYRGHCIYEALPT 367
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      K------KVKSADIAKISD-DKEQLGTIENIKCDCICVSGFWTPTIHLAS 428
CPelagibacteru[a-proteobacteri      K------KVKSAEVAKISD-DKNELGTLENINCDCICVSGFWTPTIHLAS 428
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      H------RVRCAVIVDQS-------GLRQTVRCDAILVSGGWTPSVHLHS 419
R_sp[a-proteobacterie]              L------GLSSVTVRLAN-------GQTRKVNCGALAVSGGWNPNVHLTC 415
Ssp[a-proteobacterie]               L------GLTSVTVRLLD-------GRTRDITCGALAMSGGWNPNLGLTC 415
Smeliloti[a-proteobacterie]         K------AISAIEVHHG--------GRTETIAVDALAMAGGFDPIIHLAC 416
Pmendocina                          K------RVTGARICAIDLVSHKVTSPGETVDCDLIVSSGGYSPVVHLAS 432
Pentomophila                        K------HVTGARVAAIDVQAHKVTSPGETLECDLIATSGGYSPVVHLAS 432
Paeruginosa[g-proteobacteria]       K------RVSGAKVARIDLQAMRASG-GEWLDCDLIASSGGYSPVVHLAS 431
Cpsychrerythraea[g-proteobacte      K------RVKGVEVAPINASNHSVTGPAKHIVCDTVASSGGWSPVIHLSS 432
Bdolosa[b-proteobacteria]           W------RVASVDVASY--TNGQTGGRLQSLPCDLVAMSGGFSPVLHLFA 426
Bcenocepacia[b-proteobacteria]      W------RVSSVDVASY--SNGQTGGKLQTLPCDLVAMSGGFSPVLHLFA 426
Bpseudomallei[b-proteobacteria      L------RVASVDVVSY--ANGRSGGKIATLPCDLVAMSGGFSPVLHLFA 426
Bthailandensis[b-proteobacteri      L------RVASVDVASY--ANGRSGGKIATLPCDLVAMSGGFSPVLHLFA 426
Bphymatum[b-proteobacteria]         Q------RVTSVEVVAY--ANGKTGAKQADLQCDLVAMSGGFSPVLHLFA 426
Serythraea[actinobacteria]          G------RISHVHVARWETPGDRMTNVRQVIDCDVLLVSGGWNPAVHLHS 375
Rhsp[actinobacteria]                G------RISHAVVSHRTDTDHRFR---IPLACDVLLVSGGWNPAVHLFS 379
Asp[actinobacteria]                 GEGAADGRLDSVTVRSINDDGELTSG-IEEIACDLLAVSGGWSPLVHLHS 397
Rlitoralis[a-proteobacterie]        L------RVKGLRVNPIK---EGRVSYARMLSCDAVLVCGGWTPSLHLFS 428
Rbacterium[a-proteobacteria]        K------QVTGVMVADLD-----GKSTPDAIECDAVAMSGGWSPVVHLWS 422
Tsp[g-proteobacteria]               G------LLQAVTVCALDAEGRAKPETAQRIDCDGLLMSVGYAPAAPILY 405
Rxylanophilus[actinobacteria]       RRGM---RLAGAVICPLDIQNNPIPSRAFYIDCDGICMSVGWAANIALLA 414
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      QSGNKTQFKEEIDAFIPGESKQNEK-------TLG-AANGIYTLDETLKS 470
CPelagibacteru[a-proteobacteri      QSGNKTTFNKDIDAFVPGLSKQNET-------TLG-AANGTFTLEETLKS 470
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      QSGGKVGYDADLSTFVPTSTKQHSL-------SIG-ACAGRLQLSECLAD 461
R_sp[a-proteobacterie]              HQRGRPQWDADLAAFVPGTDLPVGM-------SVAGAAMGQLSTAQALSS 458
Ssp[a-proteobacterie]               HQRGRPVWREDIHAFVPGSDLPAGQ-------SVVGAAMGEMSTHAALRT 458
Smeliloti[a-proteobacterie]         HRGGKPVWSAEKAAFLAPGSL-KGL-------EVAGGAAATTGLAACLGE 458
Pmendocina                          HLGGRPIWREDILAFVPGEGFQKR--------HCAGAVNGVFGLGDALAD 474
Pentomophila                        HLGGRPVWREDILGFVPGDAPQKR--------VCVGGVNGVYALGDVIAD 474
Paeruginosa[g-proteobacteria]       HLGGKPEWREEILAFVPGEGLQKR--------ICAGAVNGVFGLAKVLAD 473
Cpsychrerythraea[g-proteobacte      HTGSRPVWNDDIAGFVPGDTVQKQ--------HSCGGLEGVYALSKVISD 474
Bdolosa[b-proteobacteria]           QSGGKACWNDEKACFLPGKPVQAE--------ASVGAAAGEFGLARALRL 468
Bcenocepacia[b-proteobacteria]      QSGGKACWNDEKACFLPGKPVQAE--------ASIGAAAGEFGLARALRL 468
Bpseudomallei[b-proteobacteria      QSGGKAHWNDDKACFVPGKPVQAE--------ASVGAAAGEFELARALRL 468
Bthailandensis[b-proteobacteri      QSGGKAHWNDDKACFVPGKPVQAE--------ASVGAAAGEFELSRALRL 468
Bphymatum[b-proteobacteria]         QSGGKAHWNDTKACFVPGKGMQPE--------TSVGAAAGEFSLARGLRL 468
Serythraea[actinobacteria]          QSRGTLRFAEQIGAFVPDRSARSV--------RSAGAAAGVFATADCLRT 417
Rhsp[actinobacteria]                QARGKLRYDANLGAFVPGEDLDGV--------SVAGSANGVFDLDGCLRD 421
Asp[actinobacteria]                 QRQGKLRWDEDLAAFVPSTVVPNQ--------QTIGSGRGSFELADCLAE 439
Rlitoralis[a-proteobacterie]        HTKGSLDWDADAKAYLPGNKTEDV--------HIAGAGRGLWGIAAALED 470
Rbacterium[a-proteobacteria]        HCGGKLNWDDAEAMFKPDPARPPLGADGQGFVLTAGNASGAMGLAEALAD 472
Tsp[g-proteobacteria]               QSGTRMVFAEIPGQFVPEQLPPGVF--------ACGRVNGVFDLDARVAD 447
Rxylanophilus[actinobacteria]       QAGCELSYAENLGQLIPKISPEGLF--------AAGRVKGIYNIQDKLCD 456
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      SFEAGNELSKKITNNDN--KVSFPNVVEKKSTVHDKFWCVPLPKGKNY-- 516
CPelagibacteru[a-proteobacteri      SFETGYELSKKITNNDN--KTSSPTVMEKKSTTHDKFWCVPLPKGKTY-- 516
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      GASICAQMDGEAQGRGH--PATPPKAEKLVIPP-------PHLGYRAG-- 500
R_sp[a-proteobacterie]              GACGAATALEAIGITAS--AIDLPEAEDAPISLKP----FWHVSGGKS-- 500
Ssp[a-proteobacterie]               GAETAREALSDLGFTAP--GVETPKAEDAPISLTP----FWHVADAK--- 499
Smeliloti[a-proteobacterie]         GAARAEAIVRELGLPCPPVAVVKVESEEGIRSPAP----LWSIPGIKD-- 502
Pmendocina                          GFEAGAKAAAEVG--FKAVTGSLPKAEKRIEEASVALFQVPHDKGTSRA- 521
Pentomophila                        GFEGGVRAATEAG--FKASAGTLPKTLARKEEATVALFQVPHDKGTARA- 521
Paeruginosa[g-proteobacteria]       GYQAGSRAALDAG--YKTTAGSLPKVQPRREEASVALFQVPHEKPTARA- 520
Cpsychrerythraea[g-proteobacte      GFTTGAVAAEAAGKGDGRYAGNSPTTSDPQEDASMALFHIPHSKKTSRA- 523
Bdolosa[b-proteobacteria]           ALDAGIEAAKAAGFTAAQRP-VAPQVAETVEDALQPLWLVGSREAAARG- 516
Bcenocepacia[b-proteobacteria]      AVDAGVEAAKAAGFTAAQRP-AAPQVAEAVEGALQPLWLVGSREAAARG- 516
Bpseudomallei[b-proteobacteria      ALDAGVAAAKSAGF-AAERP-PVPKLAEAVEDALLPLWLASGAEAAVRG- 515
Bthailandensis[b-proteobacteri      AVDAGVAAAKSTGF-AAERP-PVPKLAEAVEDALLPLWLASGAEAAIRG- 515
Bphymatum[b-proteobacteria]         AVDAGVEAVKSIGY-AVTRP-QVPQVAEVVESPLQPLWLVGSRAEAARG- 515
Serythraea[actinobacteria]          GAEAGRDAAVAAG--FDAEAGPVPRAANPPVLAGRNVWLVPSPADSAG-- 463
Rhsp[actinobacteria]                GQTAGQSIMRDLG--FTVPDHTIDPAPAPAIEQSTPLVLWRVKDVAGE-- 467
Asp[actinobacteria]                 GISAGASAAIAAG--FSAAVEPSVIGEPKASAPTRQLWLVPGQAGTPDDW 487
Rlitoralis[a-proteobacterie]        GAKAGVEAVQALG---QTADTVTYQVTDDRTGTGITQKELPSDRSAGKA- 516
Rbacterium[a-proteobacteria]        GHEAGRQAAKAAGG--TLTRKAAPKAPETERQPLKQVWIMPTSAGPDKR- 519
Tsp[g-proteobacteria]               GAAAAGEALAHLGMQAGPTARP----GRSSERMSHPWPVFPHPKGKN--- 490
Rxylanophilus[actinobacteria]       GRRAGILAAQYAGFSKKNTKIPEEPVDSSAVGRSHPYPIYDHPKGMA--- 503
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      -KRFLDFQNDVAVSDIEIALREGYRSIEHVKRYTTLGMATDQGKTSNLNG 565
CPelagibacteru[a-proteobacteri      -KRFLDFQNDVAVSDVEIALKEGYRSIEHVKRYTTLGMATDQGKTSNLNG 565
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      -KRFIDIQDDVTVEDIELAARENFRSVEHLKRYTTLGMGTDQGKTSNVNG 549
R_sp[a-proteobacterie]              -RAWVDLQNDVTVKDVKLAHQENFVSVEHLKRYTTLGMATDQGKTSNMLG 549
Ssp[a-proteobacterie]               -RAWLDFQNDVTVKDVKLAHQENFTSVEHLKRYTTLGMATDQGKTSNVGA 548
Smeliloti[a-proteobacterie]         -KAFVDFQNDVHLKDIGLAVREGYSHVELAKRYTTSGMATDQGKLSNVNA 551
Pmendocina                          PKQFVDQQNDVTAAGIELATREGFESVEHVKRYTALGFGTDQGKLGNING 571
Pentomophila                        PKQFVDQQNDVTAAAIELATREGFESVEHVKRYTALGFGTDQGKLGNING 571
Paeruginosa[g-proteobacteria]       PKQFVDPQNDVTAAAIELACREGFESIEHVKRYTALGFGTDQGKLGNING 570
Cpsychrerythraea[g-proteobacte      PKQFVDYQNDVTAAGIELANREGFESIEHVKRYTALGFGTDQGKLGNING 573
Bdolosa[b-proteobacteria]           PKQFVDFQNDVSAADILLAAREGFDSVEHVKRYTAMGFGTDQGKLGNING 566
Bcenocepacia[b-proteobacteria]      PKQFVDFQNDVAAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 566
Bpseudomallei[b-proteobacteria      PKQFVDFQNDVGAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 565
Bthailandensis[b-proteobacteri      PKQFVDFQNDVGAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 565
Bphymatum[b-proteobacteria]         PKQFVDFQNDVSAADILLAAREGFESVEHVKRYTAMGFGTDQGKLGNING 565
Serythraea[actinobacteria]          HTQYVDLARDATVADIRRAVGAGLHSVEHVKRYTTIGTAHDQGKTSGILS 513
Rhsp[actinobacteria]                DTQFVDVQRDATVADLARAVGAGMTSMEHIKRYTTIGTAHDQGKTSGVIS 517
Asp[actinobacteria]                 HHHFVDFQRDQSVADVLRSTGAGMRSVEHIKRYTSISTANDQGKTSGVNA 537
Rlitoralis[a-proteobacterie]        -KAFVDFQNDVTAKDIRLAVREGMKSIEHVKRYTTNGMATDQGKLSNMNG 565
Rbacterium[a-proteobacteria]        MKMWLDYQNDVKVSDVQLAAREGYASVEHTKRYTTLGMATDQGKLSNING 569
Tsp[g-proteobacteria]               ---FVDLDEDLQLKDLERAAAEGFDNIELLKRYSTVGMGPSQGKHANMNA 537
Rxylanophilus[actinobacteria]       ---FVDFDEDVQLKDIKNSIQEGFDSVALVNRFATLGMGPSQGKHSNMNG 550
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      LQLVSKIEN------KVVPAVGHTTFRPPYTPVSIGAIVGREVGKHTKPT 609
CPelagibacteru[a-proteobacteri      LQLVSNIEN------KIVPEVGHTTFRPPYTPVTIGAIVGREVGKHSKPT 609
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      LTIMGALRS------ESPGAVGTTTFRPPYTPIRLGLLSGRHIDRHFSAV 593
R_sp[a-proteobacterie]              LAVMAELTG------KSIPETGTTIFRPPYTPVAMGTLAGRATGKHFHPT 593
Ssp[a-proteobacterie]               LAVMAELTG------KPIPETGTTIFRPPYTPVSMGALAGRAVGKDFHPT 592
Smeliloti[a-proteobacterie]         IGLIAKARG------VSPAEVGTTTFRPFYTPVSFGALTGAHTGHHFQPV 595
Pmendocina                          LAIAAKSLG------ISISEMGTTMFRPNYTPVTFGAIAGRHCGELFEPK 615
Pentomophila                        LAIAARSLG------IGIPEMGTTMFRPNYTPVTFGAVAGRHCGHLFEPV 615
Paeruginosa[g-proteobacteria]       LAIAARAQG------KSIADTGTTMFRPNYTPVTFGAVAGRHCGHLFEPV 614
Cpsychrerythraea[g-proteobacte      MAITAKSLG------KTIPETGTTIFRPMYTPTTFGALAGADVKHLFDPA 617
Bdolosa[b-proteobacteria]           MAILAQALG------KSIPETGTTTFRPNYTPVSFGTFAGRELGDFLDPI 610
Bcenocepacia[b-proteobacteria]      MAILAGALG------KTIPETGTTTFRPNYTPVSFGTFAGRETGDFLDPI 610
Bpseudomallei[b-proteobacteria      MAILAQALG------KTIPETGTTTFRPNYTPVSFGAFAGRELGDFLDPI 609
Bthailandensis[b-proteobacteri      MAILAQALG------KTIPETGTTTFRPNYTPVSFGAFAGRELGDFLDPI 609
Bphymatum[b-proteobacteria]         MAILADALG------KTIPETGTTTFRPNYTPVTFGTFAGRELGDLLDPI 609
Serythraea[actinobacteria]          TGIITEALG------RDIADVGTTTFRAPYAPVTFAALAGRDRGDLYDPV 557
Rhsp[actinobacteria]                SGITAELLG------RPIETLGTTTFRPPYTPVAFAALAGRSRGALFDPE 561
Asp[actinobacteria]                 IGVIAAALRTAGEASRGIGDIGTTTYRAPFTPVAFAALAGRQRGELFDPA 587
Rlitoralis[a-proteobacterie]        LTIASDALG------KEAPQVGLTTFRPPYTPTTFGAFAGYHKGKHFEVT 609
Rbacterium[a-proteobacteria]        LAVLSDALG------QAIPQTGTTTFRPPYTPISMGAIAGEARGELFQPI 613
Tsp[g-proteobacteria]               VRILARLNK------QSIGATGTTTARPFYHPVPIKHLAGRR----LRPE 577
Rxylanophilus[actinobacteria]       VRIVARMMN------QSIDKAGSITSRPFYHPVPMGVLAGRS----FHPV 590
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      RKSPMHYWHEKNNAVFVDAGVWLRPRYYKQ-GNETLFEGSKREAKNVRTN 658
CPelagibacteru[a-proteobacteri      RKSPMHTWHEKNNAVFVDAGVWLRPRYYKI-GEETLFEGSKREAKNVRTN 658
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      RVSPMHEWHVRNGAVMGPANLWLRPKAYLR-GNESYAQAWQRECRNVRQD 642
R_sp[a-proteobacterie]              RKTPSHRWAEEQGAVFTEVGDWLRAQWFPKAGETHWRQSVDREVLQTRNS 643
Ssp[a-proteobacterie]               RLTPSHKWAEEQGAVFVEVGNWLRAQWFPKAGETHWRQSVDREVLATRNS 642
Smeliloti[a-proteobacterie]         RKSPLHDWAKKHGAVFVETGLWYRSSWFPRSGERTWRESVEREVLNVRKN 645
Pmendocina                          RYTALQKWHLENGAEFEDVGQWKRPWYFPKNGED-LHAAVARECLAVRNA 664
Pentomophila                        RFTALHAWHIKNGAEFEDVGQWKRPWYFPKPGED-IHTAVARECKAVRDS 664
Paeruginosa[g-proteobacteria]       RFTALHAWHVKNGAEFEDVGQWKRPWYFPRRGED-MHAAVARECRAVREA 663
Cpsychrerythraea[g-proteobacte      RFSAMHKWHLENGAEFEDVGQWKRPWYFPQPGET-MQQSLERECLATRNS 666
Bdolosa[b-proteobacteria]           RKTCVHEWHVEHGAMFEDVGNWKRPWYFPKNGED-LHAAVKRECLAVRNS 659
Bcenocepacia[b-proteobacteria]      RKTAVHEWHVEHGAMFEDVGNWKRPWYFPKNGED-LHAAVKRECLAVRNG 659
Bpseudomallei[b-proteobacteria      RKTCVHEWHVEHGAMFEDVGNWKRPWYFPRNGED-LHAAVKRECLAVRNG 658
Bthailandensis[b-proteobacteri      RKTCVHEWHVEHGAMFEDVGNWKRPWYFPRNGED-LHAAVKRECLAVRNG 658
Bphymatum[b-proteobacteria]         RKTAVHEWHVENGAMFEDVGNWKRPWYFPLKGED-LHAAVKRECLAVRNS 658
Serythraea[actinobacteria]          RVTAMHDWHVEQGAPFENVGQWKRPWYYPRPGED-METAVLRECQAVREG 606
Rhsp[actinobacteria]                RVTALHDWHVGRGAVFEDVGQWKRPRYYPLPGED-MDAAVLRECAAVRRS 610
Asp[actinobacteria]                 RVTSIHPWHVAKGALFEDVGQWKRPWYYPQDGED-MDTAVLRECAAVRES 636
Rlitoralis[a-proteobacterie]        RKTPIDSWAEENGAAFEPVALWRRAWYFPQDGED-MHKAVLRECKATRES 658
Rbacterium[a-proteobacteria]        RRTPMHSAHDAAGAVWEPVGHWRRPFCFARTGET-DMEAVNREIVNTRDN 662
Tsp[g-proteobacteria]               RRTPMHGWHRDHGAVFMPAGHWQRPKYYGPAS---EAEAIRAEVMAVREG 624
Rxylanophilus[actinobacteria]       RRTPMHFRHEDFNAIFMRAGNWLRPEYYELSGKE-REDAIRAEVRSVRQH 639
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      VGVCDVTTLGKIDIKGPDAAELLNRVYTNAWLKLPVGKARYGVMLREDGI 708
CPelagibacteru[a-proteobacteri      VGVCDVTTLGKIDIKGPDAAELLNRVYTNAWLKLPVGKARYGVMLREDGI 708
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      VGIVDVSTLGKIEVQGPDAGVFLDRVYANRISTLKVGKARYGVLLREDGI 692
R_sp[a-proteobacterie]              VGVCDVTTLGKIDVQGKDAAAFLNKMYANAFAKLPVGKVRYGLMLREDGI 693
Ssp[a-proteobacterie]               VGICDVTTLGKIDVQGTDAAEFLNKIYANGFAKLPVGKVRYGLMLREDGV 692
Smeliloti[a-proteobacterie]         AGLCDVSMLGKIEITGSDAAEFLNRVYCNAFLKLPVGKARYGLMLREDGF 695
Pmendocina                          VGILDASTLGKIDIQGPDAREFLNRVYTNAWTKLDVGKARYGLMCKEDGM 714
Pentomophila                        VGLLDASTLGKIDIQGPDAREFLNRIYTNAWTKLDVGKARYGLMCKEDGM 714
Paeruginosa[g-proteobacteria]       VGLLDASTLGKIDIQGPDAREFLNRVYTNAWTKLDVGKARYGLMCKEDGM 713
Cpsychrerythraea[g-proteobacte      VGILDASTLGKIDIQGKDAREFLNRVYTNPWSKLGVGKCRYGVMCKEDGM 716
Bdolosa[b-proteobacteria]           VGILDASTLGKIDIQGPDAVKLLNWMYTNPWNKLEVGKCRYGLMLDENGM 709
Bcenocepacia[b-proteobacteria]      VGILDASTLGKIDIQGPDAVKLLNWMYTNPWNKLEVGKCRYGLMLDENGM 709
Bpseudomallei[b-proteobacteria      VGMLDASTLGKIDIQGPDAVKLLNWVYTNPWNKLEVGKCRYGLMLDENGM 708
Bthailandensis[b-proteobacteri      VGILDASTLGKIDIQGPDAVKLLNWVYTNPWNKLEVGKCRYGLMLDENGM 708
Bphymatum[b-proteobacteria]         VGILDASTLGKIDIQGPDAAKLLNWMYTNPWSKLEVGKCRYGLMLDENGM 708
Serythraea[actinobacteria]          VGIQDVSTLGKIDVQGPDAAEFLDLVYTNKMSTLKVGRIRYGLMCHADGM 656
Rhsp[actinobacteria]                IGILDGSTLGKIDVQGPDAGVLLDMIYTNMMSTLKVGMVRYGVMCGVDGM 660
Asp[actinobacteria]                 VGFMDATTLGKIEIRGKDAGEFLNRIYTNAFKKLAPGSARYGVMCMADGM 686
Rlitoralis[a-proteobacterie]        VGMFDASTLGKIEVSGPDAVEFMNRMYTNPWTKLGVGRCRYGLLLGEDGF 708
Rbacterium[a-proteobacteria]        VGMLDASTLGKILVTGPDAGKFLDMLYTNVMSSLPVGKCRYGLMCTENGF 712
Tsp[g-proteobacteria]               VGLIDVSTLGKVEVFGPDAARFMDQLYTLKLSTVKQGMTRYALMVDEAGV 674
Rxylanophilus[actinobacteria]       VGLIDVGTLGKLEIHGPDALELIERICTGHFARLETGMTRYALMTDEAGI 689
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      VMDDGTTTRISENHYHMTTTTAQAANVLSHLEYYLQLVWPDLNVNVVSST 758
CPelagibacteru[a-proteobacteri      VMDDGTTTRISENHYHMTTTTAQAANVLSHLEYYLQLVWPELNVNVVSTT 758
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      VFDDGTIARWGERLFILSTTTANAAAVMSHFEFLLATAWPTLRVRVTSVT 742
R_sp[a-proteobacterie]              AYDDGTAARFAEDHFVVTTTTANAVLVYRNMEFARQCLFPDMDVQLISTT 743
Ssp[a-proteobacterie]               AYDDGTAARLAEDHFVVTTTTANAVLVYRNMEFARQCLWPDLDVQLISTT 742
Smeliloti[a-proteobacterie]         IYDDGTTSRLEENRFFMTTTTAYAAGVMNHLEFCAQVLWPQLDVRLASIT 745
Pmendocina                          VFDDGVTACLADNHFVMTTTTGGAGRVMEWLEIYHQTEWPELKVYFTSVT 764
Pentomophila                        VFDDGVTACVGDNHFIMTTTTGGAARVLQWLELYHQTEWPDMKVYFTSVT 764
Paeruginosa[g-proteobacteria]       VFDDGVTACLADNHFVMTTTTGGAARVLEWLELYHQTEWPELKVYFTSVT 763
Cpsychrerythraea[g-proteobacte      VFDDGVTVCLDDNRFIMTTTTGGAAGVLQWLELWHQTEWPELEVYFSTVT 766
Bdolosa[b-proteobacteria]           VFDDGVTVRLAEQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLASVT 759
Bcenocepacia[b-proteobacteria]      VFDDGVTVRLADQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLASVT 759
Bpseudomallei[b-proteobacteria      VFDDGVTVRLGDQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLSSVT 758
Bthailandensis[b-proteobacteri      VFDDGVTVRLGEQHFMMTTTTGGAARVLTWLERWLQTEWPDMKVRLSSVT 758
Bphymatum[b-proteobacteria]         VFDDGVTVRLADQHFMMTTTTGGAARVLTWMERWLQTEWPDMKVRLASVT 758
Serythraea[actinobacteria]          VFDDGTVMRTGENRYLISTTSGGAAGVLQWLEDWLQTEWPHLRVHLTSVT 706
Rhsp[actinobacteria]                VIDDGTVMRLDDDRFQVFTTTGGAAKILDWMEEWLQTEWPHLRVRLTSVT 710
Asp[actinobacteria]                 IFDDGVTLRLDEDRFFMTTTTGGAAKVLDWLEEWLQTEWPELDVHCTSVT 736
Rlitoralis[a-proteobacterie]        IRDDGVIGRIRDDLFHVTTTTGGAASVLNMMEDYLQTEWPDLKVWLTSTT 758
Rbacterium[a-proteobacteria]        VTDDGVVARIGEQTWLCHTTTGGADRIHGHMEDWLQCEWWDWKVYTANLT 762
Tsp[g-proteobacteria]               VIDDGVCARWGEEHFYVSTTTTGAEAIFRQMQRMIGEWN--LKVDVVNRT 722
Rxylanophilus[actinobacteria]       IIDDGVCAKLNDDHFYLTATTSGVDDLYREMSRWIQIWG--LNVEVTNYT 737
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      EQWAGAAIAGPKSRDLLQNLFP-NSDVSN----EGLPFMGYMEGDLFGV- 802
CPelagibacteru[a-proteobacteri      EQWAGAAIAGPKSRDLLQKLFP-NIDASN----EGLPFMGYLEADLFGV- 802
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      DHYAQIALAGPKSREVLERLQI-SADVTD----SALPHMAVCETVWNGL- 786
R_sp[a-proteobacterie]              EAWAQFAVAGPNARKLLQKVVDPEFDLSN----EGFPFMACGEVTVAGG- 788
Ssp[a-proteobacterie]               EAWAQYAVAGPNSRKLLQKIVDPEFDISN----AAFPFMGCREITVCGG- 787
Smeliloti[a-proteobacterie]         DQWAQMAIAGPKARMILQKIVD--EDISD----AAFPFLAAKEVSLFGGA 789
Pmendocina                          DHWATMTLSGPNSRKLLAEVT--DIDLDK----DAFPFMSWKEG-KVG-G 806
Pentomophila                        DHWATMTLSGPNSRKLLADVS--DIDLDK----EGFPFMSWKEG-LVG-G 806
Paeruginosa[g-proteobacteria]       DHYATLTLSGPNSRKLLAEVT--DIDLDK----DAFPFMTWKEG-KVA-G 805
Cpsychrerythraea[g-proteobacte      DHWSTMTISGPNSRKVLEKIC--DIDVSN----DSFKYMDWRAA-TVA-G 808
Bdolosa[b-proteobacteria]           DHWATFAVVGPKSRKVVQKVCQ-DIDFGN----DAFPFMSYRNG-TVA-G 802
Bcenocepacia[b-proteobacteria]      DHWATFAVVGPKSRKVVQKVCQ-DIDFGN----EAFPFMSYRNG-TVA-G 802
Bpseudomallei[b-proteobacteria      DHWATFAVVGPKSRRVVQKVCK-DIDFAN----DAFPFMSYRDG-TVA-G 801
Bthailandensis[b-proteobacteri      DHWATFAVVGPKSRKVVQKVCK-DIDFAN----DALPFMSYRDG-TVA-G 801
Bphymatum[b-proteobacteria]         DHWATFAVVGPKSRKVVQKVCS-DIDFAN----EAFPFMSYRNG-TVA-G 801
Serythraea[actinobacteria]          EQWATIALVGPRSREVLARVAS-EMDLDN----DDFPFMAWQDG-SVA-G 749
Rhsp[actinobacteria]                EQWATFPVVGPRSRDVIGEVFP-DLDVTN----DAFGFMAWRDT-SLG-G 753
Asp[actinobacteria]                 EQWSTIAVVGPKSRAVLAKVAP-ELAAGGGLEAEAFPFMTFRET-TLASG 784
Rlitoralis[a-proteobacterie]        EEWATIALNGPNARKLLQPFVE-GADISA----DAMPHMALVEC-TVA-G 801
Rbacterium[a-proteobacteria]        EQYAQVAVAGPKARKVLEALG--GMDVSK----EAMPFMTWADG-TLA-G 804
Tsp[g-proteobacteria]               SQLASMNIAGPLTRDVLQPLTDVDLSQAA------FPFLGARQGRVAG-- 764
Rxylanophilus[actinobacteria]       ETFAAMNVAGPSARAVMKQLTELDLSENK------FPYLAIREGEVAG-- 779
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      -KARIFRISFSGELAYEVNVESDYGNFMWEKIMEIGEEFKIQPYGTEALS 851
CPelagibacteru[a-proteobacteri      -HARIFRISFSGELAYEVNVESDNGNFMWEKIMEVGQEFKIQPYGTEALS 851
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      -KLLIYRVSFSGERAYELAIAAAYGQRLWDQLLAVGAPFSIMPYGTEAMG 835
R_sp[a-proteobacterie]              CRARLFRISFSGELAYEIAVPTRYGDALVRRLMEAGEEFGVVPYGTEALG 838
Ssp[a-proteobacterie]               LRARLFRISFSGELAYEIAVPTRYGDALMREMMTAGAEFDVTPYGTEALG 837
Smeliloti[a-proteobacterie]         LHGCLFRISFSGELAYELAVPAGYGESIADALLEAGKDHGIMPYGVETLS 839
Pmendocina                          VPARVFRISFTGELSYEVNVQADYALGVWEQIIEAGKKHGLTPYGTETMH 856
Pentomophila                        VPARVFRISFTGELSYEINVQANYAMGVLEQIVEAGKKYNLTPYGTETMH 856
Paeruginosa[g-proteobacteria]       VPARVFRISFTGELSYEVNVQADYAMGVLEALAEHGAKYGLTPYGTETMH 855
Cpsychrerythraea[g-proteobacte      VKARIFRISFTGELSFEINVQANYGMHAWKAVMAAGEEFNITPYGTETMH 858
Bdolosa[b-proteobacteria]           VKARVMRISFSGELAYEVNVPANAGRAVWEALMAAGAEFDITPYGTETMH 852
Bcenocepacia[b-proteobacteria]      AKARVMRISFSGELAYEVNVPANAGRAVWEALMAAGAEFDITPYGTETMH 852
Bpseudomallei[b-proteobacteria      VKSRVMRISFSGELAYEVNVPANAGRAVWEALMDAGAEFDITPYGTETMH 851
Bthailandensis[b-proteobacteri      VKSRVMRISFSGELAYEVNVPANAGRAVWEALMEAGAEFDITPYGTETMH 851
Bphymatum[b-proteobacteria]         VKARVMRISFSGELAYEVNVPANMGRAVWEALMAAGAEFDITPYGTETMH 851
Serythraea[actinobacteria]          QRARVCRISFSGELAFEINVPWWHGREVWDALIDAGAPFGITPYGTETMH 799
Rhsp[actinobacteria]                VHVRVARISFSGELAFEVNVDGWHAPAVWARLIAAGEKFDITPYGTETMH 803
Asp[actinobacteria]                 VQARICRISFSGELAYEINVPSWYGLNTWEAVAAAGAEFNITPYGTETMH 834
Rlitoralis[a-proteobacterie]        FPARLFRVSFTGELGFEINVPARHGRALWEKLHEAGQKFDICTYGTETMH 851
Rbacterium[a-proteobacteria]        IPARVYRISFTGELSYEIAVPANRGAELWAKVAEAGAAHGIQPYGTEAMH 854
Tsp[g-proteobacteria]               VPAWLFRVGFVGELGFEIHVPAAQALHVWEALMEAGASRGIRPFGVEAQR 814
Rxylanophilus[actinobacteria]       VPARIMRVGFVGELGYEVHVPATYGLFVWDRIIEAGREYGIKPFGVEAQR 829
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      TLRIEMG-HVAGSELDGRTIPYDNSLEGLLSK-KK-DFIGKRSLTREAFT 898
CPelagibacteru[a-proteobacteri      TLRIEMG-HIAGSELDGRTIPYDNSLEGLVSK-KK-DFIGKRSLEREAFI 898
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      ALRIEKG-HPAGPELDGRTTAADLGLGGLVKK-EG-AFVGKALLGREGLQ 882
R_sp[a-proteobacterie]              VMRIEKG-HAAGNELNGTTSALNLGMGRMVSK-KK-DCIGNTLSEREGMN 885
Ssp[a-proteobacterie]               VMRIEKG-HAAGNELNGTTTALNLGLDRMVST-KK-DFIGNVLSRREGMN 884
Smeliloti[a-proteobacterie]         VLRIEKG-HVTHNEINGTIVPADLGFGKMVSAGKP-DFVGKAMLQREGLT 887
Pmendocina                          VLRAEKGFIIVGQDTDGSVTPDDLGMGWCVGRTKPFSWIGWRGMNREDCL 906
Pentomophila                        VLRAEKGFIIVGQDTDGSMTPDDLNMSWCVGRNKPFSWIGLRGMNREDTV 906
Paeruginosa[g-proteobacteria]       VLRAEKGFIIVGQDTDASVTPDDLNMGWAVGRSKPFSWIGWRGMNRADCL 905
Cpsychrerythraea[g-proteobacte      ILRAEKGFIIVGQDTDGSVTPQDLDMDWVVGKKKDFSFIGKRSWTRFDNK 908
Bdolosa[b-proteobacteria]           VLRAEKGYIIVGQDTDGSVTPYDLGMGGLVAKSK--DFLGKRSLSRSDTA 900
Bcenocepacia[b-proteobacteria]      VLRAEKGYIIVGQDTDGSITPFDLGMGGVVAKSK--DFLGKRSLSRSDTA 900
Bpseudomallei[b-proteobacteria      VLRAEKGYIIVGQDTDGSITPFDLGMGGLVAKSK--DFLGRRSLTRADTA 899
Bthailandensis[b-proteobacteri      VLRAEKGYIIVGQDTDGSITPFDLGMGGLVAKSK--DFLGRRSLTRADTA 899
Bphymatum[b-proteobacteria]         VLRAEKGYIIVGQDTDGSVTPHDLGMGGLVAKTK--DFLGRRSLARSDTT 899
Serythraea[actinobacteria]          VLRAEKGFPIVGQDTDGTVTPHDLGMSWAVSKKKD-DFLGMRSFSRADTS 848
Rhsp[actinobacteria]                VLRAEKGYPIIGQDTDGTVTPQDLGMSWAVSKKKR-DFIGKRSFTRAENQ 852
Asp[actinobacteria]                 VLRAEKGYPIVGQDTDGTVTPQDAGMEWVVSKAK--EFIGKRSYARADAK 882
Rlitoralis[a-proteobacterie]        VLRAEKGFIIVGQDTDGTVTPQDAGIGWAIGKMKP-DFVGKRSLDRPDIA 900
Rbacterium[a-proteobacteria]        IMRAEKGFVMIGDETDGTVIPQDLNMGWIISKKKT-DYLGKRAQERSHMA 903
Tsp[g-proteobacteria]               QLRLEKGHLIVGQDTDGTSSPFDANMAWAVKFDKP-FFQGKRSLQILKER 863
Rxylanophilus[actinobacteria]       RLRLEKGHIIVGQDTDGLTNPWEANLGWAVKLDKP-FFIGQRTLKILRKK 878
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      AED-----RQKVVGVVPLDKKTSIPEGSHLVKDS----KAPLPNPKLGYI 939
CPelagibacteru[a-proteobacteri      AED-----RQKVVGVVPIDKKTSIPEGSHLVKDA----MAPTPNPKLGYI 939
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      AAD-----RPTLVGLR-SKSGAAIQSGSMLVLR------AEVGAQELGWV 920
R_sp[a-proteobacterie]              EED-----ALKLVGFRPVKSDETISAGAHLMNAS----GAVNAKVDQGYV 926
Ssp[a-proteobacterie]               AKD-----ALNLVGVRPVDPSHSLPAGGHLMRRS----GPVDATQDQGYV 925
Smeliloti[a-proteobacterie]         APD-----RPQLVGVVPLDPQQSFRSGSHILAKG----AAATLENDEGYV 928
Pmendocina                          KEN-----RKQLIGLKPLDPNKVLPEGAQLVFDP-KQP---IPMTMVGHV 947
Pentomophila                        REN-----RKQLVGLKPVDPNVWLPEGAQLVFDP-KQP---IPMDMVGHV 947
Paeruginosa[g-proteobacteria]       RED-----RKQLVGLRPSNPQEVLPEGAQLVFDT-QQA---IPMKMVGHV 946
Cpsychrerythraea[g-proteobacte      RDD-----RKQMVGLKPKDPTFVLPEGAQIVFEK-NQS---IPMKMVGHV 949
Bdolosa[b-proteobacteria]           KEG-----RKQFVGLLTDDEQFVLPEGAQIVAKD-TQVSTVDPTPMIGHV 944
Bcenocepacia[b-proteobacteria]      KEG-----RKQFVGLLTEDEQFVLPEGAQIIAKD-TQVSATDPTPMIGHV 944
Bpseudomallei[b-proteobacteria      KSG-----RKQFVGLLTDDAQSVLPEGGQIVELD-AAARADGTTPMLGHV 943
Bthailandensis[b-proteobacteri      KSG-----RKQFVGLLTDDAQYVLPEGGQIVELD-AAARADGTTPMLGHV 943
Bphymatum[b-proteobacteria]         KDN-----RKQFVGLLSDDPQFVIPEGSQIVARP-FQG---DTAPMLGHV 940
Serythraea[actinobacteria]          RTD-----RKHLVGLLPADEDLVLEEGAQLVEHS---ELPQPPVPMLGHV 890
Rhsp[actinobacteria]                NPL-----RKEFVGLLPLDKQTVLPEGAQIIEEISDGVLPPPPVPMLGHV 897
Asp[actinobacteria]                 RED-----RKHLVSVLPVDGTLRLPEGTQLVEKGIPTNPAYGPVPMQGFV 927
Rlitoralis[a-proteobacterie]        APG-----RKQLVGLLTDDSKTVLVEGAQIVANP-KQP---KPMKMIGHV 941
Rbacterium[a-proteobacteria]        SPD-----RWRLVGLETLDG-SVIPDGAYAVGEGFNAN---GQRNMIGRV 944
Tsp[g-proteobacteria]               AAN------RLVGFRLPGSHPGPIPRECHLVIHD---------DDIAGRV 898
Rxylanophilus[actinobacteria]       MDANLAQSRVLVGFKLVSNER-PWPKESHLIIEE---------DRIIGRV 918
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      SASCWSVEYDNPFSLAILKNGKNMIGEKLYVMSPLKN-KIIPVEIVSSHY 988
CPelagibacteru[a-proteobacteri      SASCWSVEYDNPFSLAILKDGKNMIGKKLFAMSPLKN-KTIPVEIVSSHY 988
ma_sequence                         --------------------------------------------------
Bparapertussis[b-proteobacteri      ASATYSPTLGQHIALGFLVNGANALGRSVLAWSALTS-SQVEVEVVNPCF 969
R_sp[a-proteobacterie]              TSAAYSPVLESSIGIGFLKNGDARKGEIIRAVNPLAG-QEIQVEVVSAHF 975
Ssp[a-proteobacterie]               TSAAYSPTLKSAIGLGFVKSGFERMGEQLRLVNPLEG-QEILVEIVSPHF 974
Smeliloti[a-proteobacterie]         TSSAYSPHVGSTIALALVRNGRNRHGEEVLVWSGLHG-ESTPARLCNPVF 977
Pmendocina                          TSSYMSAAMGYSFAMALVRGGLSRIGERVFAPLADGS--VIEAEIVSPVF 995
Pentomophila                        TSSYAANSLGYSFAMGVVKGGLKRLGERVYSPQADGS--VIEAEIVSSVF 995
Paeruginosa[g-proteobacteria]       TSSYMSASLGHGFALAVVKGGLKRMGQKVYAPLADGR--FIEAEICSSVF 994
Cpsychrerythraea[g-proteobacte      TSSYYSACMGYSFALAVVKGGISRKGESVYLPLSDGT--TVEAEICSPVF 997
Bdolosa[b-proteobacteria]           TSSYYSPILQRSIALAVVKGGLNKMGESVVIPLADGK--RITAKISSPVF 992
Bcenocepacia[b-proteobacteria]      TSSYYSPILKRSIALAVVKGGLNKMGESVVIPLANGR--RITAKISSPVF 992
Bpseudomallei[b-proteobacteria      TSSYYSPILNRSIALAVVKGGLSRMGERVAVSLANGR--RVAATISSPVF 991
Bthailandensis[b-proteobacteri      TSSYYSPILNRSIALAVVKGGLSRMGERVAVSLANGR--RVAATISSPVF 991
Bphymatum[b-proteobacteria]         TSSYYSPILNRSIALAVVKGGLNKMGQSVTIPLSSGK--QIAAKIASPVF 988
Serythraea[actinobacteria]          TSSYRSAVLRRGFALALVKGGRDRIGETIYSTAGDG---LAAVTITEPVF 937
Rhsp[actinobacteria]                TSSYLSAELGRPFGLALVKGGRARLGDTLHVPVDGN---LVAVEVTSSVL 944
Asp[actinobacteria]                 TSSYHSAALGRSFGLALIKNGRNRIGETLVAAAGDQ---LVDVVVAETVL 974
Rlitoralis[a-proteobacterie]        TSSYWSETLGRSIAMAVVEGGFDRMDETLHIPTEEGG--TVPAKVTGTVF 989
Rbacterium[a-proteobacteria]        TSTYYSPTIRKGIAMGLIQHGPDRMGEVVDFATLDGTGTVIKAKIVETCF 994
Tsp[g-proteobacteria]               TSIGYSPSLKAWVGLAMVDKT-LADAAQLSIRVEGAV--IIQADVVPTPF 945
Rxylanophilus[actinobacteria]       TSTAYSESLDQVIGLAFLPTERSARGTRFQIRVEGGS--MVEAEVVPTPF 966
Ppacifica[d-proteobacteria]         --------------------------------------------------
                                                                                      

CPelagibactersp[a-proteobacter      VDPKGERVRS-------- 998
CPelagibacteru[a-proteobacteri      VDPKGERVRS-------- 998
ma_sequence                         ------------------
Bparapertussis[b-proteobacteri      VDIERERLLG-------- 979
R_sp[a-proteobacterie]              VDPEGERLRA-------- 985
Ssp[a-proteobacterie]               VDPEGEKLRA-------- 984
Smeliloti[a-proteobacterie]         FDPQNERLHV-------- 987
Pmendocina                          YDPKGDRQNV-------- 1005
Pentomophila                        FDPKGERQNV-------- 1005
Paeruginosa[g-proteobacteria]       YDPKGERQNVD------- 1005
Cpsychrerythraea[g-proteobacte      YDPKGDRQNV-------- 1007
Bdolosa[b-proteobacteria]           YDTEGVRQHVE------- 1003
Bcenocepacia[b-proteobacteria]      YDTEGVRQHVE------- 1003
Bpseudomallei[b-proteobacteria      YDTEGVRQHVE------- 1002
Bthailandensis[b-proteobacteri      YDTEGVRQHVE------- 1002
Bphymatum[b-proteobacteria]         YDTEGVRQHVE------- 999
Serythraea[actinobacteria]          YDKEGARRDG-------- 947
Rhsp[actinobacteria]                VDPEGARRDG-------- 954
Asp[actinobacteria]                 FDPEGTRKDG-------- 984
Rlitoralis[a-proteobacterie]        YDPAGDRLKVE------- 1000
Rbacterium[a-proteobacteria]        YDKEGAKADV-------- 1004
Tsp[g-proteobacteria]               YDPEGLRQKPETAGEVNA 963
Rxylanophilus[actinobacteria]       YDPDNMRQRVS------- 977
Ppacifica[d-proteobacteria]         ------------------
                                                      

BLAST

PROTOCOL:

a) BLASTp against NR, "max target sequences: 500"

b) BLASTp against SwissProt (SP)

c) BLASTx against NR
---------------------------------------------------------------------------------------------------
ANALYSIS OF RESULTS:

a) These results allowed us to determine the start codon of our ORF: the homologs all begin at the
same position (8), aligned with the methionine. We therefore removed 7 codons (21 nucleotides) from
the beginning of the ORF, moving its start from position 177 to position 198 of the read.

We find many homologs with very good E-values.
Among the best E-values, one function predominates: "sarcosine oxidase".

b) These results do not yield good homologs, because the E-values are not significant beyond the
first hit: the lowest is 1e-67, and the next one jumps to 0.013.

c) We observe a reading-frame shift (frameshift) from frame +3 to frame +2.
This could be due to a sequencing error.
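
The start-codon adjustment in a) and the frame numbering in c) can be checked against the read itself. The sketch below is only illustrative: FRAGMENT is copied from positions 171-230 of the genomic sequence above, the helper names are hypothetical, and the frame formula assumes the usual BLASTx convention that forward frame +k covers codons starting at positions k, k+3, k+6, and so on. The single missing base is an assumed scenario, not something established by the BLASTx output.

```python
# Positions 171-230 (1-based) of read JCVI_READ_1092343625359, copied from the genomic sequence.
FRAGMENT_START = 171
FRAGMENT = "TCTTAAAGCAAACGAAGATATTTCATAATGACACAAAGTTTTAGATTAGAAACTGGTGGA"


def codons_from(pos):
    """Return the codons of FRAGMENT read from 1-based read position `pos`."""
    offset = pos - FRAGMENT_START
    return [FRAGMENT[i:i + 3] for i in range(offset, len(FRAGMENT) - 2, 3)]


def forward_frame(pos):
    """Forward frame (+1, +2 or +3) of a codon starting at 1-based position `pos`."""
    return (pos - 1) % 3 + 1


old_start = 177                                # ORF start before the correction
codons = codons_from(old_start)
codons_before_met = codons.index("ATG")        # codons preceding the first methionine
new_start = old_start + 3 * codons_before_met  # corrected ORF start

frame_before = forward_frame(new_start)        # frame of the corrected ORF

# Assumed scenario: one base is missing from the read somewhere inside the ORF,
# so every downstream codon begins one position earlier in read coordinates.
downstream_codon = new_start + 3 * 100         # arbitrary in-frame codon start
frame_after = forward_frame(downstream_codon - 1)

print(codons[:codons_before_met + 1])
print(codons_before_met, new_start, frame_before, frame_after)
```

Under these assumptions the script finds 7 codons before the methionine, a corrected start at position 198 in frame +3, and frame +2 downstream of a single missing base, which is consistent with the shift reported in c). An insertion of two bases would produce the same frame change, so the sketch does not distinguish between the two.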

---------------------------------------------------------------------------------------------------
RAW RESULTS:

a)
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|EDZ60822.1|  sarcosine oxidase alpha subunit [Candidatus Pe...   429    1e-118
ref|YP_266690.1|  sarcosine oxidase alpha chain [Candidatus Pe...   422    1e-116 
ref|ZP_01264926.1|  sarcosine oxidase alpha chain [Candidatus ...   422    1e-116
gb|ABZ06303.1|  putative glycine cleavage T-protein (aminometh...   370    7e-101
gb|ABZ05929.1|  putative glycine cleavage T-protein (aminometh...   340    6e-92 
gb|ABZ06659.1|  putative glycine cleavage T-protein (aminometh...   337    7e-91 
ref|ZP_01754673.1|  sarcosine oxidase, alpha subunit family pr...   266    8e-70 
gb|EDZ42222.1|  sarcosine oxidase, alpha subunit family [Rhodo...   265    2e-69 
gb|EDZ45195.1|  sarcosine oxidase, alpha subunit family [Rhodo...   263    9e-69 
ref|YP_166984.1|  sarcosine oxidase alpha subunit family prote...   263    1e-68  
ref|YP_611611.1|  sarcosine oxidase alpha subunit family prote...   263    1e-68  
ref|YP_614139.1|  sarcosine oxidase alpha subunit family prote...   261    3e-68  
ref|ZP_02147355.1|  sarcosine oxidase, alpha subunit family pr...   260    6e-68 
ref|YP_266475.1|  sarcosine oxidase alpha chain [Candidatus Pe...   259    9e-68  
ref|ZP_02149843.1|  sarcosine oxidase, alpha subunit family pr...   259    1e-67 
ref|ZP_01754466.1|  sarcosine oxidase, alpha subunit family pr...   258    2e-67 
ref|ZP_02297488.1|  Uncharacterized NAD(FAD)-dependent dehydro...   257    5e-67 
ref|ZP_01056477.1|  sarcosine oxidase, alpha subunit family pr...   257    6e-67 
gb|EDZ61064.1|  sarcosine oxidase, alpha subunit [Candidatus P...   256    1e-66 
ref|ZP_02141516.1|  sarcosine oxidase, alpha subunit [Roseobac...   256    1e-66 
ref|YP_001328944.1|  sarcosine oxidase alpha subunit family pr...   255    2e-66  
ref|NP_384189.1|  putative sarcosine oxidase alpha subunit tra...   255    2e-66  
ref|ZP_01054876.1|  sarcosine oxidase, alpha subunit family pr...   254    6e-66 
ref|YP_682013.1|  sarcosine oxidase, alpha subunit [Roseobacte...   253    8e-66  
gb|EDY87835.1|  sarcosine oxidase, alpha subunit [Octadecabact...   250    6e-65 
ref|YP_001533452.1|  sarcosine oxidase alpha subunit family pr...   249    1e-64  
ref|YP_002362923.1|  sarcosine oxidase, alpha subunit family [...   249    1e-64  
ref|ZP_02154807.1|  sarcosine oxidase, alpha subunit family pr...   249    2e-64 
ref|ZP_01439029.1|  sarcosine oxidase, alpha subunit family pr...   247    6e-64 
gb|EEB84114.1|  sarcosine oxidase, alpha subunit family [Roseo...   247    7e-64 
ref|ZP_01546296.1|  sarcosine oxidase, alpha subunit [Stappia ...   245    2e-63 
ref|NP_106776.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   245    2e-63  
gb|ABZ05963.1|  hypothetical protein ALOHA_HF4000001L24ctg1g32...   245    2e-63 
gb|EEB80013.1|  sarcosine oxidase, alpha subunit family [marin...   244    4e-63 
ref|ZP_01002095.1|  sarcosine oxidase, alpha subunit [Loktanel...   244    4e-63 
ref|ZP_02168605.1|  sarcosine oxidase alpha subunit [Hoeflea p...   244    4e-63 
emb|CAD31286.1|  PUTATIVE SARCOSINE OXIDASE ALPHA SUBUNIT PROT...   243    1e-62 
gb|EDZ45988.1|  sarcosine oxidase, alpha subunit family [Rhodo...   243    1e-62 
ref|YP_001261448.1|  glycine cleavage T protein (aminomethyl t...   242    2e-62  
ref|NP_881143.1|  sarcosine oxidase alpha subunit [Bordetella ...   242    2e-62  
ref|NP_885663.1|  sarcosine oxidase alpha subunit [Bordetella ...   241    4e-62  
ref|YP_553716.1|  sarcosine oxidase, alpha subunit, heterotetr...   241    4e-62  
ref|YP_001062947.1|  sarcosine oxidase, alpha subunit, heterot...   239    9e-62  
ref|YP_001075894.1|  sarcosine oxidase, alpha subunit [Burkhol...   239    9e-62  
ref|YP_111378.1|  sarcosine oxidase alpha subunit [Burkholderi...   239    9e-62  
ref|ZP_02485951.1|  sarcosine oxidase, alpha subunit [Burkhold...   239    9e-62 
emb|CAD31640.1|  PROBABLE SARCOSINE OXIDASE ALPHA SUBUNIT TRAN...   239    1e-61 
ref|ZP_02407205.1|  sarcosine oxidase, alpha subunit [Burkhold...   239    1e-61 
ref|YP_771596.1|  putative sarcosine oxidase alpha subunit [Rh...   239    1e-61  
ref|ZP_03456801.1|  sarcosine oxidase, alpha subunit [Burkhold...   239    1e-61 
ref|ZP_02459962.1|  putative sarcosine oxidase alpha subunit [...   239    2e-61 
ref|YP_002277908.1|  sarcosine oxidase, alpha subunit family [...   238    2e-61  
ref|NP_356432.1|  sarcosine oxidase alpha subunit [Agrobacteri...   238    4e-61  
ref|ZP_00960211.1|  sarcosine oxidase, alpha subunit family pr...   237    5e-61 
ref|YP_743995.1|  sarcosine oxidase alpha subunit [Granulibact...   237    6e-61  
ref|YP_001985949.1|  sarcosine oxidase protein, alpha subunit ...   237    7e-61  
ref|YP_472588.1|  sarcosine oxidase alpha subunit protein [Rhi...   236    9e-61  
ref|ZP_00961139.1|  sarcosine oxidase, alpha subunit family pr...   236    1e-60 
ref|ZP_02146052.1|  sarcosine oxidase, alpha subunit family pr...   236    1e-60 
ref|ZP_02150359.1|  sarcosine oxidase, alpha subunit family pr...   236    1e-60 
ref|ZP_02366005.1|  sarcosine oxidase, alpha subunit [Burkhold...   236    1e-60 
gb|EDY77120.1|  hypothetical protein OA307_2230 [Octadecabacte...   236    2e-60 
ref|ZP_02358969.1|  sarcosine oxidase, alpha subunit [Burkhold...   235    2e-60 
ref|ZP_01036314.1|  sarcosine oxidase, alpha subunit family pr...   235    2e-60 
ref|ZP_02142150.1|  sarcosine oxidase, alpha subunit family pr...   235    2e-60 
ref|ZP_01033968.1|  sarcosine oxidase, alpha subunit family pr...   234    3e-60 
ref|NP_104289.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   234    3e-60  
ref|ZP_00955469.1|  sarcosine oxidase, alpha subunit family pr...   234    3e-60 
gb|EEB84343.1|  sarcosine oxidase, alpha subunit family [Roseo...   234    4e-60 
ref|YP_001592099.1|  sarcosine oxidase alpha subunit family pr...   234    4e-60  
ref|NP_697265.1|  sarcosine oxidase, alpha subunit [Brucella s...   234    4e-60  
ref|ZP_00962904.1|  sarcosine oxidase, alpha subunit family pr...   234    5e-60 
ref|ZP_02141429.1|  sarcosine oxidase, alpha subunit [Roseobac...   234    5e-60 
ref|ZP_01879093.1|  sarcosine oxidase, alpha subunit family pr...   234    5e-60 
ref|ZP_02054752.1|  sarcosine oxidase, alpha subunit family [M...   234    6e-60 
ref|ZP_01443151.1|  sarcosine oxidase alpha subunit [Roseovari...   233    7e-60 
ref|YP_001109958.1|  sarcosine oxidase alpha subunit family pr...   233    1e-59  
ref|YP_001924278.1|  sarcosine oxidase, alpha subunit family [...   233    1e-59  
ref|NP_107653.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   233    1e-59  
ref|YP_682094.1|  sarcosine oxidase, alpha subunit [Roseobacte...   233    1e-59  
ref|YP_439195.1|  sarcosine oxidase, alpha subunit [Burkholder...   232    1e-59  
ref|YP_002100542.1|  hypothetical protein BDAG_03838 [Burkhold...   232    1e-59  
gb|EDZ41083.1|  sarcosine oxidase, alpha subunit family [Rhodo...   232    2e-59 
ref|ZP_02188467.1|  sarcosine oxidase, alpha subunit family pr...   232    2e-59 
ref|ZP_02466697.1|  sarcosine oxidase, alpha subunit [Burkhold...   232    2e-59 
ref|YP_001258259.1|  sarcosine oxidase alpha subunit [Brucella...   232    2e-59  
ref|YP_001583482.1|  sarcosine oxidase alpha subunit family pr...   232    2e-59  
ref|YP_001533601.1|  sarcosine oxidase alpha subunit family pr...   232    2e-59  
ref|NP_519224.1|  sarcosine oxidase subunit alpha [Ralstonia s...   231    3e-59  
ref|ZP_01441755.1|  sarcosine oxidase, alpha subunit family pr...   231    3e-59 
gb|EEB72626.1|  Glycine cleavage T-protein (aminomethyl transf...   231    4e-59 
ref|NP_540637.1|  sarcosine oxidase alpha subunit [Brucella me...   230    6e-59  
ref|YP_001639122.1|  sarcosine oxidase alpha subunit family pr...   230    6e-59  
ref|YP_001238046.1|  sarcosine oxidase, alpha subunit [Bradyrh...   230    7e-59  
ref|YP_352745.1|  putative sarcosine oxidase, alpha subunit [R...   230    7e-59  
ref|YP_166827.1|  sarcosine oxidase alpha subunit family prote...   230    8e-59  
ref|YP_001368863.1|  sarcosine oxidase alpha subunit family pr...   229    1e-58  
gb|EEB71634.1|  sarcosine oxidase, alpha subunit family [Ruege...   229    1e-58 
ref|ZP_02059400.1|  sarcosine oxidase, alpha subunit family [M...   229    1e-58 
ref|NP_356575.1|  sarcosine oxidase alpha subunit [Agrobacteri...   229    1e-58  
ref|NP_521609.1|  sarcosine oxidase subunit alpha [Ralstonia s...   229    2e-58  
ref|ZP_01000874.1|  sarcosine oxidase, alpha subunit family pr...   229    2e-58 
ref|YP_001043229.1|  sarcosine oxidase alpha subunit family pr...   228    2e-58  
ref|YP_001641183.1|  sarcosine oxidase alpha subunit family pr...   228    2e-58  
ref|ZP_01879735.1|  sarcosine oxidase, alpha subunit family pr...   228    3e-58 
ref|YP_001115910.1|  sarcosine oxidase alpha subunit family pr...   228    3e-58  
ref|ZP_01223941.1|  sarcosine oxidase, alpha subunit [marine g...   228    3e-58 
ref|ZP_00631887.1|  Sarcosine oxidase, alpha subunit, heterote...   228    4e-58  
ref|YP_001168070.1|  sarcosine oxidase alpha subunit family pr...   228    4e-58  
ref|ZP_02376522.1|  sarcosine oxidase, alpha subunit family pr...   228    4e-58 
ref|YP_371246.1|  sarcosine oxidase, alpha subunit, heterotetr...   227    6e-58  
ref|ZP_01449178.1|  sarcosine oxidase, alpha subunit family pr...   227    7e-58 
gb|EEA96709.1|  sarcosine oxidase, alpha subunit family [Pseud...   226    8e-58 
ref|ZP_02118479.1|  sarcosine oxidase alpha subunit [Methyloba...   226    9e-58 
ref|YP_001926650.1|  sarcosine oxidase, alpha subunit family [...   226    9e-58  
ref|ZP_01447755.1|  sarcosine oxidase, alpha subunit family pr...   226    9e-58 
ref|YP_743901.1|  sarcosine oxidase alpha subunit [Granulibact...   226    1e-57  
ref|YP_623086.1|  sarcosine oxidase alpha subunit family prote...   226    1e-57  
ref|YP_001524410.1|  sarcosine oxidase alpha subunit [Azorhizo...   226    1e-57  
ref|YP_001778750.1|  sarcosine oxidase alpha subunit family pr...   226    1e-57  
ref|YP_001811729.1|  sarcosine oxidase alpha subunit family pr...   225    2e-57  
ref|YP_299020.1|  sarcosine oxidase, alpha subunit, heterotetr...   225    2e-57  
ref|YP_776428.1|  sarcosine oxidase alpha subunit family prote...   225    2e-57  
ref|ZP_02906041.1|  sarcosine oxidase, alpha subunit family [B...   225    2e-57 
ref|ZP_02888607.1|  sarcosine oxidase, alpha subunit family [B...   225    2e-57 
ref|YP_001859533.1|  sarcosine oxidase alpha subunit family pr...   225    2e-57  
ref|ZP_01444560.1|  sarcosine oxidase, alpha subunit family pr...   224    3e-57 
ref|YP_002095945.1|  hypothetical protein BCPG_04816 [Burkhold...   224    5e-57  
ref|YP_002234989.1|  putative sarcosine oxidase alpha subunit ...   224    5e-57  
ref|NP_356342.1|  sarcosine oxidase alpha subunit [Agrobacteri...   224    6e-57  
ref|YP_680436.1|  sarcosine oxidase, alpha subunit [Roseobacte...   224    6e-57  
ref|YP_001755163.1|  sarcosine oxidase alpha subunit family pr...   223    8e-57  
ref|ZP_02886813.1|  sarcosine oxidase, alpha subunit family [B...   223    1e-56 
ref|ZP_01157212.1|  sarcosine oxidase, alpha subunit family pr...   223    1e-56 
ref|YP_001415775.1|  sarcosine oxidase alpha subunit family pr...   222    1e-56  
ref|YP_001207761.1|  sarcosine oxidase, alpha subunit [Bradyrh...   222    2e-56  
ref|YP_001419578.1|  sarcosine oxidase alpha subunit family pr...   222    2e-56  
ref|YP_471083.1|  sarcosine oxidase alpha subunit protein [Rhi...   221    4e-56  
ref|ZP_02292175.1|  sarcosine oxidase, alpha subunit family [R...   221    5e-56 
ref|ZP_01751766.1|  sarcosine oxidase, alpha subunit family pr...   221    5e-56 
ref|YP_001888784.1|  sarcosine oxidase, alpha subunit family [...   220    6e-56  
ref|ZP_02370511.1|  sarcosine oxidase, alpha subunit [Burkhold...   220    7e-56 
ref|ZP_00997485.1|  sarcosine oxidase, alpha subunit family pr...   220    9e-56 
ref|YP_769698.1|  putative sarcosine oxidase alpha subunit [Rh...   219    1e-55  
ref|YP_001234130.1|  sarcosine oxidase alpha subunit family pr...   219    1e-55  
ref|ZP_01155888.1|  hypothetical protein OG2516_13134 [Oceanic...   219    2e-55 
ref|YP_511389.1|  sarcosine oxidase alpha subunit family prote...   218    2e-55  
ref|ZP_01078126.1|  sarcosine oxidase, alpha subunit [Marinomo...   218    3e-55 
ref|ZP_01226689.1|  sarcosine oxidase, alpha subunit [Aurantim...   218    3e-55 
ref|YP_002282849.1|  sarcosine oxidase, alpha subunit family [...   218    5e-55  
ref|ZP_02154237.1|  sarcosine oxidase, alpha subunit family pr...   217    6e-55 
ref|YP_167568.1|  sarcosine oxidase alpha subunit family prote...   217    8e-55  
ref|ZP_01740505.1|  sarcosine oxidase, alpha subunit family pr...   216    8e-55 
ref|YP_484764.1|  sarcosine oxidase alpha subunit family prote...   216    1e-54  
ref|ZP_02165983.1|  putative sarcosine oxidase alpha subunit t...   216    1e-54 
gb|EDY75574.1|  sarcosine oxidase, alpha subunit family [Octad...   215    2e-54 
gb|EDY90090.1|  sarcosine oxidase, alpha subunit [Octadecabact...   215    3e-54 
ref|ZP_01901256.1|  sarcosine oxidase, alpha subunit family pr...   215    3e-54 
ref|ZP_01748953.1|  sarcosine oxidase, alpha subunit family pr...   215    3e-54 
ref|ZP_01002738.1|  sarcosine oxidase, alpha subunit family [L...   214    3e-54 
ref|YP_001524086.1|  sarcosine oxidase alpha subunit [Azorhizo...   214    3e-54  
ref|YP_673158.1|  sarcosine oxidase alpha subunit family prote...   214    4e-54  
ref|ZP_02123205.1|  sarcosine oxidase, alpha subunit family [M...   214    4e-54 
ref|ZP_01440252.1|  sarcosine oxidase, alpha subunit [Fulvimar...   214    4e-54 
ref|ZP_01741311.1|  sarcosine oxidase, alpha subunit family pr...   214    4e-54 
ref|ZP_01155726.1|  putative sarcosine oxidase, alpha subunit ...   213    1e-53 
ref|ZP_01879218.1|  sarcosine oxidase, alpha subunit family pr...   213    1e-53 
ref|YP_610570.1|  sarcosine oxidase (alpha subunit) oxidoreduc...   213    1e-53  
ref|YP_262784.1|  sarcosine oxidase, alpha subunit [Pseudomona...   213    1e-53  
ref|YP_510128.1|  sarcosine oxidase alpha subunit family prote...   212    2e-53  
ref|YP_001683989.1|  sarcosine oxidase alpha subunit family pr...   212    2e-53  
ref|ZP_01004712.1|  sarcosine oxidase, alpha subunit family [L...   211    3e-53 
ref|NP_386959.1|  putative sarcosine oxidase alpha subunit tra...   211    3e-53  
ref|YP_001979985.1|  sarcosine oxidase protein, alpha subunit ...   211    4e-53  
ref|YP_001751727.1|  sarcosine oxidase alpha subunit family pr...   211    5e-53  
ref|NP_790307.1|  sarcosine oxidase, alpha subunit [Pseudomona...   210    7e-53 
ref|ZP_03395917.1|  sarcosine oxidase, alpha subunit [Pseudomo...   210    7e-53 
gb|EEB72508.1|  sarcosine oxidase, alpha subunit family [Ruege...   210    9e-53 
ref|YP_237780.1|  sarcosine oxidase, alpha subunit, heterotetr...   210    9e-53  
ref|YP_001666597.1|  sarcosine oxidase alpha subunit family pr...   209    1e-52  
ref|ZP_01012571.1|  sarcosine oxidase, alpha subunit family pr...   209    2e-52 
ref|YP_001265704.1|  sarcosine oxidase alpha subunit family pr...   209    2e-52  
ref|YP_276853.1|  sarcosine oxidase, alpha subunit [Pseudomona...   209    2e-52 
ref|ZP_03268938.1|  sarcosine oxidase, alpha subunit, heterote...   209    2e-52 
ref|NP_742492.1|  sarcosine oxidase, alpha subunit family [Pse...   209    2e-52  
ref|YP_350933.1|  sarcosine oxidase, alpha subunit, heterotetr...   208    3e-52  
ref|ZP_01616384.1|  sarcosine oxidase, alpha subunit [marine g...   207    4e-52 
ref|ZP_01745277.1|  sarcosine oxidase, alpha subunit family pr...   207    4e-52 
ref|ZP_02150577.1|  sarcosine oxidase, alpha subunit family pr...   207    5e-52 
ref|ZP_02147374.1|  sarcosine oxidase, alpha subunit family pr...   207    5e-52 
ref|YP_001328415.1|  sarcosine oxidase alpha subunit family pr...   207    6e-52  
ref|ZP_00630198.1|  Sarcosine oxidase, alpha subunit, heterote...   207    8e-52  
gb|EDY77809.1|  sarcosine oxidase, alpha subunit family [Octad...   206    2e-51 
ref|ZP_01038293.1|  sarcosine oxidase, alpha subunit family pr...   206    2e-51 
gb|ABZ06778.1|  putative glycine cleavage T-protein (aminometh...   206    2e-51 
ref|YP_001341606.1|  sarcosine oxidase alpha subunit family pr...   205    2e-51  
gb|EDY88256.1|  sarcosine oxidase, alpha subunit [Octadecabact...   205    2e-51 
ref|ZP_01057070.1|  sarcosine oxidase, alpha subunit family pr...   204    4e-51 
ref|YP_612966.1|  sarcosine oxidase alpha subunit family prote...   203    1e-50  
ref|ZP_00961076.1|  sarcosine oxidase, alpha subunit family pr...   201    3e-50 
ref|ZP_02186998.1|  sarcosine oxidase alpha subunit [alpha pro...   201    3e-50 
ref|ZP_01902467.1|  sarcosine oxidase, alpha subunit family pr...   201    6e-50 
ref|YP_573056.1|  sarcosine oxidase alpha subunit family prote...   200    6e-50  
ref|ZP_01753658.1|  sarcosine oxidase alpha subunit [Roseobact...   199    1e-49 
ref|NP_102901.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   199    2e-49  
gb|EDZ40585.1|  Glycine cleavage T-protein (aminomethyl transf...   198    3e-49 
ref|YP_001189608.1|  sarcosine oxidase alpha subunit family pr...   196    1e-48  
ref|ZP_01754731.1|  sarcosine oxidase, alpha subunit family pr...   194    7e-48 
ref|YP_002083910.1|  sarcosine oxidase alpha subunit [Pseudomo...   193    1e-47  
ref|NP_254105.1|  sarcosine oxidase alpha subunit [Pseudomonas...   193    1e-47  
ref|YP_270692.1|  sarcosine oxidase, alpha subunit [Colwellia ...   192    1e-47  
gb|EDZ47679.1|  sarcosine oxidase, alpha subunit family [Rhodo...   192    2e-47 
ref|YP_001351517.1|  sarcosine oxidase alpha subunit [Pseudomo...   191    3e-47  
ref|YP_002088982.1|  sarcosine oxidase alpha subunit [Pseudomo...   191    3e-47  
ref|ZP_01368438.1|  hypothetical protein PaerPA_01005598 [Pseu...   191    4e-47 
ref|YP_998771.1|  sarcosine oxidase alpha subunit family prote...   190    7e-47  
ref|ZP_01737315.1|  sarcosine oxidase alpha subunit [Marinobac...   182    1e-44 
ref|NP_105928.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   173    1e-41  
ref|YP_275145.1|  sarcosine oxidase, alpha subunit family prot...   155    2e-36 
ref|ZP_01224997.1|  Aminomethyltransferase [marine gamma prote...   154    4e-36 
ref|YP_235303.1|  aminomethyltransferase [Pseudomonas syringae...   154    4e-36  
ref|ZP_03399343.1|  sarcosine oxidase, alpha subunit [Pseudomo...   153    1e-35 
gb|EEB79908.1|  tRNA uridine 5-carboxymethylaminomethyl modifi...   152    2e-35 
ref|NP_792264.1|  sarcosine oxidase, alpha subunit [Pseudomona...   151    4e-35 
ref|ZP_03277559.1|  glycine cleavage T protein (aminomethyl tr...   150    9e-35 
ref|YP_001188956.1|  aminomethyltransferase [Pseudomonas mendo...   150    9e-35  
ref|YP_047137.1|  sarcosine oxidase (alpha subunit) oxidoreduc...   140    1e-31  
ref|YP_645232.1|  aminomethyltransferase [Rubrobacter xylanoph...   134    5e-30  
ref|YP_391618.1|  aminomethyltransferase [Thiomicrospira cruno...   134    5e-30  
ref|YP_001668339.1|  glycine cleavage T protein (aminomethyl t...   133    1e-29  
ref|YP_001106686.1|  sarcosine oxidase (alpha subunit) oxidore...   132    2e-29  
ref|ZP_01075747.1|  sarcosine oxidase, alpha subunit family pr...   130    1e-28 
ref|ZP_01626855.1|  sarcosine oxidase, alpha subunit family pr...   125    4e-27 
ref|YP_544565.1|  aminomethyltransferase [Methylobacillus flag...   121    5e-26  
ref|YP_001862026.1|  glycine cleavage T protein (aminomethyl t...   113    1e-23  
ref|ZP_00439391.1|  COG0446: Uncharacterized NAD(FAD)-dependen...   107    8e-22 
ref|YP_338415.1|  sarcosine oxidase, alpha subunit, truncation...   107    1e-21  
ref|NP_069111.1|  sarcosine oxidase, subunit alpha (soxA) [Arc...  95.5    3e-18  
ref|YP_949507.1|  sarcosine oxidase alpha subunit [Arthrobacte...  86.3    2e-15  
ref|YP_833177.1|  sarcosine oxidase alpha subunit family prote...  85.9    3e-15  
ref|ZP_02837443.1|  sarcosine oxidase, alpha subunit family [A...  84.7    5e-15 
ref|YP_001107723.1|  sarcosine oxidase (alpha subunit) oxidore...  84.3    7e-15  
gb|AAN65213.1|AF329398_3  sarcosine oxidase alpha subunit [Str...  83.6    1e-14 
ref|YP_701792.1|  sarcosine oxidase [Rhodococcus sp. RHA1] >gb...  80.1    1e-13  
ref|YP_002206035.1|  sarcosine oxidase alpha subunit [Streptom...  79.3    2e-13  
pdb|2GAG|A  Chain A, Heteroteterameric Sarcosine: Structure Of...  77.8    6e-13  
pdb|2GAH|A  Chain A, Heterotetrameric Sarcosine: Structure Of ...  77.4    8e-13  
dbj|BAD97818.1|  subunit alpha of sarocosine oxidase [Coryneba...  77.4    9e-13 
pdb|1VRQ|A  Chain A, Crystal Structure Of Heterotetrameric Sar...  77.4    9e-13  
gb|AAC62216.1|  sarcosine oxidase subunit A [Sinorhizobium mel...  76.6    1e-12 
sp|Q46337.1|SOXA_CORS1  RecName: Full=Sarcosine oxidase subuni...  67.4    9e-10 
gb|AAK16489.1|AF329478_4  sarcosine oxidase subunit A [Arthrob...  66.6    1e-09 
ref|ZP_00379371.1|  COG0446: Uncharacterized NAD(FAD)-dependen...  66.6    1e-09 
ref|ZP_01467622.1|  Dye-L-proDH alpha [Stigmatella aurantiaca ...  63.9    9e-09 
ref|ZP_01911446.1|  Ferredoxin / FAD-dependent pyridine nucleo...  61.6    5e-08 
ref|YP_001855999.1|  sarcosine oxidase alpha subunit [Kocuria ...  60.5    1e-07  
ref|YP_632142.1|  pyridine nucleotide-disulphide oxidoreductas...  57.8    6e-07  
ref|ZP_02323957.1|  FAD-dependent pyridine nucleotide-disulphi...  54.7    5e-06 
ref|YP_002133740.1|  FAD-dependent pyridine nucleotide-disulph...  54.7    5e-06  
ref|YP_465685.1|  ferredoxin / FAD-dependent pyridine nucleoti...  53.9    1e-05  
ref|YP_001378580.1|  FAD-dependent pyridine nucleotide-disulph...  53.1    2e-05  
ref|YP_182532.1|  proline dehydrogenase, alpha subunit [Thermo...  52.4    3e-05  
dbj|BAD13510.1|  Dye-L-proDH alpha [Thermococcus profundus]        50.4    1e-04 
ref|NP_126003.1|  sarcosine oxidase, subunit alpha [Pyrococcus...  50.4    1e-04  
gb|EDY40709.1|  tRNA uridine 5-carboxymethylaminomethyl modifi...  49.3    2e-04 
ref|ZP_02419852.1|  hypothetical protein ANACAC_02446 [Anaeros...  49.3    2e-04 
ref|ZP_03234904.1|  putative sarcosine oxidase, alpha subunit ...  48.9    3e-04 
ref|YP_895341.1|  sarcosine oxidase, alpha subunit [Bacillus t...  48.9    4e-04  
ref|YP_084152.1|  sarcosine oxidase, alpha subunit [Bacillus c...  48.5    4e-04  
ref|ZP_03329249.1|  ferredoxin [Thermotogales bacterium TBF 19...  48.5    5e-04 
ref|ZP_00238363.1|  sarcosine oxidase, subunit alpha [Bacillus...  48.1    5e-04 
ref|ZP_03232892.1|  sarcosine oxidase alpha subunit [Bacillus ...  48.1    6e-04 
ref|YP_028906.1|  sarcosine oxidase alpha subunit, N-terminal ...  47.4    9e-04  
ref|NP_832589.1|  sarcosine oxidase alpha subunit [Bacillus ce...  47.4    0.001  
ref|ZP_01666854.1|  proline dehydrogenase, alpha subunit [Ther...  47.4    0.001 
ref|NP_579524.1|  sarcosine oxidase subunit alpha [Pyrococcus ...  47.0    0.001  
ref|ZP_03295986.1|  hypothetical protein COLINT_01703 [Collins...  46.6    0.002 
ref|YP_001749019.1|  fumarate reductase/succinate dehydrogenas...  46.6    0.002  
gb|EDY35306.1|  tRNA uridine 5-carboxymethylaminomethyl modifi...  45.8    0.003 
gb|EDY35216.1|  tRNA uridine 5-carboxymethylaminomethyl modifi...  45.8    0.003 
ref|YP_001645471.1|  sarcosine oxidase, alpha subunit [Bacillu...  45.4    0.003  
ref|ZP_02327522.1|  hypothetical protein Plarl_07710 [Paenibac...  45.4    0.003 
ref|ZP_01550200.1|  FAD dependent oxidoreductase [Stappia aggr...  44.7    0.007 
ref|ZP_00744209.1|  Sarcosine oxidase alpha subunit [Bacillus ...  44.7    0.007 
gb|EEA99631.1|  amine oxidase, flavin-containing domain-contai...  44.3    0.007 
ref|YP_001136924.1|  hypothetical protein cgR_0061 [Corynebact...  44.3    0.007  
ref|ZP_01443595.1|  putative dehydrogenase [Roseovarius sp. HT...  44.3    0.008 
ref|YP_883390.1|  3-ketosteroid-delta-1-dehydrogenase [Mycobac...  43.9    0.010  
ref|NP_744749.1|  fumarate reductase/succinate dehydrogenase f...  43.9    0.012  
ref|ZP_02043101.1|  hypothetical protein RUMGNA_03911 [Ruminoc...  43.5    0.012 
ref|YP_001718218.1|  hypothetical protein Daud_2097 [Candidatu...  43.5    0.013  
ref|NP_782611.1|  dihydrolipoamide dehydrogenase [Clostridium ...  43.1    0.016  
gb|EEB73433.1|  proline dehydrogenase, alpha subunit [Thermoco...  43.1    0.017 
emb|CAO80836.1|  putative dye-linked L-proline dehydrogenase (...  43.1    0.017 
ref|YP_001863451.1|  fumarate reductase/succinate dehydrogenas...  43.1    0.018  
ref|YP_001613094.1|  putative NADH dehydrogenase [Sorangium ce...  42.7    0.023  
ref|ZP_03132998.1|  putative secreted protein-putative xanthan...  42.7    0.024 
ref|YP_001240605.1|  putative 3-oxosteroid 1-dehydrogenase [Br...  42.7    0.024  
ref|ZP_03127008.1|  conserved hypothetical protein [Chthonioba...  42.7    0.025 
ref|YP_001417593.1|  putative succinate dehydrogenase [Xanthob...  42.7    0.025  
ref|YP_001268432.1|  fumarate reductase/succinate dehydrogenas...  42.7    0.025  
ref|ZP_01696380.1|  FAD-dependent pyridine nucleotide-disulphi...  42.7    0.027 
ref|ZP_02432477.1|  hypothetical protein CLOSCI_02724 [Clostri...  42.7    0.027 
dbj|BAD77802.1|  dye-linked L-proline dehydrogenase alpha2 sub...  42.4    0.030 
ref|NP_143587.1|  D-nopaline dehydrogenase [Pyrococcus horikos...  42.4    0.030  
ref|YP_259557.1|  putative FAD-binding dehydrogenase [Pseudomo...  42.4    0.032  
ref|ZP_01129418.1|  putative oxidoreductase [marine actinobact...  42.0    0.037 
ref|YP_825257.1|  hypothetical protein Acid_4005 [Solibacter u...  42.0    0.042  
ref|YP_982927.1|  putative FAD-binding dehydrogenase [Polaromo...  42.0    0.043  
ref|XP_794903.2|  PREDICTED: similar to amine oxidase (flavin-...  42.0    0.045  
ref|YP_982933.1|  putative succinate dehydrogenase [Polaromona...  41.6    0.051  
ref|YP_001862205.1|  fumarate reductase/succinate dehydrogenas...  41.6    0.053  
ref|ZP_03297321.1|  hypothetical protein COLSTE_01215 [Collins...  41.6    0.056 
ref|YP_955987.1|  3-ketosteroid-delta-1-dehydrogenase [Mycobac...  41.6    0.059  
ref|YP_002307650.1|  proline dehydrogenase, alpha subunit [The...  41.2    0.060  
ref|ZP_01167758.1|  probable pyridine nucleotide-disulphide ox...  41.2    0.061 
ref|ZP_01968924.1|  hypothetical protein RUMTOR_02505 [Ruminoc...  41.2    0.062 
ref|YP_001114495.1|  hypothetical protein Dred_3168 [Desulfoto...  41.2    0.070  
ref|ZP_02207022.1|  hypothetical protein COPEUT_01824 [Coproco...  41.2    0.073 
ref|ZP_01723678.1|  Sarcosine oxidase alpha subunit [Bacillus ...  41.2    0.077 
ref|ZP_03263743.1|  fumarate reductase/succinate dehydrogenase...  40.8    0.080 
ref|YP_001132817.1|  3-ketosteroid-delta-1-dehydrogenase [Myco...  40.8    0.080  
ref|ZP_02011382.1|  FAD dependent oxidoreductase [Opitutaceae ...  40.8    0.088 
ref|ZP_01771821.1|  Hypothetical protein COLAER_00810 [Collins...  40.8    0.090 
ref|YP_001581965.1|  geranylgeranyl reductase [Nitrosopumilus ...  40.8    0.091  
ref|YP_001698497.1|  hypothetical protein Bsph_2834 [Lysinibac...  40.8    0.094  
ref|YP_705544.1|  putrescine oxidase [Rhodococcus sp. RHA1] >g...  40.8    0.10   
ref|YP_520533.1|  hypothetical protein DSY4300 [Desulfitobacte...  40.4    0.10   
ref|ZP_03270170.1|  fumarate reductase/succinate dehydrogenase...  40.4    0.11  
ref|ZP_01372601.1|  NADH:flavin oxidoreductase/NADH oxidase [D...  40.4    0.11  
ref|NP_864155.1|  hypothetical protein RB941 [Rhodopirellula b...  40.4    0.11   
ref|ZP_01725266.1|  hypothetical protein BB14905_15385 [Bacill...  40.4    0.11  
ref|ZP_01735924.1|  soluble pyridine nucleotide transhydrogena...  40.4    0.11  
ref|ZP_03293066.1|  hypothetical protein CLOHIR_01014 [Clostri...  40.4    0.12  
ref|ZP_02329462.1|  hypothetical protein Plarl_17756 [Paenibac...  40.4    0.13  
ref|NP_266414.1|  hypothetical protein L56208 [Lactococcus lac...  40.4    0.13   
ref|YP_945873.1|  putrescine oxidase [Arthrobacter aurescens T...  40.4    0.13   
ref|YP_001031621.1|  putative flavoprotein [Lactococcus lactis...  40.0    0.14   
ref|NP_929497.1|  hypothetical protein plu2240 [Photorhabdus l...  40.0    0.14   
ref|YP_808296.1|  flavoprotein [Lactococcus lactis subsp. crem...  40.0    0.14   
gb|EEB75826.1|  hypothetical protein CDSM653_205 [Carboxydibra...  40.0    0.15  
gb|EEB75816.1|  hypothetical protein CDSM653_195 [Carboxydibra...  40.0    0.15  
ref|YP_001746779.1|  invasion protein IbeA [Escherichia coli S...  40.0    0.16   
ref|YP_543969.1|  invasion protein IbeA [Escherichia coli UTI8...  40.0    0.16   
emb|CAH55802.1|  invasion protein IbeA [Escherichia coli]          40.0    0.16  
gb|AAF98391.2|  invasion protein IbeA [Escherichia coli]           40.0    0.16  
ref|NP_773445.1|  putative dehydrogenase [Bradyrhizobium japon...  40.0    0.17   
sp|Q04616.3|3O1D_RHOOP  RecName: Full=3-oxosteroid 1-dehydroge...  39.7    0.19  
ref|YP_001408579.1|  Tat pathway signal sequence domain-contai...  39.7    0.22   
ref|ZP_02013163.1|  FAD dependent oxidoreductase [Opitutaceae ...  39.3    0.23  
ref|YP_350639.1|  putative FAD-binding dehydrogenase [Pseudomo...  39.3    0.24   
ref|ZP_02190258.1|  putative dehydrogenase [alpha proteobacter...  39.3    0.24  
emb|CAQ90090.1|  conserved hypothetical protein; putative expo...  39.3    0.25  
ref|ZP_03487970.1|  hypothetical protein EUBIFOR_00535 [Eubact...  39.3    0.25  
gb|AAF19054.1|AF096929_2  3-ketosteroid dehydrogenase [Rhodoco...  39.3    0.27  
ref|ZP_00416411.1|  conserved hypothetical protein [Azotobacte...  39.3    0.28  
ref|ZP_03266240.1|  fumarate reductase/succinate dehydrogenase...  39.3    0.29  
ref|ZP_01855464.1|  probable xanthan lyase [Planctomyces maris...  39.3    0.29  
ref|YP_002240100.1|  FAD-dependent oxidoreductase [Klebsiella ...  39.3    0.29   
ref|ZP_02928926.1|  probable xanthan lyase [Verrucomicrobium s...  39.3    0.29  
ref|ZP_03124937.1|  enoate reductase [Clostridium difficile QC...  38.9    0.34  
ref|ZP_02419851.1|  hypothetical protein ANACAC_02445 [Anaeros...  38.9    0.35  
ref|YP_982015.1|  dihydrolipoamide dehydrogenase [Polaromonas ...  38.9    0.35   
sp|P35903.1|ACHC_ACHFU  RecName: Full=Achacin; Flags: Precurso...  38.9    0.36  
gb|ABY74497.1|  putrescine oxidase [Rhodococcus erythropolis]      38.9    0.38  
ref|XP_001009119.1|  amine oxidase, flavin-containing family p...  38.9    0.38   
ref|NP_891223.1|  hypothetical protein BB4691 [Bordetella bron...  38.5    0.40   
ref|YP_001695873.1|  sarcosine oxidase alpha subunit [Lysiniba...  38.5    0.41   
ref|YP_001334066.1|  hypothetical protein KPN_00384 [Klebsiell...  38.5    0.43   
ref|ZP_01772393.1|  Hypothetical protein COLAER_01399 [Collins...  38.5    0.45  
ref|XP_381934.1|  hypothetical protein FG01758.1 [Gibberella z...  38.5    0.45   
ref|YP_176361.1|  flavoprotein [Bacillus clausii KSM-K16] >dbj...  38.5    0.46   
ref|YP_002093785.1|  Pyruvate dehydrogenase complex, dehydroge...  38.5    0.47   
ref|ZP_01313445.1|  Succinate dehydrogenase [Desulfuromonas ac...  38.5    0.48  
ref|YP_001626631.1|  putrescine oxidase [Renibacterium salmoni...  38.5    0.49   
ref|YP_002031424.1|  pyruvate dehydrogenase complex E3 compone...  38.5    0.49   
ref|ZP_02388030.1|  pyruvate dehydrogenase, E3 component, dihy...  38.5    0.50  
ref|ZP_02374190.1|  pyruvate dehydrogenase, E3 component, dihy...  38.5    0.50  
ref|YP_001758239.1|  dihydrolipoamide dehydrogenase [Methyloba...  38.5    0.50   
ref|YP_369679.1|  dihydrolipoamide dehydrogenase [Burkholderia...  38.5    0.51   
ref|YP_625780.1|  dihydrolipoamide dehydrogenase [Burkholderia...  38.5    0.51   
ref|YP_002231334.1|  putative dihydrolipoamide dehydrogenase [...  38.1    0.51   
ref|ZP_01764402.1|  pyruvate dehydrogenase complex E3 componen...  38.1    0.51  
ref|ZP_02490652.1|  pyruvate dehydrogenase complex E3 componen...  38.1    0.51  
ref|YP_002097966.1|  Pyruvate/2-oxoglutarate dehydrogenase com...  38.1    0.51   
ref|YP_442396.1|  pyruvate dehydrogenase, E3 component, dihydr...  38.1    0.51   
ref|YP_001066918.1|  pyruvate dehydrogenase complex E3 compone...  38.1    0.51   
ref|YP_001059636.1|  pyruvate dehydrogenase complex E3 compone...  38.1    0.51   
ref|YP_108895.1|  putative dihydrolipoamide dehydrogenase [Bur...  38.1    0.51   
ref|YP_001765436.1|  dihydrolipoamide dehydrogenase [Burkholde...  38.1    0.52   
ref|YP_103339.1|  pyruvate dehydrogenase, E3 component, dihydr...  38.1    0.53   
ref|ZP_02403599.1|  pyruvate dehydrogenase complex E3 componen...  38.1    0.53  
ref|YP_001120052.1|  dihydrolipoamide dehydrogenase [Burkholde...  38.1    0.53   
ref|ZP_02094457.1|  hypothetical protein PEPMIC_01223 [Peptost...  38.1    0.54  
ref|ZP_02207021.1|  hypothetical protein COPEUT_01823 [Coproco...  38.1    0.54  
ref|YP_001790680.1|  dihydrolipoamide dehydrogenase [Leptothri...  38.1    0.56   
ref|ZP_02122076.1|  fumarate reductase/succinate dehydrogenase...  38.1    0.56  
ref|YP_001117832.1|  dihydrolipoamide dehydrogenase [Burkholde...  38.1    0.56   
ref|XP_001367001.1|  PREDICTED: similar to amine oxidase (flav...  38.1    0.59   
ref|ZP_03268782.1|  fumarate reductase/succinate dehydrogenase...  38.1    0.60  
ref|ZP_02170720.1|  geranylgeranyl reductase [Bacillus selenit...  38.1    0.60  
ref|ZP_01756481.1|  soluble pyridine nucleotide transhydrogena...  38.1    0.60  
ref|YP_001512146.1|  fumarate reductase/succinate dehydrogenas...  38.1    0.61   
ref|ZP_02429421.1|  hypothetical protein CLORAM_02844 [Clostri...  38.1    0.64  
ref|ZP_02079187.1|  hypothetical protein CLOLEP_00625 [Clostri...  38.1    0.64  
ref|YP_559464.1|  dihydrolipoamide dehydrogenase [Burkholderia...  38.1    0.66   
ref|ZP_02887334.1|  dihydrolipoamide dehydrogenase [Burkholder...  37.7    0.67  
ref|XP_001367053.1|  PREDICTED: similar to amine oxidase (flav...  37.7    0.68   
gb|EDX89347.1|  FAD dependent oxidoreductase, putative [Alcani...  37.7    0.68  
ref|ZP_02885878.1|  dihydrolipoamide dehydrogenase [Burkholder...  37.7    0.69  
ref|YP_549483.1|  dihydrolipoamide dehydrogenase [Polaromonas ...  37.7    0.70   
ref|ZP_01090336.1|  hypothetical protein DSM3645_21392 [Blasto...  37.7    0.71  
ref|NP_770309.1|  putative succinate dehydrogenase [Bradyrhizo...  37.7    0.72   
ref|YP_576242.1|  hypothetical protein Nham_0924 [Nitrobacter ...  37.7    0.73   
ref|YP_001918886.1|  dihydrolipoamide dehydrogenase [Natranaer...  37.7    0.74   
ref|YP_065982.1|  opine/octopine dehydrogenase, subunit A [Des...  37.7    0.74   
ref|ZP_02327052.1|  hypothetical protein Plarl_05305 [Paenibac...  37.7    0.76  
ref|YP_001328600.1|  dihydrolipoamide dehydrogenase [Sinorhizo...  37.7    0.78   
ref|ZP_03488969.1|  hypothetical protein EUBIFOR_01555 [Eubact...  37.7    0.82  
ref|ZP_01189900.1|  Dihydrolipoamide dehydrogenase [Halothermo...  37.7    0.82  
ref|YP_001192073.1|  pyridine nucleotide-disulphide oxidoreduc...  37.7    0.83   
ref|NP_387154.1|  dihydrolipoamide dehydrogenase [Sinorhizobiu...  37.7    0.83   
ref|YP_575887.1|  dihydrolipoamide dehydrogenase [Nitrobacter ...  37.7    0.84   
ref|YP_624765.1|  dihydrolipoamide dehydrogenase [Burkholderia...  37.7    0.84   
ref|ZP_02992268.1|  HI0933 family protein [Exiguobacterium sp....  37.7    0.86  
ref|ZP_01894333.1|  Pyruvate/2-oxoglutarate dehydrogenase comp...  37.4    0.91  
ref|ZP_02887350.1|  FAD-dependent pyridine nucleotide-disulphi...  37.4    0.93  
ref|ZP_00418574.1|  putative 3-oxosteroid 1-dehydrogenase [Azo...  37.4    0.93  
ref|YP_001541235.1|  ribulose-1,5-biphosphate synthetase [Cald...  37.4    0.95   
gb|AAD30450.1|AF121894_1  lipoamide dehydrogenase [Ascaris suum]   37.4    0.98  
ref|ZP_02861345.1|  hypothetical protein ANASTE_00546 [Anaerof...  37.4    0.98  
ref|NP_816133.1|  UDP-galactopyranose mutase [Enterococcus fae...  37.4    0.98   
ref|YP_001021317.1|  dihydrolipoamide dehydrogenase [Methylibi...  37.4    0.99   
ref|YP_153586.1|  glutathione reductase [Anaplasma marginale s...  37.4    0.99   
ref|NP_693222.1|  hypothetical protein OB2301 [Oceanobacillus ...  37.4    0.99   
ref|XP_002128583.1|  PREDICTED: similar to dihydrolipoamide de...  37.4    1.0    
ref|NP_377756.1|  hypothetical protein ST1775 [Sulfolobus toko...  37.4    1.1    
ref|YP_925259.1|  fumarate reductase/succinate dehydrogenase f...  37.4    1.1    
ref|YP_743364.1|  NADPH-glutathione reductase [Alkalilimnicola...  37.4    1.1    
ref|YP_001860180.1|  fumarate reductase/succinate dehydrogenas...  37.4    1.1    
ref|ZP_01728707.1|  hypothetical protein CY0110_29874 [Cyanoth...  37.4    1.1   
ref|ZP_01802594.1|  hypothetical protein CdifQ_04003580 [Clost...  37.4    1.1   
ref|YP_925245.1|  fumarate reductase/succinate dehydrogenase f...  37.0    1.1    
ref|ZP_01733462.1|  putative transmembrane CBS domain transpor...  37.0    1.2   
ref|YP_001808742.1|  dihydrolipoamide dehydrogenase [Burkholde...  37.0    1.2    
ref|YP_778381.1|  dihydrolipoamide dehydrogenase [Burkholderia...  37.0    1.2    
ref|ZP_02894230.1|  dihydrolipoamide dehydrogenase [Burkholder...  37.0    1.3   
ref|YP_611681.1|  soluble pyridine nucleotide transhydrogenase...  37.0    1.3    
ref|ZP_02929239.1|  putative secreted protein, putative xantha...  37.0    1.3   
ref|ZP_02693005.1|  hypothetical protein Epulo_07663 [Epulopis...  37.0    1.3   
ref|ZP_02909094.1|  dihydrolipoamide dehydrogenase [Burkholder...  37.0    1.3   
ref|YP_829541.1|  putrescine oxidase [Arthrobacter sp. FB24] >...  37.0    1.3    
emb|CAQ42594.1|  Flavin containing amine oxidoreductase,putati...  37.0    1.3   
ref|ZP_02363411.1|  pyruvate dehydrogenase complex E3 componen...  37.0    1.3   
ref|ZP_02356284.1|  dihydrolipoamide dehydrogenase [Burkholder...  37.0    1.4   
gb|AAN32984.1|  BarJ [Lyngbya majuscula]                           37.0    1.4   
ref|ZP_02379120.1|  dihydrolipoamide dehydrogenase [Burkholder...  37.0    1.4   
ref|NP_377847.1|  lipoamide dehydrogenase [Sulfolobus tokodaii...  37.0    1.4    
ref|YP_774062.1|  dihydrolipoamide dehydrogenase [Burkholderia...  37.0    1.4    
ref|XP_644354.1|  hypothetical protein [Dictyostelium discoide...  37.0    1.4    
ref|ZP_02329931.1|  hypothetical protein Plarl_20162 [Paenibac...  37.0    1.4   
ref|YP_001323884.1|  geranylgeranyl reductase [Methanococcus v...  37.0    1.4    
ref|YP_001896208.1|  dihydrolipoamide dehydrogenase [Burkholde...  37.0    1.4    
ref|YP_458491.1|  2-oxoglutarate dehydrogenase, E3 component, ...  37.0    1.4    
ref|YP_001857696.1|  dihydrolipoamide dehydrogenase [Burkholde...  37.0    1.4    
ref|XP_782447.2|  PREDICTED: similar to Dihydrolipoyl dehydrog...  37.0    1.4    
ref|YP_391272.1|  dihydrolipoamide dehydrogenase [Thiomicrospi...  36.6    1.5    
ref|YP_001313783.1|  BFD/(2Fe-2S)-binding domain-containing pr...  36.6    1.6    
ref|ZP_02062853.1|  dihydrolipoyl dehydrogenase [Rickettsiella...  36.6    1.7   
ref|YP_036862.1|  dihydrolipoamide dehydrogenase [Bacillus thu...  36.6    1.7    
ref|YP_959191.1|  soluble pyridine nucleotide transhydrogenase...  36.6    1.7    
ref|YP_001395159.1|  BfmBC [Clostridium kluyveri DSM 555] >gb|...  36.6    1.7    
ref|ZP_03146647.1|  FAD dependent oxidoreductase [Geobacillus ...  36.6    1.7   
ref|YP_364325.1|  putative pyridine nucleotide-disulphide oxid...  36.6    1.8    
ref|ZP_02464060.1|  dihydrolipoamide dehydrogenase [Burkholder...  36.6    1.8   
ref|YP_001579322.1|  dihydrolipoamide dehydrogenase [Burkholde...  36.6    1.8    
ref|YP_572935.1|  2,4-dienoyl-CoA reductase [Chromohalobacter ...  36.6    1.8    
ref|XP_002068171.1|  GK12667 [Drosophila willistoni] >gb|EDW79...  36.6    1.8    
ref|ZP_02012504.1|  invasion protein IbeA [Opitutaceae bacteri...  36.6    1.9   
ref|YP_198391.1|  dihydrolipoamide dehydrogenase E3 component ...  36.6    1.9    
ref|YP_148689.1|  hypothetical protein GK2836 [Geobacillus kau...  36.2    2.0    
ref|YP_456335.1|  dihydrolipoamide dehydrogenase [Aster yellow...  36.2    2.0    
ref|ZP_01811232.1|  amine oxidase [candidate division TM7 geno...  36.2    2.0   
ref|ZP_01447460.1|  hypothetical protein OM2255_09786 [alpha p...  36.2    2.0   
gb|EEB79175.1|  oxidoreductase, FAD/FMN-binding family [marine...  36.2    2.1   
ref|ZP_02931089.1|  probable xanthan lyase [Verrucomicrobium s...  36.2    2.1   
ref|ZP_02168476.1|  dihydrolipoamide dehydrogenase [Hoeflea ph...  36.2    2.1   
ref|ZP_01727730.1|  Adrenodoxin reductase [Cyanothece sp. CCY0...  36.2    2.1   
ref|XP_644355.1|  hypothetical protein [Dictyostelium discoide...  36.2    2.1    
ref|ZP_03270752.1|  dihydrolipoamide dehydrogenase [Burkholder...  36.2    2.1   
ref|XP_001956477.1|  GF24574 [Drosophila ananassae] >gb|EDV392...  36.2    2.1    
ref|YP_001667931.1|  soluble pyridine nucleotide transhydrogen...  36.2    2.2    
ref|YP_001321631.1|  fumarate reductase/succinate dehydrogenas...  36.2    2.2    
ref|NP_744300.1|  soluble pyridine nucleotide transhydrogenase...  36.2    2.2    
ref|NP_578974.1|  d-nopaline dehydrogenase [Pyrococcus furiosu...  36.2    2.3    
ref|ZP_01575419.1|  HI0933-like protein [Clostridium celluloly...  36.2    2.3   
ref|XP_001630345.1|  predicted protein [Nematostella vectensis...  36.2    2.3    
ref|YP_886656.1|  geranylgeranyl reductase [Mycobacterium smeg...  36.2    2.4    
ref|ZP_01901268.1|  soluble pyridine nucleotide transhydrogena...  36.2    2.4   
ref|YP_002351977.1|  FAD-dependent pyridine nucleotide-disulph...  36.2    2.4   

>gb|EDZ60822.1|  sarcosine oxidase alpha subunit [Candidatus Pelagibacter sp. 
HTCC7211]
Length=998

 Score =  429 bits (1102),  Expect = 1e-118, Method: Compositional matrix adjust.
 Identities = 204/233 (87%), Positives = 214/233 (91%), Gaps = 0/233 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            MTQSFRL   GLINRD+K+SFKFN   Y+GYEGDTLASALIANGVHL+GRSFKYHRPRGF
Sbjct  1    MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLVGRSFKYHRPRGF  60

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
            FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF
Sbjct  61   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  120

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
            LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG AS KHDKERYEHKYEYCDLLI GS PS
Sbjct  121  LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASTKHDKERYEHKYEYCDLLIAGSGPS  180

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK  240
            GLASAY+AAKNGA+VILAEDKSRFGGTLLTSDVNIGNQ+ K     +   LK+
Sbjct  181  GLASAYAAAKNGARVILAEDKSRFGGTLLTSDVNIGNQTGKEWADGIISELKE  233


>ref|YP_266690.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1062]
 gb|AAZ22086.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1062]
Length=998

 GENE ID: 3517319 soxA2 | sarcosine oxidase alpha chain
[Candidatus Pelagibacter ubique HTCC1062] (10 or fewer PubMed links)

 Score =  423 bits (1088),  Expect = 6e-117, Method: Compositional matrix adjust.
 Identities = 199/233 (85%), Positives = 213/233 (91%), Gaps = 0/233 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            MTQ++RL+  GLINRDKKISFKFNG  YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct  1    MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF  60

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
            FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct  61   FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF  120

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
            LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct  121  LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS  180

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK  240
            GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K     +   LK+
Sbjct  181  GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGKEWAENIISELKE  233


>ref|ZP_01264926.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1002]
 gb|EAS85413.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1002]
Length=998

 Score =  423 bits (1087),  Expect = 6e-117, Method: Compositional matrix adjust.
 Identities = 199/233 (85%), Positives = 213/233 (91%), Gaps = 0/233 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            MTQ++RL+  GLINRDKKISFKFNG  YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct  1    MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF  60

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
            FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct  61   FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF  120

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
            LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct  121  LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS  180

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK  240
            GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K     +   LK+
Sbjct  181  GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGKEWAENIISELKE  233


>gb|ABZ06303.1|  putative glycine cleavage T-protein (aminomethyl transferase) 
[uncultured marine microorganism HF4000_008G09]
Length=998

 Score =  370 bits (950),  Expect = 5e-101, Method: Compositional matrix adjust.
 Identities = 175/233 (75%), Positives = 193/233 (82%), Gaps = 0/233 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            MTQ FRL   GL+NR+K ISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct  1    MTQKFRLPNLGLVNRNKTISFHFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF  60

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
            FGAGVDEP A VQLY   +TEPN+ ATE EL EGL AKS NCWPSV FDVGAINNF   F
Sbjct  61   FGAGVDEPNAKVQLYEGDKTEPNVNATELELVEGLVAKSQNCWPSVEFDVGAINNFFSRF  120

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
             PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG AS K D  RYEHKYEYCD+L+ GS PS
Sbjct  121  FPAGFYYKTFMWPKSFWYKVYEPLIRKAAGLGVASPKPDTSRYEHKYEYCDVLVVGSGPS  180

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK  240
            GL+SAY+AAKNGA+VILAEDK RFGG+LLT DVNIGNQ+ K     + + LK+
Sbjct  181  GLSSAYAAAKNGARVILAEDKPRFGGSLLTDDVNIGNQTGKEWAEDVIKELKQ  233


>gb|ABZ05929.1|  putative glycine cleavage T-protein (aminomethyl transferase) 
[uncultured marine microorganism HF4000_001B09]
Length=998

 Score =  340 bits (872),  Expect = 6e-92, Method: Compositional matrix adjust.
 Identities = 167/221 (75%), Positives = 186/221 (84%), Gaps = 0/221 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            M+Q +RL+  G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct  1    MSQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF  60

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
            FGAGVDEP A VQLY+  +TEPN  ATE EL EGL  KS NCWPSV+FD GAINN  + F
Sbjct  61   FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEGLIVKSQNCWPSVSFDFGAINNLFQKF  120

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
             PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct  121  FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS  180

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  228
            GLASA +AAKNGA+VILAEDKSRFGG+LL  +V IGN+  K
Sbjct  181  GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK  221


>gb|ABZ06659.1|  putative glycine cleavage T-protein (aminomethyl transferase) 
[uncultured marine microorganism HF4000_133I24]
Length=998

 Score =  337 bits (863),  Expect = 6e-91, Method: Compositional matrix adjust.
 Identities = 166/221 (75%), Positives = 184/221 (83%), Gaps = 0/221 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            M Q +RL+  G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct  1    MPQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF  60

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
            FGAGVDEP A VQLY+  +TEPN  ATE EL E L  KS NCWPSV+FD GAINN  + F
Sbjct  61   FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEDLIVKSQNCWPSVSFDFGAINNLFQKF  120

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
             PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct  121  FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS  180

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  228
            GLASA +AAKNGA+VILAEDKSRFGG+LL  +V IGN+  K
Sbjct  181  GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK  221


>ref|ZP_01754673.1|  sarcosine oxidase, alpha subunit family protein [Roseobacter 
sp. SK209-2-6]
 gb|EBA16865.1|  sarcosine oxidase, alpha subunit family protein [Roseobacter 
sp. SK209-2-6]
Length=985

 Score =  270 bits (690),  Expect = 7e-71, Method: Composition-based stats.
 Identities = 128/219 (58%), Positives = 162/219 (73%), Gaps = 1/219 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            MT+  RL+ GG INR K++SF F+G  Y GYEGDTLASAL+ANG  L+GRSFKYHRPRG 
Sbjct  1    MTEVNRLD-GGQINRAKEVSFTFDGHRYKGYEGDTLASALLANGERLMGRSFKYHRPRGV  59

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
              AG +EP A+V+L + G  EPN +AT  ELF+GLEA   N WPS+ FD  A+N+    F
Sbjct  60   LTAGSEEPNALVELRKGGRQEPNTRATVIELFDGLEAAPQNAWPSLRFDAMAVNDRFSNF  119

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
            L AGFYYKTFMWPK+FW KIYEP IRKAAGLG+ S + D + Y+  + +CDLLI GS PS
Sbjct  120  LTAGFYYKTFMWPKAFWEKIYEPIIRKAAGLGSISFEEDPDLYDKGFLHCDLLIIGSGPS  179

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS  226
            GLA+A +A ++GA+VILA++  R GG L +  + +G+QS
Sbjct  180  GLAAALTAGRSGARVILADEDFRMGGRLNSETLALGDQS  218


>gb|EDZ42222.1|  sarcosine oxidase, alpha subunit family [Rhodobacterales bacterium 
HTCC2083]
Length=979

 Score =  265 bits (677),  Expect = 2e-69, Method: Composition-based stats.
 Identities = 123/219 (56%), Positives = 156/219 (71%), Gaps = 1/219 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            MTQ  R+E GG I+R+  + FKF+GK+Y G+ GDTLASAL+ANGV L+GRSFKYHRPRG 
Sbjct  1    MTQVNRVE-GGQIDRNTPLKFKFDGKSYTGHAGDTLASALLANGVRLMGRSFKYHRPRGP  59

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
              AG +EP AIV L      EPN +AT  ELF+GL A+S NCWPSV FD  A+N+    F
Sbjct  60   LSAGSEEPNAIVTLRDGARAEPNTRATTAELFDGLSARSQNCWPSVKFDALAVNDAASDF  119

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
            L AGFYYKTFMWP  FW K+YEP IRKAAGLG  S++ D + Y+  + +CDLLI G+ PS
Sbjct  120  LAAGFYYKTFMWPAPFWEKVYEPIIRKAAGLGALSMQEDPDEYDKGFRHCDLLIVGAGPS  179

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS  226
            GL +A +A + G +VILA++    GG LL+  + +G+ S
Sbjct  180  GLMAALTAGRAGKEVILADEDFAMGGRLLSEQIEVGSTS  218


>ref|YP_611611.1|  sarcosine oxidase alpha subunit family protein [Silicibacter 
sp. TM1040]
 gb|ABF62349.1|  sarcosine oxidase alpha subunit family [Silicibacter sp. TM1040]
Length=984

 GENE ID: 4075276 TM1040_3377 | sarcosine oxidase alpha subunit family protein
[Silicibacter sp. TM1040]

 Score =  263 bits (673),  Expect = 7e-69, Method: Composition-based stats.
 Identities = 123/219 (56%), Positives = 162/219 (73%), Gaps = 1/219 (0%)

Query  8    MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  67
            MTQ  R+ +GGLI+R  +++F F+GKNY GY GDTLASAL+ANGV L+GRSFKYHRPRG 
Sbjct  1    MTQVNRI-SGGLIDRSTELNFTFDGKNYQGYAGDTLASALLANGVRLMGRSFKYHRPRGV  59

Query  68   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  127
              AG +EP A+V+L   G  EPN +AT  E++EGL A S N WPS+  DV AIN+    F
Sbjct  60   LAAGSEEPNALVELRSGGRQEPNTRATVAEIYEGLSANSQNRWPSLKHDVMAINDRFSAF  119

Query  128  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  187
            L AGFYYKTFMWP++FW K+YEP IRKAAGLG+ S + D + Y+  Y +CDLL+ G+ P+
Sbjct  120  LSAGFYYKTFMWPRAFWEKLYEPVIRKAAGLGSLSGEGDPDAYDKGYLHCDLLVIGAGPA  179

Query  188  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS  226
            GL++A +A + GA+VILA++  + GG LL+   ++ NQS
Sbjct  180  GLSAALTAGRGGAQVILADEDFQLGGRLLSDAQSLCNQS  218


>ref|YP_166984.1|  sarcosine oxidase alpha subunit family protein [Silicibacter 
pomeroyi DSS-3]
 gb|AAV95026.1|  sarcosine oxidase, alpha subunit family [Silicibacter pomeroyi 
DSS-3]
Length=977

 GENE ID: 3193191 SPO1746 | sarcosine oxidase alpha subunit family protein
[Silicibacter pomeroyi DSS-3] (10 or fewer PubMed links)

 Score =  263 bits (673),  Expect = 8e-69, Method: Composition-based stats.
 Identities = 118/210 (56%), Positives = 156/210 (74%), Gaps = 0/210 (0%)

Query  13   RLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGAGV  72
            R++  GLI+RD+ +SF F+G  Y GY+GDTLASAL+AN V L+GRSFKYHRPRG   AG 
Sbjct  2    RVQGKGLIDRDRPVSFTFDGVGYSGYQGDTLASALLANEVRLVGRSFKYHRPRGILTAGS  61

Query  73   DEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLPAGF  132
            +EP A+V + R G  +PN++AT QE++EG+EA+S N WPS++FD+ AIN+    FL AGF
Sbjct  62   EEPNALVTIGRGGRQDPNVRATVQEIYEGMEAQSQNRWPSLSFDLMAINDLAAPFLGAGF  121

Query  133  YYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASA  192
            YYKTFMWP+SFW K+YEP IR+AAGLG  S + + +RYE  + +CDLL+ G+ P+GL +A
Sbjct  122  YYKTFMWPRSFWEKLYEPVIRRAAGLGALSGQDNADRYERAFAFCDLLVIGAGPAGLMAA  181

Query  193  YSAAKNGAKVILAEDKSRFGGTLLTSDVNI  222
              A + GA VILAE+ +R GG LL     I
Sbjct  182  LVAGRAGADVILAEEDARMGGRLLAETYEI  211
------------------------------------------------------------------------------------------------
b) 
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

sp|O87386.2|SOXA_RHIME  RecName: Full=Sarcosine oxidase subuni...   255    1e-67
sp|Q46337.1|SOXA_CORS1  RecName: Full=Sarcosine oxidase subuni...  67.4    6e-11
sp|Q04616.3|3O1D_RHOOP  RecName: Full=3-oxosteroid 1-dehydroge...  39.7    0.013
sp|P35903.1|ACHC_ACHFU  RecName: Full=Achacin; Flags: Precursor    38.9    0.025
sp|Q556K4.1|AOFC_DICDI  RecName: Full=Probable flavin-containi...  37.0    0.094
sp|A6US00.1|GGR_METVS  RecName: Full=Digeranylgeranylglyceroph...  37.0    0.094
sp|A1U1Y5.1|STHA_MARAV  RecName: Full=Soluble pyridine nucleot...  36.6    0.11  
sp|Q556K3.1|AOFB_DICDI  RecName: Full=Probable flavin-containi...  36.2    0.14 
sp|B0KH90.1|STHA_PSEPG  RecName: Full=Soluble pyridine nucleot...  36.2    0.15  
sp|Q88KY8.3|STHA_PSEPK  RecName: Full=Soluble pyridine nucleot...  36.2    0.15  
sp|Q25861.1|TRXR_PLAF5  RecName: Full=Thioredoxin reductase; S...  36.2    0.17 
sp|Q2NER9.1|GGR3_METST  RecName: Full=Digeranylgeranylglycerop...  36.2    0.17 
sp|P32382.1|NADO_THEBR  RecName: Full=NADH oxidase                 36.2    0.17 
sp|A6VJ23.1|GGR_METM7  RecName: Full=Digeranylgeranylglyceroph...  35.4    0.23 
sp|Q6M083.1|GGR_METMP  RecName: Full=Digeranylgeranylglyceroph...  35.4    0.24 
sp|A9A6R1.1|GGR_METM6  RecName: Full=Digeranylgeranylglyceroph...  35.4    0.25 
sp|A4FZB4.1|GGR_METM5  RecName: Full=Digeranylgeranylglyceroph...  35.4    0.26 
sp|Q6CZB1.1|STHA_ERWCT  RecName: Full=Soluble pyridine nucleot...  35.0    0.30 
sp|P32370.1|BAIH_EUBSP  RecName: Full=NADH-dependent flavin ox...  35.0    0.31 
sp|Q97ZY5.1|THI4_SULSO  Putative thiazole biosynthetic enzyme      35.0    0.33 
sp|O32434.1|PPOX_PROFF  RecName: Full=Protoporphyrinogen oxida...  35.0    0.33 
sp|Q9WZP4|THI4_THEMA  Putative thiazole biosynthetic enzyme        35.0    0.34 
sp|Q2NFF7.1|GGR2_METST  RecName: Full=Digeranylgeranylglycerop...  34.7    0.41 
sp|O18480.1|DLDH_MANSE  RecName: Full=Dihydrolipoyl dehydrogen...  34.7    0.44 
sp|P78965.2|GSHR_SCHPO  RecName: Full=Glutathione reductase; S...  34.7    0.48  
sp|Q8K9T7.1|DLDH_BUCAP  RecName: Full=Dihydrolipoyl dehydrogen...  34.7    0.49 
sp|P48639.1|GSHR_BURCE  RecName: Full=Glutathione reductase; S...  34.3    0.60 
sp|Q1QX78.1|STHA_CHRSD  RecName: Full=Soluble pyridine nucleot...  34.3    0.64  
sp|A6VW16.1|STHA_MARMS  RecName: Full=Soluble pyridine nucleot...  33.9    0.74  
sp|Q9V0J8|THI4_PYRAB  Putative thiazole biosynthetic enzyme        33.9    0.78 
sp|P54805.1|YNH2_METBA  RecName: Full=Uncharacterized protein ...  33.9    0.79 
sp|O59082.2|THI4_PYRHO  RecName: Full=Putative thiazole biosyn...  33.9    0.82 
sp|A5UNX8.1|GGR_METS3  RecName: Full=Digeranylgeranylglyceroph...  33.9    0.85 
sp|Q15ZF7.1|PEPQ_PSEA6  RecName: Full=Xaa-Pro dipeptidase; Sho...  33.5    0.92  
sp|Q3K9F5.1|STHA_PSEPF  RecName: Full=Soluble pyridine nucleot...  33.5    0.93 
sp|Q4KFA6.1|STHA_PSEF5  RecName: Full=Soluble pyridine nucleot...  33.5    0.94  
sp|O05139.3|STHA_PSEFL  RecName: Full=Soluble pyridine nucleot...  33.5    0.98 
sp|Q4ZV77.2|STHA_PSEU2  RecName: Full=Soluble pyridine nucleot...  33.5    0.99 
sp|Q48KI8.1|STHA_PSE14  RecName: Full=Soluble pyridine nucleot...  33.5    0.99  
sp|Q1I7F0.1|STHA_PSEE4  RecName: Full=Soluble pyridine nucleot...  33.5    1.0   
sp|Q9XBQ9.1|STHA_AZOVI  RecName: Full=Soluble pyridine nucleot...  33.5    1.1  
sp|A4YIV7.1|THI4_METS5  RecName: Full=Putative thiazole biosyn...  33.1    1.3   
sp|O07668.1|MRAY_ENTHR  RecName: Full=Phospho-N-acetylmuramoyl...  33.1    1.4  
sp|Q884I6.3|STHA_PSESM  RecName: Full=Soluble pyridine nucleot...  33.1    1.4  
sp|Q04829.2|DLDH_HALVO  RecName: Full=Dihydrolipoyl dehydrogen...  32.7    1.8  
sp|O29786.2|GGR_ARCFU  RecName: Full=Digeranylgeranylglyceroph...  32.3    2.1  
sp|P80647.1|DLDH_HYMDI  RecName: Full=Dihydrolipoyl dehydrogen...  32.3    2.1  
sp|Q94IG7.1|PPOCM_SPIOL  RecName: Full=Protoporphyrinogen oxid...  32.3    2.2  
sp|P19643.3|AOFB_RAT  RecName: Full=Amine oxidase [flavin-cont...  32.3    2.2   
sp|A4XSQ1.1|STHA_PSEMY  RecName: Full=Soluble pyridine nucleot...  32.3    2.2   
sp|A1RW13.2|THI4_PYRIL  RecName: Full=Putative thiazole biosyn...  32.3    2.3  
sp|P0A0E4.1|MERA_STAES  RecName: Full=Mercuric reductase; AltN...  32.3    2.4   
sp|Q5JD25|THI4_PYRKO  Putative thiazole biosynthetic enzyme        32.3    2.4   
sp|Q8U0Q5|THI4_PYRFU  Putative thiazole biosynthetic enzyme        32.0    2.5  
sp|Q9Y9Z0.2|THI4_AERPE  RecName: Full=Putative thiazole biosyn...  32.0    2.8  
sp|A7ZEX8.1|MNMC_CAMC1  RecName: Full=tRNA 5-methylaminomethyl...  32.0    3.0  
sp|P40974.1|PUO_MICRU  RecName: Full=Putrescine oxidase            32.0    3.0  
sp|Q55629.1|Y782_SYNY3  RecName: Full=Uncharacterized protein ...  32.0    3.0   
sp|O26377.1|GGR1_METTH  RecName: Full=Digeranylgeranylglycerop...  32.0    3.2   
sp|Q12YW2.1|GGR1_METBU  RecName: Full=Digeranylgeranylglycerop...  31.6    3.5  
sp|Q17043.1|APLY_APLKU  RecName: Full=Aplysianin-A; Flags: Pre...  31.6    3.8  
sp|Q975R0.1|THI4_SULTO  Putative thiazole biosynthetic enzyme      31.6    3.8  
sp|Q0TA96.1|STHA_ECOL5  RecName: Full=Soluble pyridine nucleot...  31.6    3.9   
sp|Q8R2T8.2|TF3C5_MOUSE  RecName: Full=General transcription f...  31.6    4.0   
sp|Q8FB93.3|STHA_ECOL6  RecName: Full=Soluble pyridine nucleot...  31.6    4.0  
sp|P27306.5|STHA_ECOLI  RecName: Full=Soluble pyridine nucleot...  31.6    4.0  
sp|O00087.2|DLDH_SCHPO  RecName: Full=Dihydrolipoyl dehydrogen...  31.6    4.0   
sp|Q83MI1.1|STHA_SHIFL  RecName: Full=Soluble pyridine nucleot...  31.6    4.0  
sp|A8A770.1|STHA_ECOHS  RecName: Full=Soluble pyridine nucleot...  31.6    4.0   
sp|Q8X727.3|STHA_ECO57  RecName: Full=Soluble pyridine nucleot...  31.6    4.0  
sp|P0AB60.1|YCIM_ECO57  RecName: Full=Uncharacterized protein ...  31.6    4.0  
sp|P26829.1|DHNA_BACYN  RecName: Full=NADH dehydrogenase; AltN...  31.6    4.1  
sp|Q2NFZ1.1|GGR1_METST  RecName: Full=Digeranylgeranylglycerop...  31.2    4.2  
sp|Q5R4B1.1|DLDH_PONAB  RecName: Full=Dihydrolipoyl dehydrogen...  31.2    4.3   
sp|Q60HG3.1|DLDH_MACFA  RecName: Full=Dihydrolipoyl dehydrogen...  31.2    4.3  
sp|P49819.1|DLDH_CANFA  RecName: Full=Dihydrolipoyl dehydrogen...  31.2    4.3   
sp|Q21988.3|AMX1_CAEEL  RecName: Full=Amine oxidase family mem...  31.2    4.4   
sp|Q2NQZ3.1|STHA_SODGM  RecName: Full=Soluble pyridine nucleot...  31.2    4.9   
sp|A4WG49.1|STHA_ENT38  RecName: Full=Soluble pyridine nucleot...  31.2    5.1   
sp|Q8VHE9.1|RETST_RAT  RecName: Full=All-trans-retinol 13,14-r...  30.8    5.5   
sp|Q64FW2.2|RETST_MOUSE  RecName: Full=All-trans-retinol 13,14...  30.8    5.6   
sp|Q8CIZ7.1|DLDH_CRIGR  RecName: Full=Dihydrolipoyl dehydrogen...  30.8    6.1  
sp|Q6P6R2.1|DLDH_RAT  RecName: Full=Dihydrolipoyl dehydrogenas...  30.8    6.1   
sp|P57303.1|DLDH_BUCAI  RecName: Full=Dihydrolipoyl dehydrogen...  30.8    6.1  
sp|P09623.1|DLDH_PIG  RecName: Full=Dihydrolipoyl dehydrogenas...  30.8    6.2   
sp|Q4JAF8.1|THI4_SULAC  Putative thiazole biosynthetic enzyme      30.8    6.3  
sp|P09622.1|DLDH_HUMAN  RecName: Full=Dihydrolipoyl dehydrogen...  30.8    6.3   
sp|Q54IT3.1|AOFA_DICDI  RecName: Full=Probable flavin-containi...  30.8    6.3  
sp|A4WKY7.2|THI4_PYRAR  Putative thiazole biosynthetic enzyme      30.8    6.4  
sp|O08749.2|DLDH_MOUSE  RecName: Full=Dihydrolipoyl dehydrogen...  30.8    6.5   
sp|A4VMU6.1|STHA_PSEU5  RecName: Full=Soluble pyridine nucleot...  30.8    6.7   
sp|Q465Z7.1|GGR_METBF  RecName: Full=Digeranylgeranylglyceroph...  30.8    6.9  
sp|Q6LXJ8|THI4_METMP  Putative thiazole biosynthetic enzyme        30.8    6.9  
sp|Q5BLE8.1|RETST_DANRE  RecName: Full=Putative all-trans-reti...  30.4    7.3   
sp|Q58053.1|Y636_METJA  RecName: Full=Uncharacterized protein ...  30.4    7.4  
sp|Q0KF58.1|METX_RALEH  Homoserine O-acetyltransferase (Homose...  30.4    7.7   
sp|Q12WF0.1|GGR2_METBU  RecName: Full=Digeranylgeranylglycerop...  30.4    7.7  
sp|Q8PU50.2|GGR_METMA  RecName: Full=Digeranylgeranylglyceroph...  30.4    7.8  
sp|Q8TQQ6.1|GGR_METAC  RecName: Full=Digeranylgeranylglyceroph...  30.4    7.9  
sp|A7MID0.1|TDH_ENTS8  RecName: Full=L-threonine 3-dehydrogenase   30.4    8.5   
sp|Q8BUY8.2|GASP2_MOUSE  RecName: Full=G-protein coupled recep...  30.4    8.7   
sp|Q9HUY1.1|DLDH3_PSEAE  RecName: Full=Dihydrolipoyl dehydroge...  30.0    9.6  

>sp|O87386.2|SOXA_RHIME  RecName: Full=Sarcosine oxidase subunit alpha; Short=Sarcosine 
oxidase subunit
Length=987

 Score =  255 bits (652),  Expect = 1e-67, Method: Compositional matrix adjust.
 Identities = 119/217 (54%), Positives = 155/217 (71%), Gaps = 0/217 (0%)

Query  3    QSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFG  62
             S+RL   GL++R+  +SF F+G+   G EGDTLASAL+ANG  L+GRSFKYHRPRG   
Sbjct  2    SSYRLPKRGLVDRNVPLSFTFDGRPMQGLEGDTLASALLANGRMLVGRSFKYHRPRGILT  61

Query  63   AGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLP  122
            AG  EP A+V + R G  EPN +AT QEL+EGLEA+S N WPS+ FD+GA+N  L  FL 
Sbjct  62   AGAAEPNALVTVGRGGRAEPNTRATMQELYEGLEARSQNRWPSLAFDIGALNGLLSPFLG  121

Query  123  AGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGL  182
            AGFYYKTFMWP   W K+YEP IR+AAGLG AS + D + YE  + +CDLL+ G+ P+GL
Sbjct  122  AGFYYKTFMWPAPLWEKLYEPVIRRAAGLGKASYEADPDAYEKSWAHCDLLVIGAGPTGL  181

Query  183  ASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS  219
            A+A +A + GA+VIL ++ S  GG+LL+    I  ++
Sbjct  182  AAALTAGRAGARVILVDEGSLPGGSLLSDTATIDGKA  218


>sp|Q46337.1|SOXA_CORS1  RecName: Full=Sarcosine oxidase subunit alpha; Short=Sarcosine 
oxidase subunit
Length=967

 Score = 67.4 bits (163),  Expect = 6e-11, Method: Composition-based stats.
 Identities = 56/196 (28%), Positives = 89/196 (45%), Gaps = 45/196 (22%)

Query  13   INRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGAGVDEPYAIV  72
            I+R + +    +GK    + GDT+ASA++ANG    G S    RPRG F AGV+EP A+V
Sbjct  19   IDRGEALVLTVDGKQLEAFRGDTVASAMLANGQRACGNSMYLDRPRGIFSAGVEEPNALV  78

Query  73   QLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLPAGFYYKTFMW  132
             +             EQ++ E + A +          V    N     L           
Sbjct  79   TVEAR---------HEQDINESMLAATT---------VPVTANLSATLL-----------  109

Query  133  PKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLASAYSAAKNG  192
                             GLG      D   Y+H + + D+L+ G+ P+GLA+A  A+++G
Sbjct  110  ----------------RGLGVLDPSTDPAYYDHVHVHTDVLVVGAGPAGLAAAREASRSG  153

Query  193  AKVILAEDKSRFGGTL  208
            A+V+L ++++  GG+L
Sbjct  154  ARVLLLDERAEAGGSL  169


>sp|Q04616.3|3O1D_RHOOP  RecName: Full=3-oxosteroid 1-dehydrogenase
Length=507

 Score = 39.7 bits (91),  Expect = 0.013, Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 0/46 (0%)

Query  170  CDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNI  215
            CDLL+ GS    L  AY+AA  G   I+ E   RFGGT   S  +I
Sbjct  8    CDLLVVGSGGGALTGAYTAAAQGLTTIVLEKTDRFGGTSAYSGASI  53


>sp|P35903.1|ACHC_ACHFU  RecName: Full=Achacin; Flags: Precursor
Length=531

 Score = 38.9 bits (89),  Expect = 0.025, Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query  170  CDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDV-NIGNQSVKSGQIVLF  228
             D+ + G+ PSG  SAY     G  V L E  +R GG L T+ + N+ + +++SG +  F
Sbjct  38   VDVAVVGAGPSGTYSAYKLRNKGQTVELFEYSNRIGGRLFTTHLPNVPDLNLESGGMRYF  97

Query  229  QNLKKCLMLL  238
            +N  K   +L
Sbjct  98   KNHHKIFGVL  107


>sp|Q556K4.1|AOFC_DICDI  RecName: Full=Probable flavin-containing monoamine oxidase C
Length=467

 Score = 37.0 bits (84),  Expect = 0.094, Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query  171  DLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSG  223
            D +I G   SGL +AY   K+  K+++ E ++RFGG   T  + IG+  V +G
Sbjct  6    DTIIIGGGMSGLKTAYDLKKSNFKILVLEARNRFGGR--TDSIKIGDGWVDAG  56


>sp|A6US00.1|GGR_METVS  RecName: Full=Digeranylgeranylglycerophospholipid reductase; 
Short=DGGGPL reductase; AltName: Full=2,3-di-O-geranylgeranylglyceryl 
phosphate reductase; AltName: Full=Geranylgeranyl 
reductase; Short=GGR
Length=390

 Score = 37.0 bits (84),  Expect = 0.094, Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 0/38 (0%)

Query  168  EYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFG  205
            E  D+++ G+ P+G  S+Y+A+KNGAK +L E     G
Sbjct  6    ESYDVVVVGAGPAGSMSSYNASKNGAKTLLIEKAQEIG  43


>sp|A1U1Y5.1|STHA_MARAV  RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH; 
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
Length=463

 GENE ID: 4654234 Maqu_1923 | soluble pyridine nucleotide transhydrogenase
[Marinobacter aquaeolei VT8]

 Score = 36.6 bits (83),  Expect = 0.11, Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 3/43 (6%)

Query  164  EHKYEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGG  206
            EH Y   D+++ G+ PSG  +A +AAK+  +V + EDK   GG
Sbjct  3    EHHY---DVVVIGAGPSGEGAAMNAAKHNRRVAIIEDKPTVGG  42


>sp|Q556K3.1|AOFB_DICDI  RecName: Full=Probable flavin-containing monoamine oxidase B
Length=471

 Score = 36.2 bits (82),  Expect = 0.14, Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query  167  YEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSG  223
            Y Y D +I G   SGL +AY   K+  K+++ E ++RFGG   T  V +G+  V +G
Sbjct  7    YNY-DTIIIGGGLSGLNTAYDLKKSNFKILVLEARNRFGGR--TDSVKVGDGWVDAG  60


>sp|B0KH90.1|STHA_PSEPG  RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH; 
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
Length=464

 GENE ID: 5869472 PputGB1_1692 | soluble pyridine nucleotide transhydrogenase
[Pseudomonas putida GB-1]

 Score = 36.2 bits (82),  Expect = 0.15, Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query  167  YEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGG  206
            Y Y D+++ GS P+G  +A +AAK G KV + +D+ + GG
Sbjct  4    YNY-DVVVLGSGPAGEGAAMNAAKAGRKVAMVDDRRQVGG  42


>sp|Q88KY8.3|STHA_PSEPK  RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH; 
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
 sp|A5W6F5.1|STHA_PSEP1  RecName: Full=Soluble pyridine nucleotide transhydrogenase; Short=STH; 
AltName: Full=NAD(P)(+) transhydrogenase [B-specific]
Length=464

 GENE ID: 1045007 sthA | soluble pyridine nucleotide transhydrogenase
[Pseudomonas putida KT2440] (10 or fewer PubMed links)

 Score = 36.2 bits (82),  Expect = 0.15, Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query  167  YEYCDLLITGSRPSGLASAYSAAKNGAKVILAEDKSRFGG  206
            Y Y D+++ GS P+G  +A +AAK G KV + +D+ + GG
Sbjct  4    YNY-DVVVLGSGPAGEGAAMNAAKAGRKVAMVDDRRQVGG  42


---------------------------------------------------------------------------------------------------
c)

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|EDZ60822.1|  sarcosine oxidase alpha subunit [Candidatus Pe...   422    5e-121
ref|ZP_01264926.1|  sarcosine oxidase alpha chain [Candidatus ...   415    6e-118
ref|YP_266690.1|  sarcosine oxidase alpha chain [Candidatus Pe...   415    6e-118
gb|ABZ06303.1|  putative glycine cleavage T-protein (aminometh...   363    2e-101
gb|ABZ05929.1|  putative glycine cleavage T-protein (aminometh...   352    2e-95
gb|ABZ06659.1|  putative glycine cleavage T-protein (aminometh...   349    2e-94
ref|ZP_01754673.1|  sarcosine oxidase, alpha subunit family pr...   258    5e-67
ref|YP_266475.1|  sarcosine oxidase alpha chain [Candidatus Pe...   255    3e-66
ref|ZP_01546296.1|  sarcosine oxidase, alpha subunit [Stappia ...   254    9e-66
ref|NP_106776.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   253    2e-65
ref|NP_356432.1|  sarcosine oxidase alpha subunit [Agrobacteri...   253    2e-65
gb|EDZ42222.1|  sarcosine oxidase, alpha subunit family [Rhodo...   252    3e-65
gb|EDZ61064.1|  sarcosine oxidase, alpha subunit [Candidatus P...   251    4e-65
ref|YP_001261448.1|  glycine cleavage T protein (aminomethyl t...   251    4e-65
emb|CAD31286.1|  PUTATIVE SARCOSINE OXIDASE ALPHA SUBUNIT PROT...   251    4e-65
gb|EDZ45195.1|  sarcosine oxidase, alpha subunit family [Rhodo...   251    6e-65
ref|YP_611611.1|  sarcosine oxidase alpha subunit family prote...   251    8e-65
ref|YP_001592099.1|  sarcosine oxidase alpha subunit family pr...   250    1e-64
ref|NP_697265.1|  sarcosine oxidase, alpha subunit [Brucella s...   250    1e-64
emb|CAD31640.1|  PROBABLE SARCOSINE OXIDASE ALPHA SUBUNIT TRAN...   250    1e-64
ref|YP_166984.1|  sarcosine oxidase alpha subunit family prote...   249    2e-64
ref|YP_002362923.1|  sarcosine oxidase, alpha subunit family [...   249    3e-64
ref|YP_001258259.1|  sarcosine oxidase alpha subunit [Brucella...   249    3e-64
ref|ZP_01754466.1|  sarcosine oxidase, alpha subunit family pr...   249    3e-64
ref|YP_002277908.1|  sarcosine oxidase, alpha subunit family [...   248    6e-64
ref|NP_881143.1|  sarcosine oxidase alpha subunit [Bordetella ...   248    6e-64
ref|YP_771596.1|  putative sarcosine oxidase alpha subunit [Rh...   247    8e-64
ref|ZP_02147355.1|  sarcosine oxidase, alpha subunit family pr...   247    1e-63
ref|NP_885663.1|  sarcosine oxidase alpha subunit [Bordetella ...   247    1e-63
ref|YP_614139.1|  sarcosine oxidase alpha subunit family prote...   247    1e-63
gb|ABZ05963.1|  hypothetical protein ALOHA_HF4000001L24ctg1g32...   246    1e-63
ref|ZP_02297488.1|  Uncharacterized NAD(FAD)-dependent dehydro...   246    1e-63
ref|NP_540637.1|  sarcosine oxidase alpha subunit [Brucella me...   246    1e-63
ref|ZP_02168605.1|  sarcosine oxidase alpha subunit [Hoeflea p...   246    2e-63
ref|YP_001328944.1|  sarcosine oxidase alpha subunit family pr...   246    2e-63
ref|YP_472588.1|  sarcosine oxidase alpha subunit protein [Rhi...   246    2e-63
ref|ZP_02149843.1|  sarcosine oxidase, alpha subunit family pr...   246    2e-63
ref|YP_001368863.1|  sarcosine oxidase alpha subunit family pr...   246    2e-63
ref|YP_743995.1|  sarcosine oxidase alpha subunit [Granulibact...   246    2e-63
ref|ZP_01056477.1|  sarcosine oxidase, alpha subunit family pr...   245    3e-63
ref|YP_001985949.1|  sarcosine oxidase protein, alpha subunit ...   245    4e-63
ref|ZP_02188467.1|  sarcosine oxidase, alpha subunit family pr...   245    4e-63
ref|ZP_02141516.1|  sarcosine oxidase, alpha subunit [Roseobac...   244    5e-63
ref|NP_384189.1|  putative sarcosine oxidase alpha subunit tra...   244    7e-63
ref|NP_107653.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   242    3e-62
ref|ZP_01054876.1|  sarcosine oxidase, alpha subunit family pr...   242    3e-62
ref|YP_682013.1|  sarcosine oxidase, alpha subunit [Roseobacte...   241    5e-62
ref|ZP_02118479.1|  sarcosine oxidase alpha subunit [Methyloba...   241    8e-62
ref|NP_104289.1|  sarcosine oxidase alpha subunit [Mesorhizobi...   241    8e-62
ref|ZP_01033968.1|  sarcosine oxidase, alpha subunit family pr...   241    8e-62
ref|YP_001524410.1|  sarcosine oxidase alpha subunit [Azorhizo...   240    1e-61
ref|YP_743901.1|  sarcosine oxidase alpha subunit [Granulibact...   240    1e-61
gb|EDY87835.1|  sarcosine oxidase, alpha subunit [Octadecabact...   239    2e-61
gb|EEA96709.1|  sarcosine oxidase, alpha subunit family [Pseud...   239    2e-61
ref|YP_001533452.1|  sarcosine oxidase alpha subunit family pr...   239    3e-61
ref|ZP_02059400.1|  sarcosine oxidase, alpha subunit family [M...   238    4e-61
ref|ZP_01439029.1|  sarcosine oxidase, alpha subunit family pr...   238    4e-61
ref|YP_166827.1|  sarcosine oxidase alpha subunit family prote...   238    4e-61
ref|ZP_01002095.1|  sarcosine oxidase, alpha subunit [Loktanel...   238    7e-61
gb|EEB71634.1|  sarcosine oxidase, alpha subunit family [Ruege...   237    9e-61
ref|YP_002100542.1|  hypothetical protein BDAG_03838 [Burkhold...   237    1e-60
ref|ZP_02485951.1|  sarcosine oxidase, alpha subunit [Burkhold...   237    1e-60
ref|ZP_02466697.1|  sarcosine oxidase, alpha subunit [Burkhold...   237    1e-60
ref|ZP_02459962.1|  putative sarcosine oxidase alpha subunit [...   237    1e-60
ref|ZP_02407205.1|  sarcosine oxidase, alpha subunit [Burkhold...   237    1e-60
ref|YP_439195.1|  sarcosine oxidase, alpha subunit [Burkholder...   237    1e-60
ref|YP_001062947.1|  sarcosine oxidase, alpha subunit, heterot...   237    1e-60
ref|YP_001075894.1|  sarcosine oxidase, alpha subunit [Burkhol...   237    1e-60
ref|YP_111378.1|  sarcosine oxidase alpha subunit [Burkholderi...   237    1e-60
ref|YP_001583482.1|  sarcosine oxidase alpha subunit family pr...   236    1e-60
gb|EEB80013.1|  sarcosine oxidase, alpha subunit family [marin...   236    1e-60
ref|ZP_02154807.1|  sarcosine oxidase, alpha subunit family pr...   236    1e-60
ref|YP_001419578.1|  sarcosine oxidase alpha subunit family pr...   236    1e-60
ref|ZP_03456801.1|  sarcosine oxidase, alpha subunit [Burkhold...   236    2e-60
ref|YP_001641183.1|  sarcosine oxidase alpha subunit family pr...   236    2e-60
ref|NP_519224.1|  sarcosine oxidase subunit alpha [Ralstonia s...   236    2e-60
ref|YP_001755163.1|  sarcosine oxidase alpha subunit family pr...   236    3e-60
ref|ZP_02370511.1|  sarcosine oxidase, alpha subunit [Burkhold...   236    3e-60
gb|EEB84114.1|  sarcosine oxidase, alpha subunit family [Roseo...   235    3e-60
ref|YP_001207761.1|  sarcosine oxidase, alpha subunit [Bradyrh...   235    3e-60
ref|YP_610570.1|  sarcosine oxidase (alpha subunit) oxidoreduc...   235    3e-60
ref|ZP_01449178.1|  sarcosine oxidase, alpha subunit family pr...   226    4e-60
ref|YP_001109958.1|  sarcosine oxidase alpha subunit family pr...   235    4e-60
gb|EDZ45988.1|  sarcosine oxidase, alpha subunit family [Rhodo...   235    4e-60
ref|ZP_01441755.1|  sarcosine oxidase, alpha subunit family pr...   235    4e-60
ref|YP_001926650.1|  sarcosine oxidase, alpha subunit family [...   234    1e-59
ref|NP_356342.1|  sarcosine oxidase alpha subunit [Agrobacteri...   234    1e-59
ref|NP_521609.1|  sarcosine oxidase subunit alpha [Ralstonia s...   234    1e-59
ref|YP_001238046.1|  sarcosine oxidase, alpha subunit [Bradyrh...   234    1e-59
ref|ZP_03395917.1|  sarcosine oxidase, alpha subunit [Pseudomo...   229    1e-59
ref|NP_790307.1|  sarcosine oxidase, alpha subunit [Pseudomona...   229    1e-59
ref|ZP_02366005.1|  sarcosine oxidase, alpha subunit [Burkhold...   233    1e-59
ref|ZP_02358969.1|  sarcosine oxidase, alpha subunit [Burkhold...   233    1e-59
ref|ZP_01879735.1|  sarcosine oxidase, alpha subunit family pr...   233    1e-59
ref|YP_237780.1|  sarcosine oxidase, alpha subunit, heterotetr...   228    2e-59
ref|ZP_02054752.1|  sarcosine oxidase, alpha subunit family [M...   233    2e-59
ref|ZP_01447755.1|  sarcosine oxidase, alpha subunit family pr...   233    2e-59
ref|ZP_02886813.1|  sarcosine oxidase, alpha subunit family [B...   233    2e-59
ref|YP_001751727.1|  sarcosine oxidase alpha subunit family pr...   233    2e-59
ref|YP_262784.1|  sarcosine oxidase, alpha subunit [Pseudomona...   233    2e-59

>gb|EDZ60822.1|  sarcosine oxidase alpha subunit [Candidatus Pelagibacter sp. 
HTCC7211]
Length=998

 Score =  422 bits (1084),  Expect(2) = 5e-121
 Identities = 202/221 (91%), Positives = 210/221 (95%), Gaps = 0/221 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            MTQSFRL   GLINRD+K+SFKFN   Y+GYEGDTLASALIANGVHL+GRSFKYHRPRGF
Sbjct  1    MTQSFRLNDVGLINRDRKLSFKFNSVTYYGYEGDTLASALIANGVHLVGRSFKYHRPRGF  60

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
            FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF
Sbjct  61   FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  120

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
            LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG AS KHDKERYEHKYEYCDLLI GS PS
Sbjct  121  LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASTKHDKERYEHKYEYCDLLIAGSGPS  180

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  860
            GLASAY+AAKNGA+VILAEDKSRFGGTLLTSDVNIGNQ+ K
Sbjct  181  GLASAYAAAKNGARVILAEDKSRFGGTLLTSDVNIGNQTGK  221


 Score = 38.5 bits (88),  Expect(2) = 5e-121
 Identities = 16/20 (80%), Positives = 18/20 (90%), Gaps = 0/20 (0%)
 Frame = +2

Query  857  KEWADSIVSELKEMSNVTIK  916
            KEWAD I+SELKEM NVT+K
Sbjct  221  KEWADGIISELKEMPNVTVK  240


>ref|ZP_01264926.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1002]
 gb|EAS85413.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1002]
Length=998

 Score =  415 bits (1066),  Expect(2) = 6e-118
 Identities = 197/221 (89%), Positives = 209/221 (94%), Gaps = 0/221 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            MTQ++RL+  GLINRDKKISFKFNG  YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct  1    MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF  60

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
            FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct  61   FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF  120

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
            LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct  121  LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS  180

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  860
            GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K
Sbjct  181  GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGK  221


 Score = 35.0 bits (79),  Expect(2) = 6e-118
 Identities = 14/20 (70%), Positives = 18/20 (90%), Gaps = 0/20 (0%)
 Frame = +2

Query  857  KEWADSIVSELKEMSNVTIK  916
            KEWA++I+SELKEM NV +K
Sbjct  221  KEWAENIISELKEMPNVIVK  240


>ref|YP_266690.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1062]
 gb|AAZ22086.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1062]
Length=998

 GENE ID: 3517319 soxA2 | sarcosine oxidase alpha chain
[Candidatus Pelagibacter ubique HTCC1062] (10 or fewer PubMed links)

 Score =  415 bits (1066),  Expect(2) = 6e-118
 Identities = 197/221 (89%), Positives = 209/221 (94%), Gaps = 0/221 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            MTQ++RL+  GLINRDKKISFKFNG  YFGYEGDTLASAL+ANGVHLIGRSFKYHRPRGF
Sbjct  1    MTQNYRLDNVGLINRDKKISFKFNGVTYFGYEGDTLASALLANGVHLIGRSFKYHRPRGF  60

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
            FGAGVDEPYAIVQLYRN ETEPN+KATEQELFEGLEA SVNCWPSVNFD+GAINN LKIF
Sbjct  61   FGAGVDEPYAIVQLYRNNETEPNVKATEQELFEGLEATSVNCWPSVNFDIGAINNLLKIF  120

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
            LPAGFYYKTFMWPKSFWYK+YEPFIRKAAGLG ASI+HDKERYEHKYEYCDLLI GS PS
Sbjct  121  LPAGFYYKTFMWPKSFWYKVYEPFIRKAAGLGVASIEHDKERYEHKYEYCDLLIAGSGPS  180

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  860
            GLASAY+AAKNGA+VILAEDK RFGGTLLTS+VNIGNQ+ K
Sbjct  181  GLASAYAAAKNGARVILAEDKPRFGGTLLTSEVNIGNQTGK  221


 Score = 35.0 bits (79),  Expect(2) = 6e-118
 Identities = 14/20 (70%), Positives = 18/20 (90%), Gaps = 0/20 (0%)
 Frame = +2

Query  857  KEWADSIVSELKEMSNVTIK  916
            KEWA++I+SELKEM NV +K
Sbjct  221  KEWAENIISELKEMPNVIVK  240


>gb|ABZ06303.1|  putative glycine cleavage T-protein (aminomethyl transferase) 
[uncultured marine microorganism HF4000_008G09]
Length=998

 Score =  363 bits (933),  Expect(2) = 2e-101
 Identities = 173/221 (78%), Positives = 188/221 (85%), Gaps = 0/221 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            MTQ FRL   GL+NR+K ISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct  1    MTQKFRLPNLGLVNRNKTISFHFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF  60

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
            FGAGVDEP A VQLY   +TEPN+ ATE EL EGL AKS NCWPSV FDVGAINNF   F
Sbjct  61   FGAGVDEPNAKVQLYEGDKTEPNVNATELELVEGLVAKSQNCWPSVEFDVGAINNFFSRF  120

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
             PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG AS K D  RYEHKYEYCD+L+ GS PS
Sbjct  121  FPAGFYYKTFMWPKSFWYKVYEPLIRKAAGLGVASPKPDTSRYEHKYEYCDVLVVGSGPS  180

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  860
            GL+SAY+AAKNGA+VILAEDK RFGG+LLT DVNIGNQ+ K
Sbjct  181  GLSSAYAAAKNGARVILAEDKPRFGGSLLTDDVNIGNQTGK  221


 Score = 31.6 bits (70),  Expect(2) = 2e-101
 Identities = 11/20 (55%), Positives = 16/20 (80%), Gaps = 0/20 (0%)
 Frame = +2

Query  857  KEWADSIVSELKEMSNVTIK  916
            KEWA+ ++ ELK+M NV +K
Sbjct  221  KEWAEDVIKELKQMPNVIVK  240


>gb|ABZ05929.1|  putative glycine cleavage T-protein (aminomethyl transferase) 
[uncultured marine microorganism HF4000_001B09]
Length=998

 Score =  352 bits (904),  Expect = 2e-95
 Identities = 167/221 (75%), Positives = 186/221 (84%), Gaps = 0/221 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            M+Q +RL+  G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct  1    MSQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF  60

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
            FGAGVDEP A VQLY+  +TEPN  ATE EL EGL  KS NCWPSV+FD GAINN  + F
Sbjct  61   FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEGLIVKSQNCWPSVSFDFGAINNLFQKF  120

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
             PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct  121  FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS  180

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  860
            GLASA +AAKNGA+VILAEDKSRFGG+LL  +V IGN+  K
Sbjct  181  GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK  221


>gb|ABZ06659.1|  putative glycine cleavage T-protein (aminomethyl transferase) 
[uncultured marine microorganism HF4000_133I24]
Length=998

 Score =  349 bits (895),  Expect = 2e-94
 Identities = 166/221 (75%), Positives = 184/221 (83%), Gaps = 0/221 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            M Q +RL+  G INRDKKISF FNGK YFGYEGDTLASAL+ANG+HL+GRSFKYHRPRGF
Sbjct  1    MPQKYRLDNIGYINRDKKISFTFNGKKYFGYEGDTLASALLANGIHLVGRSFKYHRPRGF  60

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
            FGAGVDEP A VQLY+  +TEPN  ATE EL E L  KS NCWPSV+FD GAINN  + F
Sbjct  61   FGAGVDEPNAKVQLYKGAKTEPNANATEVELVEDLIVKSQNCWPSVSFDFGAINNLFQKF  120

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
             PAGFYYKTFMWPKSFWYK+YEP IRKAAGLG A +K D +RYEHKYEYCD+LI GS PS
Sbjct  121  FPAGFYYKTFMWPKSFWYKVYEPIIRKAAGLGVAPLKPDPDRYEHKYEYCDVLIAGSGPS  180

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVK  860
            GLASA +AAKNGA+VILAEDKSRFGG+LL  +V IGN+  K
Sbjct  181  GLASALAAAKNGARVILAEDKSRFGGSLLVDEVTIGNKKGK  221


>ref|ZP_01754673.1|  sarcosine oxidase, alpha subunit family protein [Roseobacter 
sp. SK209-2-6]
 gb|EBA16865.1|  sarcosine oxidase, alpha subunit family protein [Roseobacter 
sp. SK209-2-6]
Length=985

 Score =  258 bits (659),  Expect = 5e-67
 Identities = 128/219 (58%), Positives = 162/219 (73%), Gaps = 1/219 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            MT+  RL+ GG INR K++SF F+G  Y GYEGDTLASAL+ANG  L+GRSFKYHRPRG 
Sbjct  1    MTEVNRLD-GGQINRAKEVSFTFDGHRYKGYEGDTLASALLANGERLMGRSFKYHRPRGV  59

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
              AG +EP A+V+L + G  EPN +AT  ELF+GLEA   N WPS+ FD  A+N+    F
Sbjct  60   LTAGSEEPNALVELRKGGRQEPNTRATVIELFDGLEAAPQNAWPSLRFDAMAVNDRFSNF  119

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
            L AGFYYKTFMWPK+FW KIYEP IRKAAGLG+ S + D + Y+  + +CDLLI GS PS
Sbjct  120  LTAGFYYKTFMWPKAFWEKIYEPIIRKAAGLGSISFEEDPDLYDKGFLHCDLLIIGSGPS  179

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS  854
            GLA+A +A ++GA+VILA++  R GG L +  + +G+QS
Sbjct  180  GLAAALTAGRSGARVILADEDFRMGGRLNSETLALGDQS  218


>ref|YP_266475.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1062]
 ref|ZP_01265173.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1002]
 gb|AAZ21871.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1062]
 gb|EAS84273.1|  sarcosine oxidase alpha chain [Candidatus Pelagibacter ubique 
HTCC1002]
Length=1002

 GENE ID: 3517368 soxA | sarcosine oxidase alpha chain
[Candidatus Pelagibacter ubique HTCC1062] (10 or fewer PubMed links)

 Score =  255 bits (652),  Expect = 3e-66
 Identities = 124/226 (54%), Positives = 163/226 (72%), Gaps = 7/226 (3%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            M ++ R+ T   I+   ++SFKFNGK+YFGY+GDTLASAL+ANG+HL+GRSFKYHRPRG 
Sbjct  1    MLKNLRVTTSKYIDETSRVSFKFNGKSYFGYKGDTLASALLANGIHLVGRSFKYHRPRGI  60

Query  378  FGAGVDEPYAIVQLYRN-GETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKI  554
              +G +EP AIVQ+  N   TEPN++ATE E++ GLEA S NCWPSVNFD+G INNFL  
Sbjct  61   MTSGSEEPNAIVQVNNNTALTEPNVRATELEIYHGLEANSQNCWPSVNFDIGGINNFLSP  120

Query  555  FLPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRP  734
             LPAGFYYKTFMWP +FW K YE  IR +AGLG +    D + Y+HKY +CD+L+ G+  
Sbjct  121  LLPAGFYYKTFMWPANFWEK-YEYVIRHSAGLGKSPTVPDPDIYDHKYIHCDVLVIGAGI  179

Query  735  SGLASAYSAAKNGAKVILAEDKSRFGGTLLTSD-----VNIGNQSV  857
            SG+ +A +AAKN  K +L ++K+  GG+ +  +     +N  N SV
Sbjct  180  SGIIAAKTAAKNNLKTLLLDEKNEIGGSTIFQNSDHIKINDQNSSV  225


>ref|ZP_01546296.1|  sarcosine oxidase, alpha subunit [Stappia aggregata IAM 12614]
 gb|EAV44852.1|  sarcosine oxidase, alpha subunit [Stappia aggregata IAM 12614]
Length=1000

 Score =  254 bits (648),  Expect = 9e-66
 Identities = 117/210 (55%), Positives = 161/210 (76%), Gaps = 0/210 (0%)
 Frame = +3

Query  198  MTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGF  377
            M+Q FR E GG I+R ++++F F+G+   G++GDTLASAL+ANGVHL+GRSFKYHRPRG 
Sbjct  1    MSQPFRTEKGGRIDRAEQLTFTFDGEEMQGHKGDTLASALLANGVHLVGRSFKYHRPRGI  60

Query  378  FGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIF  557
              AG +EP A+V +YRNG+  PN++AT+ EL++GLEA S N +PS+ FD+GA+N+ L   
Sbjct  61   LTAGSEEPNALVGVYRNGDQTPNLRATQVELYQGLEAISQNRFPSLGFDIGAVNDLLSPL  120

Query  558  LPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPS  737
             PAGFYYKTFMWP +FW K+YEP IR AAGLG      D + Y + Y +CD+L+ GS P+
Sbjct  121  FPAGFYYKTFMWPHAFWDKVYEPIIRSAAGLGKPPKNPDHDVYGNIYAHCDVLVVGSGPT  180

Query  738  GLASAYSAAKNGAKVILAEDKSRFGGTLLT  827
            GLA+A +A + GAKV+L ++++ FGG+LL+
Sbjct  181  GLAAALAAGETGAKVMLVDEQAEFGGSLLS  210


>ref|NP_106776.1|  sarcosine oxidase alpha subunit [Mesorhizobium loti MAFF303099]
 dbj|BAB52562.1|  sarcosine oxidase alpha subunit [Mesorhizobium loti MAFF303099]
Length=988

 GENE ID: 1229431 mll6238 | sarcosine oxidase alpha subunit
[Mesorhizobium loti MAFF303099] (10 or fewer PubMed links)

 Score =  253 bits (646),  Expect = 2e-65
 Identities = 116/216 (53%), Positives = 159/216 (73%), Gaps = 0/216 (0%)
 Frame = +3

Query  207  SFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFKYHRPRGFFGA  386
            S+RL +GGLI+R  ++ F F+G++  G+ GDTLASAL+ANG  L+GRSFKYHRPRG   A
Sbjct  3    SYRLPSGGLIDRHSRLGFSFDGQSLTGHAGDTLASALLANGRQLVGRSFKYHRPRGILTA  62

Query  387  GVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAINNFLKIFLPA  566
            G  EP A++ +   G TEPN +AT Q+L++GLEA+S N WPS+NFD+G++N  L  FL A
Sbjct  63   GAAEPNALMTIGSGGRTEPNTRATMQDLYDGLEARSQNRWPSLNFDIGSLNGLLSPFLAA  122

Query  567  GFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLLITGSRPSGLA  746
            GFYYKTFMWP  FW  +YEPFIR+AAGLG A+ + D +RYE  + +CDLL+ G+ P+GLA
Sbjct  123  GFYYKTFMWPAKFWEGLYEPFIRRAAGLGKATYEADPDRYEKSWAHCDLLVIGAGPAGLA  182

Query  747  SAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQS  854
            +A    + GA+VI+ ++ S  GG+LL+    +G +S
Sbjct  183  AALIVGRAGARVIILDEHSLAGGSLLSETATVGGES  218

ORF finding

PROTOCOL:

a) SMS ORF Finder / direct strand / frames 1, 2 & 3 / min 60 AA / initiation 'any codon' / 
genetic code 'standard'
b) SMS ORF Finder / reverse strand / frames 1, 2 & 3 / min 60 AA / initiation 'any codon' / 
genetic code 'standard'
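
As a rough illustration of what these settings mean, the sketch below scans the three frames of a strand for open stretches that end at a stop codon ('any codon' initiation) and keeps those of at least 60 codons, using the standard stop codons. It is not the SMS implementation: the names `find_orfs` and `reverse_complement`, the choice to ignore stretches that run off the end of the read without a stop, and the toy sequence are all assumptions made for illustration.

```python
# Stop codons of the standard genetic code.
STOPS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_orfs(seq, min_aa=60):
    """Return (frame, start, end) tuples for one strand.

    Coordinates are 1-based and inclusive; `end` includes the stop codon.
    An ORF may begin with any codon, so each ORF is simply the stretch
    between two stops (or between the sequence start and the first stop).
    Assumption: stretches not closed by a stop codon are not reported.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = frame  # 0-based index where the current open stretch begins
        for i in range(frame, len(seq) - 2, 3):
            if seq[i:i + 3] in STOPS:
                n_aa = (i - start) // 3  # codons before the stop
                if n_aa >= min_aa:
                    orfs.append((frame + 1, start + 1, i + 3))
                start = i + 3  # next stretch starts after the stop
    return orfs


# Toy sequence (hypothetical): 60 alanine codons followed by a stop.
toy = "GCT" * 60 + "TAA"
print(find_orfs(toy))             # direct strand, as in a)
print(find_orfs(reverse_complement(toy)))  # reverse strand, as in b)
```

Running the same function on the reverse complement corresponds to step b).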
---------------------------------------------------------------------------------------------------
ANALYSIS OF RESULTS:

a)
Two ORFs are found, one much longer than the other (738 bases in frame 3 versus 198 bases in 
frame 1), so we select the ORF in reading frame +3. The BLASTX analysis performed later in the 
annotation revealed a frameshift that moves the reading frame from +3 to +2.
b)
No ORF was found in any of the 3 reading frames of the reverse strand.
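
The frame jump noted in a) can be checked from the query coordinates of the two BLASTX alignments to the best hit (query 198-860 reported as Frame = +3, query 857-916 reported as Frame = +2). The following is a minimal sketch, assuming 1-based nucleotide coordinates on the direct strand; the helper name `reading_frame` is hypothetical.

```python
def reading_frame(query_start):
    """Forward-strand reading frame (1, 2 or 3) of a 1-based nucleotide start."""
    return (query_start - 1) % 3 + 1


# (query start, query end) of the two alignments to gb|EDZ60822.1|.
hsps = [(198, 860), (857, 916)]

frames = [reading_frame(start) for start, _ in hsps]
print(frames)  # [3, 2]: the first alignment is in frame +3, the second in +2

# The two alignments overlap on the query; the frameshift lies near this region.
overlap = hsps[0][1] - hsps[1][0] + 1
print(overlap)  # 4 nucleotides (positions 857 to 860)
```

The same helper gives frame 3 for the ORF start reported by the ORF finder (base 177), consistent with the choice made in a).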

---------------------------------------------------------------------------------------------------
RAW RESULTS:

a)

>ORF number 1 in reading frame 1 on the direct strand extends from base 1 to base 198.
GTTAAACGACCAGAATTAGGTGAAGAAATATCTGATCACGATTGGGATAATTTTGTTTAC
AATAGAAAAAGCTTGAGAGGAAAGCATTGGGAGTTATGGCAACATTTATCAGGTTGCAGA
CAATGGATTAAAGTTCAGAGAGATACAGCTACACACGAAATTTTTAAAACTCTTAAAGCA
AACGAAGATATTTCATAA

>Translation of ORF number 1 in reading frame 1 on the direct strand.
VKRPELGEEISDHDWDNFVYNRKSLRGKHWELWQHLSGCRQWIKVQRDTATHEIFKTLKA
NEDIS*

No ORFs were found in reading frame 2.

>ORF number 1 in reading frame 3 on the direct strand extends from base 177 to base 914.
AGCAAACGAAGATATTTCATAATGACACAAAGTTTTAGATTAGAAACTGGTGGATTAATA
AATAGAGATAAAAAAATTTCTTTTAAATTTAATGGTAAAAATTATTTTGGTTATGAGGGA
GACACTCTTGCTTCTGCATTAATTGCCAATGGAGTTCATTTAATTGGAAGAAGTTTCAAA
TATCATAGACCAAGAGGTTTTTTTGGTGCTGGGGTTGATGAGCCATATGCAATAGTTCAA
TTATACAGAAACGGTGAAACAGAGCCAAATATTAAAGCTACTGAACAAGAACTTTTTGAA
GGTCTTGAAGCAAAAAGTGTTAATTGTTGGCCGAGTGTGAATTTTGATGTTGGAGCTATA
AATAATTTTTTAAAGATATTTCTTCCTGCAGGCTTTTATTACAAGACTTTTATGTGGCCA
AAAAGTTTTTGGTATAAAATTTATGAACCATTCATCAGAAAAGCTGCTGGTTTAGGCACT
GCATCTATAAAACATGATAAAGAAAGATATGAACATAAATATGAATATTGTGATCTGCTA
ATCACAGGCTCACGTCCATCTGGATTAGCGAGTGCTTATTCAGCTGCAAAAAATGGTGCT
AAAGTAATTCTCGCAGAGGACAAATCACGATTTGGTGGAACTCTATTAACCAGTGATGTC
AATATAGGGAATCAATCAGTAAAGAGTGGGCAGATAGTATTGTTTCAGAACTTAAAGAAA
TGTCTAATGTTACTATAA

>Translation of ORF number 1 in reading frame 3 on the direct strand.
SKRRYFIMTQSFRLETGGLINRDKKISFKFNGKNYFGYEGDTLASALIANGVHLIGRSFK
YHRPRGFFGAGVDEPYAIVQLYRNGETEPNIKATEQELFEGLEAKSVNCWPSVNFDVGAI
NNFLKIFLPAGFYYKTFMWPKSFWYKIYEPFIRKAAGLGTASIKHDKERYEHKYEYCDLL
ITGSRPSGLASAYSAAKNGAKVILAEDKSRFGGTLLTSDVNIGNQSVKSGQIVLFQNLKK
CLMLL*