ORF PU18140

From Metagenes
Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary!


Sequence
CAMERA AccNum : AACY01160119.1
Annotathon code: ORF_PU18140
Sample :
  • GPS: 31°10'30"N; 64°19'27.6"W
  • Sargasso Sea: Sargasso Sea, Station 11 - Bermuda (UK)
  • Open Ocean (-5m, 20.5°C, 0.1-0.8 microns)
Authors
Team : BioCell 2006
Username : Samar
Annotated on : 2008-03-19 18:52:37
  • LEZEAU sami
  • PAPANIAN marion

Synopsis

Genomic Sequence

>AACY01160119.1 ORF_PU18140 genomic DNA
CGGCGAAAAACCGGGCTGGGGTCAGCTCCAAAAACAATTGTCGAGCTGATCGATGAGCGGGTTGACGTTACGGCTCAAGCCTTAAATCAGCCAAAAGATG
AGGTCTTCAAAAAGTTCATGCAGGGATCAATACCGTTGCTATCGCTTGGCGGCGTAACTTTGCTTGACACTGGGGCTGCACAAACAAATATGGAGGCGGC
TGATGGTCAAGATACTTAAAGAAGGGTTTAAATCATTAGCAGAAGCCACTCGCAAAGCCGAGAACAGAGCGCACGGCATGAAAGTGCCGGATAATGACGT
TACCAAAACGCAAAGCGGTGACCTTGTTATCAAGGCGATGCCAAACGAAGACCTCCAGCTTTTAAATGAATCACTAGCCAAAAACGCTGGCATCAGCAAA
GGGTTAAACCTGGGCCGTATCGGTGACATATTTGAGCTAGAGGGCTTCAACGACATTGTCTTTAATGTTGCTGATGGTGATTTTGGTCTAACTAAAGTTT
TAGAAAATATAAAAAAGAATAACAAAGAGGTTTTCGATTATCTCAAGCGTGACACCAAGTCGATGGACGATTTGATGAAACTTGCCAACGCTACTGGGTA
TGAGGGGATTATCTATAAAATGCTTGGCCGCAAAGCCGGTGATGTAGCCCCGGCAGAAGACACGCTAGCTGGTATTGTAGCTATGATAAAGTTTGGCAAA
GAAATTGAGGCGTTAGCAAAAACAGGGGCCAAGGCAACAGATATAGCTGCCAAAGAAGAAGCGTTTAAAAAGATGCGGCTGCTGGCAACAATTCAGTCTA
ACCTCGCCGCGCAAGTGTCTGGAAATGTTAGTGAATATGGACGCGGCCTTGCTGTGGTTCGGCATCTGTCAACAATTGATATTGATGCCAAAGAC

Translation

[ - /895]   direct strand


Phylogeny


Annotator commentaries

Our first analysis was a search for ORFs potentially contained in the sequence. It yielded 4 candidate ORFs.
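The ORF search can be sketched in a few lines of Python. This is a minimal illustration, not the tool actually used for this annotation: it assumes the "any codon" mode means an ORF runs from one stop codon (or the sequence start) to the next stop codon (or the sequence end), and the `min_codons` cutoff of 60 is a hypothetical parameter. Coordinates are 1-based and relative to the strand being read.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_orfs(seq, min_codons=60):
    """Yield (strand, frame, start, end, protein) for stop-to-stop ORFs.

    min_codons is a hypothetical length cutoff (the stop codon is counted).
    """
    seq = seq.upper()
    for strand, s in (("direct", seq), ("reverse", reverse_complement(seq))):
        for frame in range(3):
            codons = [s[i:i + 3] for i in range(frame, len(s) - 2, 3)]
            start = 0  # index of the first codon of the current ORF
            # A trailing None marks the end of the sequence.
            for idx, codon in enumerate(codons + [None]):
                is_stop = codon is not None and CODON_TABLE.get(codon) == "*"
                if is_stop or codon is None:
                    end = idx + 1 if is_stop else idx  # include the stop codon
                    if end - start >= min_codons:
                        protein = "".join(
                            CODON_TABLE.get(c, "X") for c in codons[start:end]
                        )
                        yield (strand, frame + 1,
                               frame + 3 * start + 1, frame + 3 * end, protein)
                    start = idx + 1


if __name__ == "__main__":
    # Toy sequence; the real 895-base contig would be passed the same way.
    for orf in find_orfs("ATGAAATAAGGGCCC", min_codons=2):
        print(orf)
```

Scanning both strands in three frames each is what produces the six "reading frame" sections listed under ORF finding.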

Next, we assessed whether these 4 ORFs are likely true or false positives. To do so, we compared the putative translation of each ORF against protein sequence databases (swissprot and nr) by sequence similarity. The Blast P results are very poor: for every ORF the E-values are close to or above 1 (the best is 0.78). No homologous sequence is therefore detected.
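The screening step amounts to reading the score and E-value columns of each hit table and keeping only the hits below a cutoff. The sketch below assumes the plain-text summary layout shown in the BLAST section (identifier, description, bit score, E-value); the `max_evalue=1e-3` threshold is an illustrative assumption, not a value stated by the annotators.

```python
import re

# One summary line: identifier, free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)


def parse_hits(text):
    """Return a list of (identifier, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = HIT_LINE.match(line.strip())
        if match and match.group("id").startswith("gi|"):
            hits.append((match.group("id"),
                         float(match.group("bits")),
                         float(match.group("evalue"))))
    return hits


def significant(hits, max_evalue=1e-3):
    """Keep hits whose E-value is at or below the (assumed) cutoff."""
    return [hit for hit in hits if hit[2] <= max_evalue]


if __name__ == "__main__":
    table = (
        "gi|6225627|sp|Q9ZDG3|LNT_RICPR  Apolipoprotein N-acyltransferase   33.1    0.78\n"
        "gi|8134799|sp|Q9WYA3|UVRC_THEMA  UvrABC system protein C (Prot...  31.6    2.6\n"
    )
    print(significant(parse_hits(table)))
```

With any reasonable cutoff, none of the hits reported for the four ORFs passes the filter.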

This suggests that the sequence is non-coding; however, the absence of homologs in the sequence databases is not conclusive proof that an ORF is non-coding. To check that we had not missed any ORF, we ran a Blast X on the full starting sequence. Here again, the E-values remain high (0.25 at best) and the scores very low.
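To see why bit scores in the 30s give E-values near 1, recall the usual relation between the two: the expected number of chance hits is roughly E = m × n × 2^(-S), where S is the bit score and m and n are the query and database lengths. The sketch below is a simplification (BLAST uses effective lengths after edge correction), and the lengths in the usage example are hypothetical, not those of the databases searched here.

```python
def expected_hits(bit_score, query_len, db_len):
    """Approximate E-value from a bit score: E = m * n * 2**(-S).

    query_len and db_len are raw lengths in residues; real BLAST runs
    use slightly smaller effective lengths.
    """
    return query_len * db_len * 2.0 ** (-bit_score)


if __name__ == "__main__":
    # Hypothetical search space: a ~300-residue translated query
    # against a database of one billion residues.
    for score in (35.0, 40.0, 60.0):
        print(score, expected_hits(score, 300, 10**9))
```

Each additional bit halves the expected number of chance hits, so a convincing match needs a score well above those reported here.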

The InterPro search for protein domains was likewise fruitless.

At our level, we can therefore only conclude that the sequence is non-coding. But perhaps we have just discovered a new gene from a still-unknown microorganism of the Sargasso Sea...

To go further, we reran Blast X with the BLOSUM45 matrix (built from alignment blocks in which sequences at least 45% identical are clustered, which makes it better suited to distant homologs than BLOSUM62); the results are no better.


Multiple Alignment


BLAST

Blast P - GenBank CDS - ORF number 1 in reading frame 1 on the direct strand 1-219

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|91770141|ref|ZP_01271970.1|  low temperature requirement A ...  33.9    2.0  
gi|110006568|gb|ABG48731.1|  methyl-coenzyme M reductase alpha...  32.0    9.4  
gi|110006550|gb|ABG48722.1|  methyl-coenzyme M reductase alpha...  31.6    9.8  

Blast P - swissprot - ORF number 1 in reading frame 1 on the direct strand 1-219
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|728875|sp|P23793|ARCA_MYCAR  Arginine deiminase (ADI) (Arginin  28.1    8.2 




Blast P - GenBank CDS - ORF number 1 in reading frame 2 on the direct strand 158-895

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|58578718|ref|YP_196930.1|  5-aminolevulinate synthase [Ehrl...  35.8    1.9
gi|57238793|ref|YP_179929.1|  5-aminolevulinate synthase [Ehrl...  35.8    1.9
gi|42782830|ref|NP_980077.1|  glyoxylase family protein [Bacil...  35.4    2.3
gi|30263749|ref|NP_846126.1|  glyoxylase family protein [Bacil...  34.3    5.2
gi|115617205|ref|XP_001203518.1|  PREDICTED: similar to ENSANG...  33.9    6.2
gi|49478939|ref|YP_037813.1|  glyoxylase family protein [Bacil...  33.9    7.3
gi|89207787|ref|ZP_01186319.1|  Glyoxalase/bleomycin resistanc...  33.9    7.4  

Blast P - swissprot - ORF number 1 in reading frame 2 on the direct strand 158-895
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|6225627|sp|Q9ZDG3|LNT_RICPR  Apolipoprotein N-acyltransferase   33.1    0.78 
gi|8134799|sp|Q9WYA3|UVRC_THEMA  UvrABC system protein C (Prot...  31.6    2.6  
gi|1346587|sp|P04933|MSP1_PLAFW  Merozoite surface protein 1 p...  30.0    7.6  
gi|1346586|sp|P04932|MSP1_PLAFK  Merozoite surface protein 1 p...  30.0    7.7  
gi|139636|sp|P18709|VITA2_XENLA  Vitellogenin-A2 precursor (VT...  29.6    9.1  
gi|82057242|sp|Q7T6X2|YR826_MIMIV  Putative serine/threonine-p...  29.6    9.2  



Blast P - GenBank CDS & swissprot - ORF number 1 in reading frame 2 on the reverse strand 104-283

No significant similarity found.



Blast P - GenBank CDS - ORF number 2 in reading frame 2 on the reverse strand  359-757

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|86157113|ref|YP_463898.1|  type II secretion system protein...  33.9    2.1
gi|90575363|ref|ZP_01231847.1|  hypothetical protein CdifQ_020...  33.9    2.5  

Blast P - swissprot - ORF number 2 in reading frame 2 on the reverse strand 359-757
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|37538292|sp|P31877|HYDE_WOLSU  Protein hydE                     30.0    2.0  
gi|6136519|sp|O83388|TILS_TREPA  tRNA(Ile)-lysidine synthase (...  28.5    6.3  
gi|2492698|sp|Q64762|FIB2_ADEG1  Fiber protein 2                   28.1    6.7  




Blast X - GenBank CDS - blosum 62
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|117621851|ref|YP_854487.1|  hypothetical protein BAPKO_5516...  39.3    0.25
gi|56560998|ref|YP_161414.1|  hypothetical protein BGP129 [Bor...  38.9    0.33
gi|56560892|ref|YP_161312.1|  hypothetical protein BGP027 [Bor...  38.9    0.33
gi|5453178|gb|AAD43467.1|AF113608_1  UspA1 [Moraxella catarrhalis  36.6    1.6
gi|42782830|ref|NP_980077.1|  glyoxylase family protein [Bacil...  35.8    2.8
gi|58578718|ref|YP_196930.1|  5-aminolevulinate synthase [Ehrl...  35.8    2.8
gi|57238793|ref|YP_179929.1|  5-aminolevulinate synthase [Ehrl...  35.8    2.8
gi|74025566|ref|XP_829349.1|  hypothetical protein Tb11.52.000...  35.8    2.8
gi|47568764|ref|ZP_00239459.1|  membrane protein, MmpL family,...  35.4    3.6
gi|88602913|ref|YP_503091.1|  methyl-accepting chemotaxis sens...  35.0    4.8
gi|52142573|ref|YP_084257.1|  conserved hypothetical protein; ...  35.0    4.8
gi|11498022|ref|NP_069246.1|  signal-transducing histidine kin...  35.0    4.8
gi|30263749|ref|NP_846126.1|  glyoxylase family protein [Bacil...  35.0    4.8
gi|118028956|ref|ZP_01500417.1|  PpiC-type peptidyl-prolyl cis...  34.7    6.2
gi|115372852|ref|ZP_01460157.1|  DifA protein [Stigmatella aur...  34.7    6.2
gi|13111594|gb|AAK12392.1|AF296098_1  polyprotein [Porcine tescho  34.7    6.2
gi|49478939|ref|YP_037813.1|  glyoxylase family protein [Bacil...  34.7    6.2
gi|29165184|emb|CAD67982.1|  variable membrane protein precursor   34.7    6.2
gi|545660|gb|AAB30051.1|  Hkr1p [Saccharomyces cerevisiae]         34.3    8.1
gi|6320628|ref|NP_010708.1|  Serine/threonine rich cell surfac...  34.3    8.1
gi|15828753|ref|NP_326113.1|  LIPOPROTEIN [Mycoplasma pulmonis...  34.3    8.1
gi|117673663|ref|ZP_01496721.1|  hypothetical protein YpseI_02...  34.3    8.1
gi|67482131|ref|XP_656415.1|  hypothetical protein 13.t00005 [...  34.3    8.1

>gi|117621851|ref|YP_854487.1|  hypothetical protein BAPKO_5516 [Borrelia afzelii PKo]
 gi|110891310|gb|ABH02468.1|  hypothetical protein BAPKO_5516 [Borrelia afzelii PKo]
Length=625

 Score = 39.3 bits (90),  Expect = 0.25
 Identities = 56/200 (28%), Positives = 88/200 (44%), Gaps = 24/200 (12%)
 Frame = +2

Query  221  EGFKSLAEATRKAENRAHGMKVPDNDVTKTQSGDLVIKAMPNEDLQLLNESLAKNAGI--  394
            + FKSLA++     ++A GMK     +     G   +K + N+  +  N+ LA  A I  
Sbjct  68   KNFKSLADSVDSVNSKAGGMKSIGKVLKSVGKGLGNVKNLANKTGEAFNQMLAAFAPIII  127

Query  395  -SKGLNL--GRIGDIFE-----LEGFNDIVFNVAD--GDFGLTKVL-ENIKKNNKEVF--  535
              K L      I  IF+     L+ FN+ V   +D  G+  L K L E+++    E    
Sbjct  128  VVKALQAIGSTISGIFDGAMDALDEFNEEVSTFSDMLGNEDLGKSLAESMRAFGDETLFT  187

Query  536  -DYLKRDTKSMDDLMKLANATGYEGIIYKMLGRKAGDVAPAEDTLAGIVAMIKFGKEIE-  709
             D +   TK+M  L   A A+  E  I +M G  AG  +   + LA + + ++   ++  
Sbjct  188  RDAITNATKTM--LSYGATASEVEERI-RMFGEAAGGSSEGLEKLAEVYSRVESSNQVNL  244

Query  710  ----ALAKTGAKATDIAAKE  757
                AL   G   TDI A+E
Sbjct  245  EDLYALRDAGVDITDILAEE  264


>gi|56560998|ref|YP_161414.1|  hypothetical protein BGP129 [Borrelia garinii PBi]
 gi|52696638|gb|AAU85979.1|  hypothetical protein BGP129 [Borrelia garinii PBi]
Length=662

 Score = 38.9 bits (89),  Expect = 0.33
 Identities = 55/200 (27%), Positives = 87/200 (43%), Gaps = 24/200 (12%)
 Frame = +2

Query  221  EGFKSLAEATRKAENRAHGMKVPDNDVTKTQSGDLVIKAMPNEDLQLLNESLAKNAGISK  400
            + FKSLA++     ++A GMK     +     G   +K + N+  +  N+ LA  A I  
Sbjct  68   KNFKSLADSVDSVNSKAGGMKSIGKVLKSVGKGLGNVKNLANKTGEAFNQMLAAFAPIII  127

Query  401  GLNL-----GRIGDIFE-----LEGFNDIVFNVAD--GDFGLTKVLENIKK--NNKEVF-  535
             +         I  IF+     L+ FN+ V   +D  G+  L K L    +   +K +F 
Sbjct  128  AVKALQAIGSTISGIFDGAMDALDEFNEEVSTFSDMLGNEDLGKSLAESMRAFGDKTLFT  187

Query  536  -DYLKRDTKSMDDLMKLANATGYEGIIYKMLGRKAGDVAPAEDTLAGIVAMIKFG-----  697
             D +   TK+M  L   A A+  E  I +M G  AG  +   + LA   + ++       
Sbjct  188  RDAITNATKTM--LSYGATASEVEERI-RMFGEAAGGSSEGLEKLAEAYSRVESSNQVNL  244

Query  698  KEIEALAKTGAKATDIAAKE  757
            K++ AL   G   TDI A+E
Sbjct  245  KDLYALRDAGVDITDILAEE  264


>gi|56560892|ref|YP_161312.1|  hypothetical protein BGP027 [Borrelia garinii PBi]
 gi|52696533|gb|AAU85877.1|  hypothetical protein BGP027 [Borrelia garinii PBi]
Length=1081

 Score = 38.9 bits (89),  Expect = 0.33
 Identities = 54/200 (27%), Positives = 86/200 (43%), Gaps = 24/200 (12%)
 Frame = +2

Query  221  EGFKSLAEATRKAENRAHGMKVPDNDVTKTQSGDLVIKAMPNEDLQLLNESLAKNAGISK  400
            + FKSLA +     ++A GMK     +     G   +K + N+  +  N+ LA  A I  
Sbjct  68   KNFKSLANSVDSVNSKASGMKSIGKVLKSVGKGLGSVKNLANKTGEAFNQMLAAFAPIII  127

Query  401  GLNL-----GRIGDIFE-----LEGFNDIVFNVAD--GDFGLTKVL-ENIKKNNKEVF--  535
             +         I  IF+     L+ FN+ V   +D  G+  L K L E+++    E    
Sbjct  128  AVKALQAIGSTISGIFDGAMDALDEFNEEVSTFSDMLGNEELGKSLAESMRAFGDETLFT  187

Query  536  -DYLKRDTKSMDDLMKLANATGYEGIIYKMLGRKAGDVAPAEDTLAGIVAMIKFGKEIE-  709
             D +   TK+M  L   A A+  E  I +M G  AG  +   + LA + + ++   ++  
Sbjct  188  RDAIANATKTM--LSYGATASEVEERI-RMFGEAAGGSSEGLEKLAEVYSRVESSNQVNL  244

Query  710  ----ALAKTGAKATDIAAKE  757
                AL   G   TDI A+E
Sbjct  245  EDLYALRDAGVDITDILAEE  264

>gi|5453178|gb|AAD43467.1|AF113608_1  UspA1 [Moraxella catarrhalis]
Length=941

 Score = 36.6 bits (83),  Expect = 1.6
 Identities = 47/168 (27%), Positives = 75/168 (44%), Gaps = 37/168 (22%)
 Frame = +2

Query  335  AMPNEDLQLLNESLAKNAGISKGLNLGRIGDIFELE--------GFNDIVFNVADGDFGL  490
            A+    L  L +++AKN    KGLN G    + EL+          N +  +VAD    +
Sbjct  318  AVNGSQLHALAKAVAKNKSDIKGLNKG----VKELDKEVGVLSRDINSLHDDVADNQDSI  373

Query  491  TKVLENIKKNNKEVFDYLKRDTKSMDDLMKLANATGYEGIIYKMLGRKAGDVAPAEDTLA  670
             K   +IK  NKEV        K +D  +         G++ + +G    DVA  +D++A
Sbjct  374  AKNKADIKGLNKEV--------KELDKEV---------GVLSRDIGSLHDDVADNQDSIA  416

Query  671  GIVAMIK-FGKEIEALAK-TGAKATDIAAKEEAFKKMRLLATIQSNLA  808
               A IK   KE++ L K  G  + DI +  +       +AT Q+++A
Sbjct  417  KNKADIKGLNKEVKELDKEVGVLSRDIGSLHDD------VATNQADIA  458


>gi|42782830|ref|NP_980077.1|  glyoxylase family protein [Bacillus cereus ATCC 10987]
 gi|42738757|gb|AAS42685.1|  glyoxylase family protein [Bacillus cereus ATCC 10987]
Length=129

 Score = 35.8 bits (81),  Expect = 2.8
 Identities = 24/78 (30%), Positives = 45/78 (57%), Gaps = 9/78 (11%)
 Frame = +2

Query  398  KGLNLGRIGDIFELEGFNDIVFNVADGDFGL--TKVLENI-----KKNNKEVFDYLKRDT  556
            KGL L RIG+ ++ EG++ ++F + D ++ L  T+ ++        K+N  VF Y+  D+
Sbjct  25   KGLGLKRIGEFYDHEGYDGVMFGLPDEEYHLEFTRHIDGSPCPAPTKDNLLVF-YMHEDS  83

Query  557  KSMDDLMKLANATGYEGI  610
            + M  + K  +A GY+ +
Sbjct  84   E-MKKVSKRLHALGYDEV  100


Blast X - GenBank CDS - Blosum 45
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|58578718|ref|YP_196930.1|  5-aminolevulinate synthase [Ehrl...  36.5    1.3   
gi|57238793|ref|YP_179929.1|  5-aminolevulinate synthase [Ehrl...  36.5    1.3   
gi|82914955|ref|XP_728910.1|  235 kDa rhoptry protein [Plasmod...  35.9    1.9   
gi|118049848|ref|ZP_01518399.1|  response regulator receiver p...  35.7    2.4  
gi|152097|gb|AAA26216.1|  5-aminolevulinic acid synthase (ALAS)    35.7    2.4  
gi|82736622|ref|ZP_00899479.1|  ATPase, E1-E2 type:Copper-tran...  35.4    2.9  
gi|114592992|ref|XP_517070.2|  PREDICTED: hypothetical protein [P  35.1    3.5   
gi|77954169|ref|ZP_00818568.1|  peptidyl-prolyl cis-trans isom...  35.1    3.5  
gi|56560892|ref|YP_161312.1|  hypothetical protein BGP027 [Bor...  34.8    4.3  
gi|117621851|ref|YP_854487.1|  hypothetical protein BAPKO_5516...  34.5    5.3   
gi|595867|gb|AAA59392.1|  colicin protein                          34.5    5.3  
gi|25991459|gb|AAN76845.1|AF453415_1  colicin Ia [Escherichia fer  34.5    5.3  
gi|595885|gb|AAA59404.1|  colicin protein                          34.5    5.3  
gi|595873|gb|AAA59396.1|  colicin protein                          34.5    5.3  
gi|595870|gb|AAA59394.1|  colicin protein                          34.5    5.3  
gi|595876|gb|AAA59398.1|  colicin protein                          34.5    5.3  
gi|595882|gb|AAA59402.1|  colicin protein                          34.5    5.3  
gi|78484706|ref|YP_390631.1|  methyl-accepting chemotaxis sens...  34.5    5.3   
gi|2914580|pdb|1CII|   Chain  , Colicin Ia                         34.5    5.3   
gi|37999919|sp|P06716|CEIA_ECOLI  Colicin-Ia >gi|6960321|gb|AA...  34.5    5.3  
gi|88932725|ref|ZP_01138407.1|  hypothetical protein DehaBAV1D...  34.5    5.3  
gi|68171327|ref|ZP_00544725.1|  5-aminolevulinic acid synthase...  34.5    5.3  
gi|115251580|emb|CAJ69413.1|  glycogen branching enzyme [Clostrid  34.2    6.5  
gi|90573911|ref|ZP_01230419.1|  hypothetical protein CdifQ_020...  34.2    6.5  
gi|56560998|ref|YP_161414.1|  hypothetical protein BGP129 [Bor...  34.2    6.5  
gi|31044147|gb|AAP42859.1|  NanA5 [Streptomyces nanchangensis]     34.2    6.5  
gi|51464412|ref|XP_379250.2|  PREDICTED: hypothetical protein [Ho  34.2    6.5   
gi|15607099|ref|NP_214481.1|  hypothetical protein aq_2159 [Aq...  34.2    6.5   
gi|90426295|ref|YP_534665.1|  5-aminolevulinic acid synthase [...  34.2    6.5   
gi|115526775|ref|YP_783686.1|  5-aminolevulinic acid synthase ...  34.2    6.5   
gi|117673663|ref|ZP_01496721.1|  hypothetical protein YpseI_02...  34.2    6.5  
gi|73666689|ref|YP_302705.1|  5-aminolevulinate synthase [Ehrl...  33.9    8.0   
gi|42519606|ref|NP_965536.1|  cation-transporting ATPase [Lact...  33.9    8.0   
gi|27376311|ref|NP_767840.1|  5-aminolevulinate synthase [Brad...  33.9    8.0   
gi|78696750|ref|ZP_00861259.1|  5-aminolevulinic acid synthase...  33.9    8.0  
gi|113460933|ref|YP_719000.1|  possible large adhesin [Haemoph...  33.6    9.8   
gi|107025409|ref|YP_622920.1|  transcriptional regulator, AsnC...  33.6    9.8   
gi|11496688|ref|NP_045470.1|  hypothetical protein BBG10 [Borr...  33.6    9.8   
gi|89902660|ref|YP_525131.1|  CRISPR-associated helicase Cas3 ...  33.6    9.8   
gi|77404658|ref|YP_345232.1|  Transketolase [Rhodobacter sphae...  33.6    9.8   
gi|83949644|ref|ZP_00958377.1|  ribonuclease, Rne/Rng family p...  33.6    9.8  
gi|47568764|ref|ZP_00239459.1|  membrane protein, MmpL family,...  33.6    9.8  

ORF finding

Direct strand - any codon - standard genetic code


>ORF number 1 in reading frame 1 on the direct strand extends from base 1 to base 219.
CGGCGAAAAACCGGGCTGGGGTCAGCTCCAAAAACAATTGTCGAGCTGATCGATGAGCGG
GTTGACGTTACGGCTCAAGCCTTAAATCAGCCAAAAGATGAGGTCTTCAAAAAGTTCATG
CAGGGATCAATACCGTTGCTATCGCTTGGCGGCGTAACTTTGCTTGACACTGGGGCTGCA
CAAACAAATATGGAGGCGGCTGATGGTCAAGATACTTAA

>Translation of ORF number 1 in reading frame 1 on the direct strand.
RRKTGLGSAPKTIVELIDERVDVTAQALNQPKDEVFKKFMQGSIPLLSLGGVTLLDTGAA
QTNMEAADGQDT*

>ORF number 1 in reading frame 2 on the direct strand extends from base 158 to base 895.
CTTTGCTTGACACTGGGGCTGCACAAACAAATATGGAGGCGGCTGATGGTCAAGATACTT
AAAGAAGGGTTTAAATCATTAGCAGAAGCCACTCGCAAAGCCGAGAACAGAGCGCACGGC
ATGAAAGTGCCGGATAATGACGTTACCAAAACGCAAAGCGGTGACCTTGTTATCAAGGCG
ATGCCAAACGAAGACCTCCAGCTTTTAAATGAATCACTAGCCAAAAACGCTGGCATCAGC
AAAGGGTTAAACCTGGGCCGTATCGGTGACATATTTGAGCTAGAGGGCTTCAACGACATT
GTCTTTAATGTTGCTGATGGTGATTTTGGTCTAACTAAAGTTTTAGAAAATATAAAAAAG
AATAACAAAGAGGTTTTCGATTATCTCAAGCGTGACACCAAGTCGATGGACGATTTGATG
AAACTTGCCAACGCTACTGGGTATGAGGGGATTATCTATAAAATGCTTGGCCGCAAAGCC
GGTGATGTAGCCCCGGCAGAAGACACGCTAGCTGGTATTGTAGCTATGATAAAGTTTGGC
AAAGAAATTGAGGCGTTAGCAAAAACAGGGGCCAAGGCAACAGATATAGCTGCCAAAGAA
GAAGCGTTTAAAAAGATGCGGCTGCTGGCAACAATTCAGTCTAACCTCGCCGCGCAAGTG
TCTGGAAATGTTAGTGAATATGGACGCGGCCTTGCTGTGGTTCGGCATCTGTCAACAATT
GATATTGATGCCAAAGAC

>Translation of ORF number 1 in reading frame 2 on the direct strand.
LCLTLGLHKQIWRRLMVKILKEGFKSLAEATRKAENRAHGMKVPDNDVTKTQSGDLVIKA
MPNEDLQLLNESLAKNAGISKGLNLGRIGDIFELEGFNDIVFNVADGDFGLTKVLENIKK
NNKEVFDYLKRDTKSMDDLMKLANATGYEGIIYKMLGRKAGDVAPAEDTLAGIVAMIKFG
KEIEALAKTGAKATDIAAKEEAFKKMRLLATIQSNLAAQVSGNVSEYGRGLAVVRHLSTI
DIDAKD

No ORFs were found in reading frame 3.



Reverse strand - any codon - standard genetic code

No ORFs were found in reading frame 1.

>ORF number 1 in reading frame 2 on the reverse strand extends from base 104 to base 283.
ATTGTTGCCAGCAGCCGCATCTTTTTAAACGCTTCTTCTTTGGCAGCTATATCTGTTGCC
TTGGCCCCTGTTTTTGCTAACGCCTCAATTTCTTTGCCAAACTTTATCATAGCTACAATA
CCAGCTAGCGTGTCTTCTGCCGGGGCTACATCACCGGCTTTGCGGCCAAGCATTTTATAG


>Translation of ORF number 1 in reading frame 2 on the reverse strand.
IVASSRIFLNASSLAAISVALAPVFANASISLPNFIIATIPASVSSAGATSPALRPSIL*


>ORF number 2 in reading frame 2 on the reverse strand extends from base 359 to base 757.
TCGAAAACCTCTTTGTTATTCTTTTTTATATTTTCTAAAACTTTAGTTAGACCAAAATCA
CCATCAGCAACATTAAAGACAATGTCGTTGAAGCCCTCTAGCTCAAATATGTCACCGATA
CGGCCCAGGTTTAACCCTTTGCTGATGCCAGCGTTTTTGGCTAGTGATTCATTTAAAAGC
TGGAGGTCTTCGTTTGGCATCGCCTTGATAACAAGGTCACCGCTTTGCGTTTTGGTAACG
TCATTATCCGGCACTTTCATGCCGTGCGCTCTGTTCTCGGCTTTGCGAGTGGCTTCTGCT
AATGATTTAAACCCTTCTTTAAGTATCTTGACCATCAGCCGCCTCCATATTTGTTTGTGC
AGCCCCAGTGTCAAGCAAAGTTACGCCGCCAAGCGATAG

>Translation of ORF number 2 in reading frame 2 on the reverse strand.
SKTSLLFFFIFSKTLVRPKSPSATLKTMSLKPSSSNMSPIRPRFNPLLMPAFLASDSFKS
WRSSFGIALITRSPLCVLVTSLSGTFMPCALFSALRVASANDLNPSLSILTISRLHICLC
SPSVKQSYAAKR*

No ORFs were found in reading frame 3.