ORF GI23210

From Metagenes
Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary!


Sequence
CAMERA AccNum : AACY01160370.1
Annotathon code: ORF_GI23210
Sample :
  • GPS :31°10'30n; 64°19'27.6w
  • Sargasso Sea: Sargasso Sea, Station 11 - Bermuda (UK)
  • Open Ocean (-5m, 20.5°C, 0.1-0.8 microns)
Authors
Team : BioCell 2007
Username : gscj
Annotated on : 2008-03-19 18:52:37
  • JACQU0T CAROLINE
  • SAWKA GREGOIRE

Synopsis

Genomic Sequence

>AACY01160370.1 ORF_GI23210 genomic DNA
CTCAAAGCCACAGTGGAGATAGAAATCGACAGCGAGATCGACGCTGCCATCGCGAGTGGCGCTTCTGCTTTGGCGAGCACACAGGAGAAAATGGAAGGCA
TGCAGGCTGCGATATCGGAGGTGTTTGTGCGCCGGCTCATCACGGCCAGGAGGAATAGGGTGAAGCATCTTGCTGTGGCCCTGCTTCGAGTGTGGCAGGA
GGGGGTGCGGCGTCTCCATGACGACTTAGATCGCATGCATCGCCATACGGAGGGCATGCTTCTGGCTGAGCGCAAGGAAGCTGAGCGTGCTAAACAGCTC
CAGGCAGCGGCGCATGCGCAGGTGGATGCGGTGATCACCCCAATGTCGGTCGCGATCGATGCGGCACGGAATAGAGTGGAGCTGCTGGAGTTGTCCGTGG
CCGAGGCTTTCCTGCGTCGCTCGGTAGGAGCTTTGAGGATGGTCGCAAGGCAGCAGCTGCGGGATGCAGTGATGAGCTGGACACGGAATGCGTTCATCAA
CCACACAGGGAAACTGCAGGGTGCTCTGGACGATGTCCAACAGAGACTGTTCCGAACCGAAGCTGAGCTGGTACAAGCAGTTCGCGACTTGTCGGAGAAC
CAAGCGGCGCTGGATGCCATGGAATACATGGCAGCCAGGCTGATGGCTCGAGCGGAAGCGCGGGCAGGGGAAGCGCCGCCCATGTAACAGTAAGGGCACT
GCCACAGGCCCCGGGGGCACCCCGAGGGCAGGCCGCCTGAGGCGCCCCTGGCCTTTGGAGCCAGCCCGGGCAGTACCGCGGTGCCTGCGGCGGCGCTTGG
GCTGGATTAGGCACCATAGCACCCGAGGTTCGGCTTGCCCGCGGGTCGCCCGGGCCACATGTGGGGCCGGGTGGGGGCACACTGGCAGGGCCACTACCCA
CGTACTAACACGTACCCACCACGCTACCCGCGAA

Translation

[129 - 932/934]   indirect strand
>ORF_GI23210 Translation [129-932   indirect strand]
SSPSAAAGTAVLPGLAPKARGASGGLPSGCPRGLWQCPYCYMGGASPARASARAISLAAMYSMASSAAWFSDKSRTACTSSASVRNSLCWTSSRAPCSFP
VWLMNAFRVQLITASRSCCLATILKAPTERRRKASATDNSSSSTLFRAASIATDIGVITASTCACAAAWSCLARSASLRSARSMPSVWRCMRSKSSWRRR
TPSCHTRSRATARCFTLFLLAVMSRRTNTSDIAACMPSIFSCVLAKAEAPLAMAASISLSISISTVAL

[ Warning ] 5' incomplete: does not start with a Methionine
[ Warning ] 3' incomplete: following codon is not a STOP
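
As an illustration of how an indirect-strand translation such as the one above can be reproduced, here is a minimal sketch that reverse-complements a DNA string and translates it with the standard genetic code. The function names (`reverse_complement`, `translate`) and the use of "X" for codons containing ambiguous bases are assumptions made for this example; this is not the code used by the Annotathon pipeline.

```python
# Standard genetic code, built from the usual TCAG ordering of codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Translation table used to complement each base.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement (indirect strand) of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons give 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


# Last 16 bases of the genomic sequence, read on the indirect strand.
tail = "CCACGCTACCCGCGAA"
indirect = reverse_complement(tail)
# Reading frame 2 of the indirect strand starts at its second base.
print(translate(indirect[1:]))
```

Applied to the last 16 bases of the genomic sequence, frame 2 of the indirect strand gives SRVAW, which agrees with the start of the reverse-strand frame 2 translation listed in the ORF finding section.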

Phylogeny


Annotator commentaries

The ORF search on our sequence returned many candidate ORFs (four of them longer than 300 nucleotides) that we could study. However, all blastp searches against the swissprot, nr and environmental databases returned only E-values above 0.5. We therefore arbitrarily chose the longest ORF (the one on the reverse strand, spanning nucleotides 129 to 932).

We first ran blastp on each ORF longer than 300 nucleotides against the swissprot, nr and environmental databases; the best E-value found was about 0.5. We then ran a blastx of our sequence against the environmental database; the best E-value was 0.18. Finally, we ran a tblastx against the environmental database: we recovered our own sequence (E-value 0), and the best E-value among the "homologous" sequences was 0.003. Judging from the tblastx, the ORFs giving the most hits appear to be on the reverse strand. The two ORFs in reading frame 2 on the reverse strand lie directly one after the other, the first ending at 709 and the second starting at 710. It is therefore possible that these two ORFs correspond to a single coding sequence, with the "stop" codon lying in an intron. We also note that there are other ORFs in the other reading frames, so these genes may "overlap". Having found no significant homolog, we can only make these suppositions about the position and number of coding sequences. Given the E-values found, we cannot speak of homologs and therefore cannot build a multiple alignment.

Since we have no significant homologs, we cannot draw conclusions about the function of the proteins that might be encoded by our sequence. Moreover, the InterPro search for conserved domains returned no result. Without the function, we cannot know which processes these proteins might be involved in. Likewise, since we cannot build a multiple alignment, we cannot obtain any information about the taxonomy of this sequence.

Finally, it seems important to ask why this sequence has no known homolog. Given the length of the ORFs, it seems unlikely that it is non-coding. We can therefore suppose that it comes from an organism that has not yet been studied, or from an organism whose genome has not been completely sequenced.

Multiple Alignment


BLAST

NCBI/blastp of the longest ORF against the nr database

No significant similarity found

---------------------------------------------------------------------------------------------------
NCBI/blastp of the longest ORF against the environmental database
                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|EDJ35159.1|  hypothetical protein GOS_1710817 [marine metageno  32.1    9.7  


>gb|EDJ35159.1|  hypothetical protein GOS_1710817 [marine metagenome]
Length=447

 Score = 32.1 bits (78),  Expect = 9.7, Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 10/80 (12%)

Query  23  SGGLPSGCPRGLWQCPYCYMGGASPARASARAISLAAMYSMASSAAWFSDKSRTACTSSA  82
           + GLP+G    L  CPY     ASP R S+R +      S ++S +  +  SRT   +SA
Sbjct  16  TSGLPNGSMSSLILCPYVE---ASPIRRSSRPLP-----SCSASPSRTASGSRTPKRASA  67

Query  83  SVRNSLC--WTSSRAPCSFP  100
                 C  W  +  PCS P
Sbjct  68  PFAERTCRRWALATRPCSRP  87
---------------------------------------------------------------------------------------------------

NCBI/tblastx against the environmental database

                                                                  Score     E
Sequences producing significant alignments:                       (Bits)  Value  N

gb|AACY024086371.1|  Marine metagenome 1096626585910, whole genom   576    0.0    1
gb|AACY020295314.1|  Marine metagenome 1096626371890, whole genom  35.9    0.003  1
gb|ABEF01052632.1|  Marine metagenome HOTS_Contig52632, whole ...  36.3    0.014  1
gb|AACY023531942.1|  Marine metagenome ctg_1101668339293, whol...  29.5    0.018  1
gb|AACY022922012.1|  Marine metagenome ctg_1101667329363, whol...  26.3    0.031  1
gb|AACY021186073.1|  Marine metagenome 93604, whole genome shotgu  31.8    0.039  1
gb|AACY022981631.1|  Marine metagenome ctg_1101667388982, whol...  30.4    0.052  1
gb|AACY023457709.1|  Marine metagenome ctg_1101668265060, whol...  30.4    0.063  1
gb|AACY021675922.1|  Marine metagenome 1091139191823, whole genom  28.1    0.098  1
gb|AACY020309644.1|  Marine metagenome 1096626387470, whole genom  32.7    0.10   1
gb|AACY023825481.1|  Marine metagenome ctg_1101668632832, whol...  32.2    0.19   1
gb|AACY023184397.1|  Marine metagenome ctg_1101667591748, whol...  32.2    0.19   1
gb|AACY023882931.1|  Marine metagenome ctg_1101668690282, whol...  41.4    0.24   1
gb|AACY020039396.1|  Marine metagenome 1096626035322, whole genom  40.9    0.34   1
gb|AACY022763425.1|  Marine metagenome ctg_1101667170776, whol...  26.3    0.36   1
gb|AACY020773605.1|  Marine metagenome 1095525024050, whole genom  26.7    0.36   1
gb|AACY023793964.1|  Marine metagenome ctg_1101668601315, whol...  32.2    0.39   1
gb|AACY023784501.1|  Marine metagenome ctg_1101668591852, whol...  33.1    0.43   1
gb|AAQK01003041.1|  Metagenome sequence s7_164594, whole genome s  40.5    0.46   1
gb|AACY020307128.1|  Marine metagenome 1096626384760, whole genom  34.1    0.48   1
gb|AACY020680746.1|  Marine metagenome 1093023012299, whole genom  26.7    0.51   1
gb|AACY021381116.1|  Marine metagenome 2131679, whole genome shot  25.8    0.56   1
gb|AACY021337408.1|  Marine metagenome 1092963122981, whole genom  40.0    0.63   1
gb|AACY023841669.1|  Marine metagenome ctg_1101668649020, whol...  33.6    0.64   1
gb|AACY020531545.1|  Marine metagenome 1096626703386, whole genom  28.1    0.64   1
gb|AACY020206491.1|  Marine metagenome 1096626239574, whole genom  28.6    0.66   1
gb|AACY023793461.1|  Marine metagenome ctg_1101668600812, whol...  32.2    0.74   1
gb|AACY022440642.1|  Marine metagenome 1944563, whole genome shot  24.9    0.77   1
gb|AACY023819479.1|  Marine metagenome ctg_1101668626830, whol...  30.9    0.84   1
gb|AACY020827003.1|  Marine metagenome 1092351031247, whole genom  39.6    0.87   1
gb|AACY023825906.1|  Marine metagenome ctg_1101668633257, whol...  39.6    0.87   1
gb|AACY024061079.1|  Marine metagenome 1096626463360, whole genom  39.6    0.87   1
gb|AAFX01012076.1|  Metagenome sequence XZS60012.b1, whole genome  39.6    0.87   1
gb|AACY023822197.1|  Marine metagenome ctg_1101668629548, whol...  32.2    0.88   1
gb|AAFX01031915.1|  Metagenome sequence XZS41959.g1, whole genome  29.0    0.88   1
gb|AACY020305912.1|  Marine metagenome 1096626827103, whole genom  28.1    0.94   1
gb|AACY020532923.1|  Marine metagenome 1096626838888, whole genom  29.0    1.0    1
gb|AACY024047586.1|  Marine metagenome 1096626385821, whole genom  24.4    1.0    1
gb|AACY023357305.1|  Marine metagenome ctg_1101668164656, whol...  38.6    1.1    1
gb|AACY023841386.1|  Marine metagenome ctg_1101668648737, whol...  32.7    1.1    1
gb|AAFX01106373.1|  Metagenome sequence XZS9827.x1, whole genome   35.9    1.2    1
gb|AACY020375099.1|  Marine metagenome 1096626471710, whole genom  29.5    1.2    1
gb|AACY020908331.1|  Marine metagenome 1092959543256, whole genom  31.8    1.2    1
gb|AACY024110437.1|  Marine metagenome 1096626708522, whole genom  26.3    1.2    1
gb|AAFY01001145.1|  Metagenome sequence 3634298_fasta.screen.C...  24.9    1.2    1
gb|AACY020190057.1|  Marine metagenome 1096626221497, whole genom  39.1    1.2    1
gb|AACY022392988.1|  Marine metagenome 1093018929310, whole genom  39.1    1.2    1
gb|AACY022633373.1|  Marine metagenome ctg_1101667040724, whol...  39.1    1.2    1
gb|AAFX01073512.1|  Metagenome sequence XZS45883.g2, whole genome  27.6    1.2    1
gb|AACY020522256.1|  Marine metagenome 1096626797373, whole genom  30.9    1.2    1
gb|AACY020561146.1|  Marine metagenome 1096626843446, whole genom  36.8    1.3    1
gb|AATN01000285.1|  Metagenome sequence ctg11312, whole genome sh  29.5    1.4    1
gb|AACY020533759.1|  Marine metagenome 1096626839062, whole genom  35.9    1.6    1
gb|AACY023821447.1|  Marine metagenome ctg_1101668628798, whol...  31.3    1.6    1
gb|AACY020293731.1|  Marine metagenome 1096626370028, whole genom  30.4    1.6    1
gb|AACY020952346.1|  Marine metagenome 2050247, whole genome shot  29.0    1.6    1
gb|AACY023827795.1|  Marine metagenome ctg_1101668635146, whol...  38.6    1.6    1
gb|ABEF01049337.1|  Marine metagenome HOTS_Contig49337, whole ...  38.6    1.6    1
gb|AACY020561147.1|  Marine metagenome 1096626843447, whole genom  38.6    1.6    1
gb|AACY020558576.1|  Marine metagenome 1096626802135, whole genom  25.8    1.6    1
gb|AACY021894716.1|  Marine metagenome 1093018252535, whole genom  35.0    1.7    1
gb|AACY020165979.1|  Marine metagenome 1096626189770, whole genom  28.6    1.7    1
gb|AACY021155916.1|  Marine metagenome 1092351087023, whole genom  35.9    1.8    1
gb|AACY020540083.1|  Marine metagenome 1096626799569, whole genom  34.5    1.8    1
gb|AACY023184364.1|  Marine metagenome ctg_1101667591715, whol...  29.9    1.8    1
gb|AACY023843661.1|  Marine metagenome ctg_1101668651012, whol...  27.6    1.8    1
gb|AACY020365125.1|  Marine metagenome 1096626455721, whole genom  33.1    1.9    1
gb|AACY023824327.1|  Marine metagenome ctg_1101668631678, whol...  32.7    1.9    1
gb|AACY020296735.1|  Marine metagenome 1096626826190, whole genom  29.0    1.9    1
gb|AACY022908856.1|  Marine metagenome ctg_1101667316207, whol...  24.4    2.0    1
gb|AACY021595285.1|  Marine metagenome 1091138174446, whole genom  28.6    2.0    1
gb|AAFX01058439.1|  Metagenome sequence XZS60603.b1, whole genome  31.3    2.1    1
gb|AACY023827752.1|  Marine metagenome ctg_1101668635103, whol...  28.1    2.1    1
gb|AACY020941484.1|  Marine metagenome 3048564, whole genome shot  35.0    2.1    1
gb|AASZ01004906.1|  Metagenome sequence GutlessWorm_Cont4906, ...  28.6    2.2    1
gb|AACY020922979.1|  Marine metagenome 2045553, whole genome shot  32.2    2.2    1
gb|AACY022467161.1|  Marine metagenome 1953017, whole genome shot  28.6    2.2    1
gb|AACY022239400.1|  Marine metagenome 879230, whole genome shotg  28.1    2.2    1
gb|AACY022441969.1|  Marine metagenome 1092405944945, whole genom  29.0    2.2    1
gb|ABEF01031814.1|  Marine metagenome HOTS_Contig31814, whole ...  38.2    2.2    1
emb|CAAN02150206.1|  Fossil metagenome sequence DSASCWG02FL1LD...  38.2    2.2    1
gb|AACY020173760.1|  Marine metagenome 1096626816631, whole genom  38.2    2.3    1
gb|AACY020367955.1|  Marine metagenome 1096626460158, whole genom  38.2    2.3    1
gb|AACY022446479.1|  Marine metagenome 1446362, whole genome shot  38.2    2.3    1
gb|AACY023842309.1|  Marine metagenome ctg_1101668649660, whol...  38.2    2.3    1
gb|AACY022021326.1|  Marine metagenome 291658, whole genome shotg  30.9    2.3    1
gb|AAFX01114216.1|  Metagenome sequence 2662324_fasta.screen.C...  25.4    2.4    1
gb|AACY021501469.1|  Marine metagenome 1095522155525, whole genom  26.3    2.5    1
gb|AACY020312781.1|  Marine metagenome 1096626390895, whole genom  29.9    2.5    1
gb|AAFX01047177.1|  Metagenome sequence XZS47453.g1, whole genome  30.9    2.6    1
gb|AACY023328005.1|  Marine metagenome ctg_1101668135356, whol...  26.3    2.6    1
gb|AACY020252531.1|  Marine metagenome 1096626289862, whole genom  29.5    2.9    1
gb|AACY023802089.1|  Marine metagenome ctg_1101668609440, whol...  32.2    2.9    1
gb|AACY023443825.1|  Marine metagenome ctg_1101668251176, whol...  31.8    2.9    1
gb|AACY023831104.1|  Marine metagenome ctg_1101668638455, whol...  28.6    2.9    1
gb|AAFX01075041.1|  Metagenome sequence XZS50632.y1, whole genome  31.3    3.0    1
gb|AACY024004432.1|  Marine metagenome 1096626188131, whole genom  30.4    3.0    1
gb|AACY022912078.1|  Marine metagenome ctg_1101667319429, whol...  30.9    3.0    1
gb|AASZ01001454.1|  Metagenome sequence GutlessWorm_Cont1454, ...  37.3    3.0    1
gb|AACY020804650.1|  Marine metagenome 1095295028369, whole genom  31.8    3.0    1
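
A hit list like the one above can be screened against an E-value cutoff with a few lines of code. The sketch below assumes the tblastx summary layout shown here (identifier first, then description, then bit score, E-value and N as the last three columns). The cutoff of 1e-5 is an illustrative assumption, not a value taken from the annotation, and `parse_hit` and `significant` are hypothetical helper names.

```python
def parse_hit(line):
    """Parse one tblastx summary line into (accession, bit_score, e_value).

    Assumes the last three whitespace-separated columns are score, E-value, N.
    """
    tokens = line.split()
    accession = tokens[0].split("|")[1]
    return accession, float(tokens[-3]), float(tokens[-2])


def significant(hits, max_evalue=1e-5):
    """Keep hits whose E-value is at or below the (assumed) cutoff."""
    return [hit for hit in hits if hit[2] <= max_evalue]


# Three summary lines copied from the table above.
example_lines = [
    "gb|AACY024086371.1|  Marine metagenome 1096626585910, whole genom   576    0.0    1",
    "gb|AACY020295314.1|  Marine metagenome 1096626371890, whole genom  35.9    0.003  1",
    "gb|ABEF01052632.1|  Marine metagenome HOTS_Contig52632, whole ...  36.3    0.014  1",
]

hits = [parse_hit(line) for line in example_lines]
kept = significant(hits)
for accession, score, evalue in kept:
    print(accession, score, evalue)
```

With this cutoff only the 100% identical hit remains, which the annotators identify as their own sequence. This matches their conclusion that no significant homolog was found.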



First 10 sequences:


>gb|AACY024086371.1|  Marine metagenome 1096626585910, whole genome shotgun sequence
Length=935

 Score =  576 bits (1252),  Expect = 0.0
 Identities = 228/228 (100%), Positives = 228/228 (100%), Gaps = 0/228 (0%)
 Frame = -2/-3

Query  684  HGRRFPCPRFRSSHQPGCHVFHGIQRRLVLRQVANCLYQLSFGSEQSLLDIVQSTLQFPC  505
            HGRRFPCPRFRSSHQPGCHVFHGIQRRLVLRQVANCLYQLSFGSEQSLLDIVQSTLQFPC
Sbjct  684  HGRRFPCPRFRSSHQPGCHVFHGIQRRLVLRQVANCLYQLSFGSEQSLLDIVQSTLQFPC  505

Query  504  VVDERIPCPAHHCIPQLLPCDHPQSSYRATQESLGHGQLQQLHSIPCRIDRDRHWGDHRI  325
            VVDERIPCPAHHCIPQLLPCDHPQSSYRATQESLGHGQLQQLHSIPCRIDRDRHWGDHRI
Sbjct  504  VVDERIPCPAHHCIPQLLPCDHPQSSYRATQESLGHGQLQQLHSIPCRIDRDRHWGDHRI  325

Query  324  HLRMRRCLELFSTLSFLALSQKHALRMAMHAI*VVMETPHPLLPHSKQGHSKMLHPIPPG  145
            HLRMRRCLELFSTLSFLALSQKHALRMAMHAI*VVMETPHPLLPHSKQGHSKMLHPIPPG
Sbjct  324  HLRMRRCLELFSTLSFLALSQKHALRMAMHAI*VVMETPHPLLPHSKQGHSKMLHPIPPG  145

Query  144  RDEPAHKHLRYRSLHAFHFLLCARQSRSATRDGSVDLAVDFYLHCGFE  1
            RDEPAHKHLRYRSLHAFHFLLCARQSRSATRDGSVDLAVDFYLHCGFE
Sbjct  144  RDEPAHKHLRYRSLHAFHFLLCARQSRSATRDGSVDLAVDFYLHCGFE  1


 Score =  177 bits (381),  Expect = 0.0
 Identities = 73/73 (100%), Positives = 73/73 (100%), Gaps = 0/73 (0%)
 Frame = -3/-1

Query  893  WPCQCAPTRPHMWPGRPAGKPNLGCYGa*sspsaaAGTAVLPGLAPKARGASGGLPSGCP  714
            WPCQCAPTRPHMWPGRPAGKPNLGCYGA*SSPSAAAGTAVLPGLAPKARGASGGLPSGCP
Sbjct  893  WPCQCAPTRPHMWPGRPAGKPNLGCYGA*SSPSAAAGTAVLPGLAPKARGASGGLPSGCP  714

Query  713  RGLWQCPYCYMGG  675
            RGLWQCPYCYMGG
Sbjct  713  RGLWQCPYCYMGG  675


 Score = 27.6 bits (54),  Expect = 0.0
 Identities = 11/11 (100%), Positives = 11/11 (100%), Gaps = 0/11 (0%)
 Frame = -1/-2

Query  931  AGSVVGTC*YV  899
            AGSVVGTC*YV
Sbjct  931  AGSVVGTC*YV  899


 Score =  316 bits (684),  Expect = 0.0
 Identities = 119/119 (100%), Positives = 119/119 (100%), Gaps = 0/119 (0%)
 Frame = +2/+2

Query  338  PQCRSRSMRHGIEWSCWSCPWPRLSCVAR*EL*GWSQGSSCGMQ**AGHGMRSSTTQGNC  517
            PQCRSRSMRHGIEWSCWSCPWPRLSCVAR*EL*GWSQGSSCGMQ**AGHGMRSSTTQGNC
Sbjct  338  PQCRSRSMRHGIEWSCWSCPWPRLSCVAR*EL*GWSQGSSCGMQ**AGHGMRSSTTQGNC  517

Query  518  RVLWTMSNRDCSEPKLSWYKQFATCRRTKRRWMPWNTWQPG*WLERKRGQGKRRPCNSK  694
            RVLWTMSNRDCSEPKLSWYKQFATCRRTKRRWMPWNTWQPG*WLERKRGQGKRRPCNSK
Sbjct  518  RVLWTMSNRDCSEPKLSWYKQFATCRRTKRRWMPWNTWQPG*WLERKRGQGKRRPCNSK  694


 Score =  162 bits (349),  Expect = 0.0
 Identities = 61/61 (100%), Positives = 61/61 (100%), Gaps = 0/61 (0%)
 Frame = +2/+2

Query  749  WPLEPARAVPRCLRRRLGWIRHHSTRGSACPRVARATCGAGWGHTGRATTHVLTRTHHAT  928
            WPLEPARAVPRCLRRRLGWIRHHSTRGSACPRVARATCGAGWGHTGRATTHVLTRTHHAT
Sbjct  749  WPLEPARAVPRCLRRRLGWIRHHSTRGSACPRVARATCGAGWGHTGRATTHVLTRTHHAT  928

Query  929  R  931
            R
Sbjct  929  R  931


 Score =  113 bits (242),  Expect = 0.0
 Identities = 54/54 (100%), Positives = 54/54 (100%), Gaps = 0/54 (0%)
 Frame = +3/+3

Query  3    QSHSGDrnrqrdrrchrEWRFCFGEHTGENGRHAGCDIGGVCAPAHHGQEE*GE  164
            QSHSGDRNRQRDRRCHREWRFCFGEHTGENGRHAGCDIGGVCAPAHHGQEE*GE
Sbjct  3    QSHSGDRNRQRDRRCHREWRFCFGEHTGENGRHAGCDIGGVCAPAHHGQEE*GE  164


 Score =  107 bits (229),  Expect = 0.0
 Identities = 42/42 (100%), Positives = 42/42 (100%), Gaps = 0/42 (0%)
 Frame = +2/+2

Query  170  LLWPCFECGRRGCGVSMTT*IACIAIRRACFWLSARKLSVLN  295
            LLWPCFECGRRGCGVSMTT*IACIAIRRACFWLSARKLSVLN
Sbjct  170  LLWPCFECGRRGCGVSMTT*IACIAIRRACFWLSARKLSVLN  295


 Score =  363 bits (788),  Expect = 4e-152
 Identities = 179/179 (100%), Positives = 179/179 (100%), Gaps = 0/179 (0%)
 Frame = +1/+1

Query  79   TQEKMEGMQAAISEVFVRRLITARRNRVKHLAVALLRVWQEGVrrlhddldrmhrhTEGM  258
            TQEKMEGMQAAISEVFVRRLITARRNRVKHLAVALLRVWQEGVRRLHDDLDRMHRHTEGM
Sbjct  79   TQEKMEGMQAAISEVFVRRLITARRNRVKHLAVALLRVWQEGVRRLHDDLDRMHRHTEGM  258

Query  259  LLAERKEAERAKQLQAAAHAQVDAVITPMSVAIDAARNRVELLELSVAEAFLRRSVGALR  438
            LLAERKEAERAKQLQAAAHAQVDAVITPMSVAIDAARNRVELLELSVAEAFLRRSVGALR
Sbjct  259  LLAERKEAERAKQLQAAAHAQVDAVITPMSVAIDAARNRVELLELSVAEAFLRRSVGALR  438

Query  439  MVARQQLRDAVMSWTRNAFINHTGKLQGALDDVQQRLFRTEAELVQAVRDLSENQAALD  615
            MVARQQLRDAVMSWTRNAFINHTGKLQGALDDVQQRLFRTEAELVQAVRDLSENQAALD
Sbjct  439  MVARQQLRDAVMSWTRNAFINHTGKLQGALDDVQQRLFRTEAELVQAVRDLSENQAALD  615


 Score = 82.2 bits (173),  Expect = 4e-152
 Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
 Frame = +2/+2

Query  2   SKPQWR*KSTARSTLPSRVALLLWRAHRRKWKA  100
           SKPQWR*KSTARSTLPSRVALLLWRAHRRKWKA
Sbjct  2   SKPQWR*KSTARSTLPSRVALLLWRAHRRKWKA  100


 Score = 77.6 bits (163),  Expect = 4e-152
 Identities = 29/29 (100%), Positives = 29/29 (100%), Gaps = 0/29 (0%)
 Frame = +3/+3

Query  756  WSQPGQYRGACGGAWAGLGTIAPEVRLAR  842
            WSQPGQYRGACGGAWAGLGTIAPEVRLAR
Sbjct  756  WSQPGQYRGACGGAWAGLGTIAPEVRLAR  842



ORF finding

SMS ORFfinder/any codons/frames 1,2,3/minimum length: 60 codons/direct/standard code



>ORF number 1 in reading frame 1 on the direct strand extends from base 1 to base 687.
CTCAAAGCCACAGTGGAGATAGAAATCGACAGCGAGATCGACGCTGCCATCGCGAGTGGC
GCTTCTGCTTTGGCGAGCACACAGGAGAAAATGGAAGGCATGCAGGCTGCGATATCGGAG
GTGTTTGTGCGCCGGCTCATCACGGCCAGGAGGAATAGGGTGAAGCATCTTGCTGTGGCC
CTGCTTCGAGTGTGGCAGGAGGGGGTGCGGCGTCTCCATGACGACTTAGATCGCATGCAT
CGCCATACGGAGGGCATGCTTCTGGCTGAGCGCAAGGAAGCTGAGCGTGCTAAACAGCTC
CAGGCAGCGGCGCATGCGCAGGTGGATGCGGTGATCACCCCAATGTCGGTCGCGATCGAT
GCGGCACGGAATAGAGTGGAGCTGCTGGAGTTGTCCGTGGCCGAGGCTTTCCTGCGTCGC
TCGGTAGGAGCTTTGAGGATGGTCGCAAGGCAGCAGCTGCGGGATGCAGTGATGAGCTGG
ACACGGAATGCGTTCATCAACCACACAGGGAAACTGCAGGGTGCTCTGGACGATGTCCAA
CAGAGACTGTTCCGAACCGAAGCTGAGCTGGTACAAGCAGTTCGCGACTTGTCGGAGAAC
CAAGCGGCGCTGGATGCCATGGAATACATGGCAGCCAGGCTGATGGCTCGAGCGGAAGCG
CGGGCAGGGGAAGCGCCGCCCATGTAA

>Translation of ORF number 1 in reading frame 1 on the direct strand.
LKATVEIEIDSEIDAAIASGASALASTQEKMEGMQAAISEVFVRRLITARRNRVKHLAVA
LLRVWQEGVRRLHDDLDRMHRHTEGMLLAERKEAERAKQLQAAAHAQVDAVITPMSVAID
AARNRVELLELSVAEAFLRRSVGALRMVARQQLRDAVMSWTRNAFINHTGKLQGALDDVQ
QRLFRTEAELVQAVRDLSENQAALDAMEYMAARLMARAEARAGEAPPM*

>ORF number 1 in reading frame 2 on the direct strand extends from base 644 to base 934.
TGGCTCGAGCGGAAGCGCGGGCAGGGGAAGCGCCGCCCATGTAACAGTAAGGGCACTGCC
ACAGGCCCCGGGGGCACCCCGAGGGCAGGCCGCCTGAGGCGCCCCTGGCCTTTGGAGCCA
GCCCGGGCAGTACCGCGGTGCCTGCGGCGGCGCTTGGGCTGGATTAGGCACCATAGCACC
CGAGGTTCGGCTTGCCCGCGGGTCGCCCGGGCCACATGTGGGGCCGGGTGGGGGCACACT
GGCAGGGCCACTACCCACGTACTAACACGTACCCACCACGCTACCCGCGAA

>Translation of ORF number 1 in reading frame 2 on the direct strand.
WLERKRGQGKRRPCNSKGTATGPGGTPRAGRLRRPWPLEPARAVPRCLRRRLGWIRHHST
RGSACPRVARATCGAGWGHTGRATTHVLTRTHHATRE

>ORF number 1 in reading frame 3 on the direct strand extends from base 375 to base 566.
AGTGGAGCTGCTGGAGTTGTCCGTGGCCGAGGCTTTCCTGCGTCGCTCGGTAGGAGCTTT
GAGGATGGTCGCAAGGCAGCAGCTGCGGGATGCAGTGATGAGCTGGACACGGAATGCGTT
CATCAACCACACAGGGAAACTGCAGGGTGCTCTGGACGATGTCCAACAGAGACTGTTCCG
AACCGAAGCTGA

>Translation of ORF number 1 in reading frame 3 on the direct strand.
SGAAGVVRGRGFPASLGRSFEDGRKAAAAGCSDELDTECVHQPHRETAGCSGRCPTETVP
NRS*

---------------------------------------------------------------------------------------------------

SMS ORFfinder/any codons/frames 1,2,3/minimum length: 60 codons/reverse/standard code

>ORF number 1 in reading frame 1 on the reverse strand extends from base 31 to base 438.
TACGTGGGTAGTGGCCCTGCCAGTGTGCCCCCACCCGGCCCCACATGTGGCCCGGGCGAC
CCGCGGGCAAGCCGAACCTCGGGTGCTATGGTGCCTAATCCAGCCCAAGCGCCGCCGCAG
GCACCGCGGTACTGCCCGGGCTGGCTCCAAAGGCCAGGGGCGCCTCAGGCGGCCTGCCCT
CGGGGTGCCCCCGGGGCCTGTGGCAGTGCCCTTACTGTTACATGGGCGGCGCTTCCCCTG
CCCGCGCTTCCGCTCGAGCCATCAGCCTGGCTGCCATGTATTCCATGGCATCCAGCGCCG
CTTGGTTCTCCGACAAGTCGCGAACTGCTTGTACCAGCTCAGCTTCGGTTCGGAACAGTC
TCTGTTGGACATCGTCCAGAGCACCCTGCAGTTTCCCTGTGTGGTTGA

>Translation of ORF number 1 in reading frame 1 on the reverse strand.
YVGSGPASVPPPGPTCGPGDPRASRTSGAMVPNPAQAPPQAPRYCPGWLQRPGAPQAACP
RGAPGACGSALTVTWAALPLPALPLEPSAWLPCIPWHPAPLGSPTSRELLVPAQLRFGTV
SVGHRPEHPAVSLCG*

>ORF number 1 in reading frame 2 on the reverse strand extends from base 2 to base 709.
TCGCGGGTAGCGTGGTGGGTACGTGTTAGTACGTGGGTAGTGGCCCTGCCAGTGTGCCCC
CACCCGGCCCCACATGTGGCCCGGGCGACCCGCGGGCAAGCCGAACCTCGGGTGCTATGG
TGCCTAATCCAGCCCAAGCGCCGCCGCAGGCACCGCGGTACTGCCCGGGCTGGCTCCAAA
GGCCAGGGGCGCCTCAGGCGGCCTGCCCTCGGGGTGCCCCCGGGGCCTGTGGCAGTGCCC
TTACTGTTACATGGGCGGCGCTTCCCCTGCCCGCGCTTCCGCTCGAGCCATCAGCCTGGC
TGCCATGTATTCCATGGCATCCAGCGCCGCTTGGTTCTCCGACAAGTCGCGAACTGCTTG
TACCAGCTCAGCTTCGGTTCGGAACAGTCTCTGTTGGACATCGTCCAGAGCACCCTGCAG
TTTCCCTGTGTGGTTGATGAACGCATTCCGTGTCCAGCTCATCACTGCATCCCGCAGCTG
CTGCCTTGCGACCATCCTCAAAGCTCCTACCGAGCGACGCAGGAAAGCCTCGGCCACGGA
CAACTCCAGCAGCTCCACTCTATTCCGTGCCGCATCGATCGCGACCGACATTGGGGTGAT
CACCGCATCCACCTGCGCATGCGCCGCTGCCTGGAGCTGTTTAGCACGCTCAGCTTCCTT
GCGCTCAGCCAGAAGCATGCCCTCCGTATGGCGATGCATGCGATCTAA

>Translation of ORF number 1 in reading frame 2 on the reverse strand.
SRVAWWVRVSTWVVALPVCPHPAPHVARATRGQAEPRVLWCLIQPKRRRRHRGTARAGSK
GQGRLRRPALGVPPGPVAVPLLLHGRRFPCPRFRSSHQPGCHVFHGIQRRLVLRQVANCL
YQLSFGSEQSLLDIVQSTLQFPCVVDERIPCPAHHCIPQLLPCDHPQSSYRATQESLGHG
QLQQLHSIPCRIDRDRHWGDHRIHLRMRRCLELFSTLSFLALSQKHALRMAMHAI*

>ORF number 2 in reading frame 2 on the reverse strand extends from base 710 to base 934.
GTCGTCATGGAGACGCCGCACCCCCTCCTGCCACACTCGAAGCAGGGCCACAGCAAGATG
CTTCACCCTATTCCTCCTGGCCGTGATGAGCCGGCGCACAAACACCTCCGATATCGCAGC
CTGCATGCCTTCCATTTTCTCCTGTGTGCTCGCCAAAGCAGAAGCGCCACTCGCGATGGC
AGCGTCGATCTCGCTGTCGATTTCTATCTCCACTGTGGCTTTGAG

>Translation of ORF number 2 in reading frame 2 on the reverse strand.
VVMETPHPLLPHSKQGHSKMLHPIPPGRDEPAHKHLRYRSLHAFHFLLCARQSRSATRDG
SVDLAVDFYLHCGFE

>ORF number 1 in reading frame 3 on the reverse strand extends from base 129 to base 932.
TCCAGCCCAAGCGCCGCCGCAGGCACCGCGGTACTGCCCGGGCTGGCTCCAAAGGCCAGG
GGCGCCTCAGGCGGCCTGCCCTCGGGGTGCCCCCGGGGCCTGTGGCAGTGCCCTTACTGT
TACATGGGCGGCGCTTCCCCTGCCCGCGCTTCCGCTCGAGCCATCAGCCTGGCTGCCATG
TATTCCATGGCATCCAGCGCCGCTTGGTTCTCCGACAAGTCGCGAACTGCTTGTACCAGC
TCAGCTTCGGTTCGGAACAGTCTCTGTTGGACATCGTCCAGAGCACCCTGCAGTTTCCCT
GTGTGGTTGATGAACGCATTCCGTGTCCAGCTCATCACTGCATCCCGCAGCTGCTGCCTT
GCGACCATCCTCAAAGCTCCTACCGAGCGACGCAGGAAAGCCTCGGCCACGGACAACTCC
AGCAGCTCCACTCTATTCCGTGCCGCATCGATCGCGACCGACATTGGGGTGATCACCGCA
TCCACCTGCGCATGCGCCGCTGCCTGGAGCTGTTTAGCACGCTCAGCTTCCTTGCGCTCA
GCCAGAAGCATGCCCTCCGTATGGCGATGCATGCGATCTAAGTCGTCATGGAGACGCCGC
ACCCCCTCCTGCCACACTCGAAGCAGGGCCACAGCAAGATGCTTCACCCTATTCCTCCTG
GCCGTGATGAGCCGGCGCACAAACACCTCCGATATCGCAGCCTGCATGCCTTCCATTTTC
TCCTGTGTGCTCGCCAAAGCAGAAGCGCCACTCGCGATGGCAGCGTCGATCTCGCTGTCG
ATTTCTATCTCCACTGTGGCTTTG

>Translation of ORF number 1 in reading frame 3 on the reverse strand.
SSPSAAAGTAVLPGLAPKARGASGGLPSGCPRGLWQCPYCYMGGASPARASARAISLAAM
YSMASSAAWFSDKSRTACTSSASVRNSLCWTSSRAPCSFPVWLMNAFRVQLITASRSCCL
ATILKAPTERRRKASATDNSSSSTLFRAASIATDIGVITASTCACAAAWSCLARSASLRS
ARSMPSVWRCMRSKSSWRRRTPSCHTRSRATARCFTLFLLAVMSRRTNTSDIAACMPSIF
SCVLAKAEAPLAMAASISLSISISTVAL
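
The "any codons" search reported above treats every stretch between two stop codons (or between a sequence end and a stop) as an ORF, whatever its first codon. The sketch below illustrates that idea for the three frames of one strand. It is an assumption-laden approximation, not the SMS ORF Finder implementation: the function name `find_orfs`, the 1-based inclusive coordinates and the choice to count the terminating stop codon in the ORF length are all choices made for this example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard code


def find_orfs(seq, min_codons=60):
    """List (frame, start, end) for stop-free stretches on one strand.

    Coordinates are 1-based and inclusive. An ORF closed by a stop codon
    includes that stop; a stretch running to the sequence end has none.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = frame  # 0-based start of the current open stretch
        pos = frame
        while pos + 3 <= len(seq):
            if seq[pos:pos + 3] in STOP_CODONS:
                if (pos + 3 - start) // 3 >= min_codons:
                    orfs.append((frame + 1, start + 1, pos + 3))
                start = pos + 3  # next stretch opens after the stop
            pos += 3
        # Stretch still open when the sequence ends (3' incomplete).
        if (pos - start) // 3 >= min_codons:
            orfs.append((frame + 1, start + 1, pos))
    return orfs


# Toy sequence with a lowered threshold so the short stretches are reported.
toy = "ATGAAATAACCCGGG"
for frame, start, end in find_orfs(toy, min_codons=2):
    print(f"frame {frame}: bases {start} to {end}")
```

To scan the reverse strand with the same function, pass it the reverse complement of the sequence; the coordinates returned are then relative to that reverse-complemented string.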