ORF VL16600

From Metagenes
Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary!


Sequence
CAMERA AccNum : AACY01160021.1
Annotathon code: ORF_VL16600
Sample :
  • GPS :31°10'30n; 64°19'27.6w
  • Sargasso Sea: Sargasso Sea, Station 11 - Bermuda (UK)
  • Open Ocean (-5m, 20.5°C, 0.1-0.8 microns)
Authors
Team : BioCell 2006
Username : maxvaness
Annotated on : 2008-03-19 18:52:37
  • ASSOUS maxime
  • SEKATCHEFF vanessa

Synopsis

Genomic Sequence

>AACY01160021.1 ORF_VL16600 genomic DNA
GACAATTTATAGTTCCCTCTGCATATCCTCTGTACCATAATATACTATTTTCATCTAAAAATAATTCCTGAAACTCTTTACTATCTTCAAGATTTATATT
TCCAGACAAATATTCATGCATAGTTTTATTGATATAGTCTATGGTATATTTTTTAGAAAGATAAGGTCTTATTCCACCATGTACAAAAATCCATGGCCCA
ATCTTGTAGATTGCTTTAAAAAGTTTAGTCATTGAAATAGCCATCTTATTACCTGGTTTATAAAATTTCTTTCTTTTTTTTTTTGTACCAAATTTTTTGA
TGCCTAATTTACTACAATAATCATATACGCCTTGTAAATTTAACATTTCATGATTACCAATAATACATATTACATTACCCTTATTTTTTATCGCTTGTTT
TTTTAATCTTTTGAATAAATAAATTATTTTTATTTCCGAGTCTTCATCACCATATGTAACATCACGGCCTCCTCTATCTAAAATATCACCCATTTGAATA
AGAATGGTATTTTTACCAATCCAATTTAAATTTTTATTGATTATATTTACTAATTGTAAAGATTTTATAGTGGCTACATAGTCTCCATGTACATCGCCAA
TAGCTACAATTCGTTGCTTTAGAGTTTTTTTCTCGGCTACATTGATAACTTTTGACATATCTATAATATATATAATATCATAGATATATATTATAGTTTA
TAAATCTTATTTACTACTTAATTGGATTCAATAATAATGTTCTGAGAGCTTTCGTCTGCAACTGTTTCATTATTCGTATCATTTCTATTTTTATTGTAAT
TTTCAAAAAAAAATTTTATTTCGTTCTTCAATAATTTGTTTCTTTGATTTAA

Translation

[156 - 851/852]   indirect strand
>ORF_VL16600 Translation [156-851   indirect strand]
TIIYIYDIIYIIDMSKVINVAEKKTLKQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDEDSEIKIIYLFKRLKK
QAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKKKRKKFYKPGNKMAISMTKLFKAIYKIGPWIFVHGGIRPYLSKKYTIDYINKTMHEYLSGN
INLEDSKEFQELFLDENSILWYRGYAEGTINC

[ Warning ] 5' incomplete: does not start with a Methionine
[ Warning ] 3' incomplete: following codon is not a STOP
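
The translation and the two warnings above can be reproduced with a short script. This is only a sketch: it assumes the standard genetic code and assumes that the [156-851] coordinates are counted on the reverse-complemented contig (which is consistent with the first and last residues shown above). The function names are illustrative and are not part of SMS or Annotathon.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the indirect (reverse-complemented) strand of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def orf_warnings(protein, next_codon):
    """Mimic the two completeness warnings (assumed rules, not the site's code)."""
    warnings = []
    if not protein.startswith("M"):
        warnings.append("5' incomplete: does not start with a Methionine")
    if CODON_TABLE.get(next_codon.upper()) != "*":
        warnings.append("3' incomplete: following codon is not a STOP")
    return warnings


# Toy check on the first 13 nt of the contig, which carry the end of the ORF
# once reverse-complemented.
contig_start = "GACAATTTATAGT"
indirect = reverse_complement(contig_start)   # "ACTATAAATTGTC"
print(translate(indirect[:12]))               # TINC
print(orf_warnings("TINC", indirect[12:15]))  # both warnings
```

For the full 852 nt contig, the same call would be `translate(reverse_complement(contig)[155:851])`, under the coordinate assumption stated above.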

Phylogeny

Neighbor-joining method

 Negative branch lengths allowed


  +---------------------------------------Solibacter (Fibrobacteres)
  ! 
  !                                        +2Crypto   (Cryptosporidium parvum)
  !              +-------------------------1 
  !          +---8                         +1Cryp  (Cryptosporidium hominis)
  !          !   ! 
  !          !   +--------------------------Cryptococ (Cryptococcus neoformans)  
  !       +-12  
  !       !  !  +--------------------------------Flavobacte (Flavobacterium)
  !       !  +--9 
  !       !     !   +-----------------gammax12  (Gammaproteobacteria)
  !       !     +---7 
  !       !         +----------------Stigmatell  (Stigmatella aurantiaca)
  !       !  
  5------13                           +---------Medicago  (Medicago)
  !       !          +----------------3 
  !       !          !                !  +--Araxxx8    (Arabidopsis)
  !       !     +----6                +--2 
  !       !     !    !                   +---Brassica  (Brassica)
  !       !  +-10    ! 
  !       !  !  !    +-----------------------Arabidopsi (Arabidopsis)
  !       !  !  !  
  !       +-11  +---------------------notresq   
  !          !  
  !          !                  +----------Leishmania  (Leishmania)
  !          +------------------4 
  !                             +---------Trypanosom (Trypanosoma cruzi)
  ! 
  +-----------------------------------------------VIRUSx14  (Myoviridae)
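
Whether the bacterial or eukaryotic sequences form a single group in this tree can be checked mechanically. The sketch below is a hand transcription of the topology drawn above (branch lengths dropped, node 5 treated as the root), so the nested tuples are an assumption that should be compared against the original ClustalW/PHYLIP tree file. The domain assignments and function names are illustrative.

```python
# Topology transcribed by hand from the ASCII tree (assumption: no branch lengths).
TREE = (
    "Solibacter",
    (
        (
            (("2Crypto", "1Cryp"), "Cryptococ"),
            ("Flavobacte", ("gammax12", "Stigmatell")),
        ),
        (
            ((("Medicago", ("Araxxx8", "Brassica")), "Arabidopsi"), "notresq"),
            ("Leishmania", "Trypanosom"),
        ),
    ),
    "VIRUSx14",
)

BACTERIA = {"Solibacter", "Flavobacte", "gammax12", "Stigmatell"}
EUKARYOTES = {
    "2Crypto", "1Cryp", "Cryptococ", "Medicago", "Araxxx8",
    "Brassica", "Arabidopsi", "Leishmania", "Trypanosom",
}


def leaves(node):
    """Set of leaf names below a node (a leaf is a plain string)."""
    if isinstance(node, str):
        return frozenset([node])
    return frozenset().union(*(leaves(child) for child in node))


def is_clade(tree, group):
    """True if some subtree contains exactly the leaves in `group`."""
    group = frozenset(group)
    if leaves(tree) == group:
        return True
    if isinstance(tree, str):
        return False
    return any(is_clade(child, group) for child in tree)


def sister_group(tree, leaf):
    """Leaves of the sibling subtree(s) of `leaf`, or None if it is absent."""
    if isinstance(tree, str):
        return None
    if leaf in tree:  # the leaf is a direct child of this node
        return frozenset().union(*(leaves(c) for c in tree if c != leaf))
    for child in tree:
        found = sister_group(child, leaf)
        if found is not None:
            return found
    return None


print(is_clade(TREE, BACTERIA))            # False
print(is_clade(TREE, EUKARYOTES))          # False
print(sorted(sister_group(TREE, "notresq")))
```

Under this transcription neither domain forms a clade, and the query sequence (notresq) sits next to the four plant sequences.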

Annotator commentaries

Searching for ORFs with SMS, we found no coding reading frame on the direct strand using the "any codon" start option. We did, however, find one on the indirect (reverse) strand.

After running a BLAST search (blastp) on the NCBI server, we saw that the hits belong to both eukaryotic and prokaryotic species.

We wondered whether this sequence might be mitochondrial DNA that ended up in eukaryotes through symbiosis. To test this hypothesis, we looked at the EMBL record of the highest-scoring hit, from Leishmania major. The record did not confirm it: the sequence is genomic DNA.

We tried to define our study group, but this was not possible from the BLAST results and the taxonomy report. We therefore decided to build a phylogeny from various bacterial and eukaryotic sequences plus our own sequence, choosing them by best score.

Our hypothesis was that the bacterial sequences would cluster together, and likewise the eukaryotic sequences. We should therefore have obtained a tree with two distinct groups: bacteria on one side, eukaryotes on the other. Had our sequence fallen within the eukaryotes, we could have taken the eukaryotes as our study group and the bacteria as the outgroup (or vice versa). The tree did not bear this out. It did not show two distinct groups: eukaryotic sequences sat within the bacterial "group" and bacterial sequences within the eukaryotic "group". At that stage we could therefore not define our study group.

We went back to the BLAST results to make sense of the anomalies in the tree. Looking at the prokaryotic sequences chosen for the multiple alignment, we noticed that their percent identity to our sequence is very low (around 20%). We therefore looked for prokaryotic sequences with a better percent identity. There were none: all the prokaryotic sequences show low identity, with very long insertions.
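
A percent identity of this kind can be recomputed from two rows of the multiple alignment. The sketch below makes one assumption that is not stated in the commentary: columns where either sequence has a gap are left out of the denominator. BLAST reports identity over its own local alignment, so the figures will not match exactly. The two rows are copied from the third block of the ClustalW alignment on this page (columns 121-180).

```python
def percent_identity(row_a, row_b, gap="-"):
    """Percent of identical residues over columns where neither row has a gap."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(row_a, row_b):
        if a == gap or b == gap:
            continue  # skip gapped columns (assumed convention)
        compared += 1
        if a == b:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


# One 60-column block only; a real estimate should use the full aligned rows.
notresq    = "VAIGDVHGDYVATIKSLQLVNIINKNLN---WIGKNTILIQMGDILDRGGRDVTYGDEDS"
solibacter = "FAIGDVHGDCDRLLKLLSAAGLVEGSPAQVHWAAGRKVLLFTGDMVDKGPK---------"
print(round(percent_identity(notresq, solibacter), 1))
```

This block covers the most conserved region, so its identity is expected to be higher than the value over the whole alignment.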

This did not help us decide whether our sequence is eukaryotic or prokaryotic. However, the trees obtained with ClustalW and the BLAST results lead us to suppose that our sequence is closer to the eukaryotes than to the prokaryotes.

This sequence is found in various organisms but not in all of their close relatives, which leads us to consider two hypotheses:
  • The first is that the sequence was lost during evolution in those relatives, which is why we cannot detect it there.
  • The second is that the sequence has changed so much during evolution that the BLAST analysis no longer detects it.

We then wanted a phylogenetic tree, but since we could not decide between eukaryotes and prokaryotes for our study group, we chose a virus, Staphylococcus phage Twort, as the outgroup so as to obtain a rooted tree.

From the resulting tree we cannot settle the origin of our sequence, but since the closest orthologs come from eukaryotic organisms, we can hypothesize that our sequence comes from a unicellular eukaryote.

However, we found no gene name common to all the orthologs of our sequence, only a conserved region: "Metallophos".
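
The conserved region can be made more concrete by looking for short motifs in the translation. The three patterns below are a simplified form of the signature usually described for calcineurin-like metallophosphoesterases (GDxHG, GDxxDRG, GNH[DE]). They are an assumption added for illustration, not something taken from the annotators' notes, and they do not replace a proper domain search (for example against Pfam).

```python
import re

# First 120 residues of the ORF_VL16600 translation shown on this page.
PROTEIN_START = (
    "TIIYIYDIIYIIDMSKVINVAEKKTLKQRIVAIGDVHGDYVATIKSLQLV"
    "NIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDEDSEIKIIYLFKRLKK"
    "QAIKNKGNVICIIGNHEMLN"
)

# Illustrative regular expressions (assumed, simplified motif definitions).
MOTIFS = {
    "GDxHG": r"GD.HG",
    "GDxxDRG": r"GD..DRG",
    "GNH[DE]": r"GNH[DE]",
}


def find_motifs(protein, motifs=MOTIFS):
    """Return {motif name: [(1-based start, matched text), ...]}."""
    hits = {}
    for name, pattern in motifs.items():
        hits[name] = [(m.start() + 1, m.group()) for m in re.finditer(pattern, protein)]
    return hits


for name, found in find_motifs(PROTEIN_START).items():
    print(name, found)
# GDxHG [(34, 'GDVHG')]
# GDxxDRG [(70, 'GDILDRG')]
# GNH[DE] [(114, 'GNHE')]
```

The same three stretches line up with the starred columns of the ClustalW alignment (GD.HG, GD..D.G and GNH), which is where the sequences agree most.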

Multiple Alignment

ClustalW multiple alignment (PBIL)


                         10        20        30        40        50        60
                          |         |         |         |         |         |
1Crypto          --------------------------------MRKLLKKTLFSNIIILCLYLGILKCVNS
2Crypto          --------------------------------MRKLLKKTLFSNIIILCLYLGILKCVNS
Stigmatella      -----------------------------------MSAFLSMTRTLCVLLVAPVLGAAAP
gammax12         --------------------------------------------MRFLLLCLAVLMAPFT
Leishmania       --------------------------------------MVLPWKKSLSVLLSAALIIAAF
Trypanosoma      ---------------------------------------------MLRALWLWVLLFFFF
Brassica         -------------------------------MST-RENHNGICKTVPNLISSFVDAFVDY
Araxxx8          -------------------------------MSS-RENPSGICKSIPKLISSFVDTFVDY
Medicago         -------------------------------MEFEKQNSNTFCNQIPNFLSSFIDTFVDF
Arabidopsis      ------------------------------MASLYLNSLLPLPPSHPQKLLEPSSSSLLS
notresq          -----------------------------------------TIIYIYDIIYIIDMSKVIN
Cryptococcus     --------------------------------MPARQTIVIVSVVSLLLLYLFVHHTTSS
Flavobacterium   MAYIGYLVIAVSLGFFGVIYVIPLAPGWKLKTPHYSKKAYNLHTEPPLVFYQNESIHVVS
Solibacter       -----------------------------------MTLSRAVAAAAALSLFSGIPRAQNK
VIRUSx14         ------------------------------------------------------------
                                                                             
Prim.cons.       MAYIGYLVIAVSLGFFGVIYVIPLAPGWKL2M2R223224LF223IPLLLY2FVL4FV2S

                         70        80        90       100       110       120
                          |         |         |         |         |         |
1Crypto          HISNEIQKQNQTFDKENLESKNTQNLQESKNPTSKLSQTILKEEFNFLPNDVEINWKGRV
2Crypto          HISNEIQKQNQTFDKENLESKNTQNLQKSKNPTSQLSKTILKEEFNFLPNDVEINWKGRV
Stigmatella      GAPPRL---------EEVV-------------------------------EDTFSGVERV
gammax12         AAQS------------------------------------------------DWQGVERV
Leishmania       AVAL-------------------------------------------------SAEARRL
Trypanosoma      ATN--------------------------------------------------HGLGRRI
Brassica         SFSG-IFSPHHPTPLNDTPQ-------------------------------TRFEKPDRL
Araxxx8          SVSG-IFLPQDPSSQNEILQ-------------------------------TRFEKPERL
Medicago         SVSGGLFLPPPPSSPPPIP--------------------------------TRLPSPSRL
Arabidopsis      TSNGNELALKPIVINGDPPT-------------------------------FVSAPARRI
notresq          VAEKK------------------------------------------------TLK-QRI
Cryptococcus     HKPSVPNVPSRGAG--------------------------------------EAAYRQRL
Flavobacterium   IANEQNTFIKKQHSYPGNDSLVLKVN-------TPENSFKVRLKDSFAPAQDTYALPERL
Solibacter       TARDWAKN---------P-------------------------------AVVQIDTAEDV
VIRUSx14         ---------------------------------------------------------MSI
                                                                            :
Prim.cons.       3ASGEIF2P22P2SKE33PSKNTQNLQ2SKNPTS3LS3TILKEEFNFLP2D222AKPERL

                        130       140       150       160       170       180
                          |         |         |         |         |         |
1Crypto          LVIGDIHGDLKSLITSLFLSGVINSNLD---WIAKNTLLIQLGDVVDRGSH---------
2Crypto          LVIGDIHGDLKSLITSLFLSGVINSNLD---WIAKNTLLIQLGDVVDRGSH---------
Stigmatella      VAVGDVHGDVEALKEVLRLAGLIDAKDQ---WTGGKTHLVQTGDIADRGAR---------
gammax12         VAVADLHGDYDNYITVLRQAGVIDRRGR---WDAGKTHLVQLGDVPDRGPD---------
Leishmania       VAVGDLHGDYEQTVSVLRLTRLIDKRNH---WIGEDALLVQLGDILDVGPD---------
Trypanosoma      VAVGDLHGDLNQTLSILHLAGLVNKRQH---WIGKDTYFVQLGDILDVGPD---------
Brassica         VAIGDLHGDLEKSKEAFRIAGLIDSSDR---WTGGSTVVVQVGDLLDRGGD---------
Araxxx8          VAIGDLHGDLEKSREAFKIAGLIDSSDR---WTGGSTMVVQVGDVLDRGGE---------
Medicago         IAIGDLHGDLKKSKEALSIAGLIDSSGN---YTGGSATVVQIGDVLDRGGD---------
Arabidopsis      VAVGDLHGDLGKARDALQLAGVLSSDGRDQ-WVGQDTVLVQVGDILDRGDE---------
notresq          VAIGDVHGDYVATIKSLQLVNIINKNLN---WIGKNTILIQMGDILDRGGRDVTYGDEDS
Cryptococcus     VAVGDLHGDIDNAKKTLQMARIIDDDSK---WVASTDILVQTGDIVDRGAY---------
Flavobacterium   LCLSDIEGNFEGLVRFLKGTGVVDQDLA---WQYGTGHLVLLGDFFDRGTQ---------
Solibacter       FAIGDVHGDCDRLLKLLSAAGLVEGSPAQVHWAAGRKVLLFTGDMVDKGPK---------
VIRUSx14         FVIPDIHGEYDKLMRLMN--KIIEERKP-------EDTIVFLGDYIDRGDR---------
                 . : *:.*:       :    ::.              .:  **  * *           
Prim.cons.       VAIGDLHGDLEKL2EALRLAGLIDS2LR22HWIGG3T3LVQLGDILDRG2DDVTYGDEDS

                        190       200       210       220       230       240
                          |         |         |         |         |         |
1Crypto          ALQIYKLFNKLKSQAPSLGSKFVGLLGNHEVMNLCGQLHYVTDEDFQTYGGRDNR-----
2Crypto          ALQIYKLFNKLKSQAPSLGSKFVGLLGNHEVMNLCGQLHYVTDEDFQTYGGRDNR-----
Stigmatella      TREAFELMMRLEREALAAGGRVHLLLGNHEVMNMRGDLRYVTPEELASFAGLEAT-----
gammax12         SDKIIRHLMKLEEQAEKAGGKVHPLIGNHEVMNITGDLRYVHPGEYEALTSRNSKRL---
Leishmania       DILIVRLLMRLQQEAHAKGGDVIELLGNHELRNFRGDYKAVDKASLAASGGQKGR-----
Trypanosoma      DLMIVRLLMRLEKEAQAEGGDVIQILGNHEIRNLLGDFSAVDPVSLAQSGGKAGR-----
Brassica         ELKILFFLERLKREAEREGGKVVTMNGNHEIMNVEGDFRFVTKEGLEEFRVWSDWYS---
Araxxx8          ELKILYFLEKLKREAERAGGKILTMNGNHEIMNIEGDFRYVTKKGLEEFQIWADWYC---
Medicago         EIKILYLLEKLKRQAAIHGGNFITMNGNHEIMNAEGDFRFATKNGVEEFKVWLEWFR---
Arabidopsis      EIAILSLLRSLDDQAKANGGAVFQVNGNHETMNVEGDFRYVDARAFDECTDFLDYLEDYA
notresq          EIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKKKR-----
Cryptococcus     ADDIYRLMQSLRGQAASQGGKVVSILGNHEVMNAIGDWRYVTKGDIARFGGTKSR-----
Flavobacterium   VNECLWLIYKLEQEAARAGGKLHFILGNHETMNLMGAYDARMYKYVHGSYFKKAD-----
Solibacter       APEVLALLQHLRTEAAQAGGQVVVLTGNHEIDFLRGPLSDKAKEFAGQLEAAGFDP----
VIRUSx14         SKDVVNYLFDLLLNDE----NVVALLGNHDDELYR-IIENIDRLGIYDIEWLSRY-----
                        :  *  :       .  : ***:                              
Prim.cons.       ELKILRLLMKLKR2A2AAGGKVVTLLGNHE2MNL2GD2RYVTKEGL2EFGG3KDRY5DYA

                        250       260       270       280       290       300
                          |         |         |         |         |         |
1Crypto          -------------------------------------------TFEWSKEGFVGKYLRTM
2Crypto          -------------------------------------------TFEWSKEGFVGKYLRTM
Stigmatella      ---------------PDAPG---------------APKGLEGHRAAYGLEGRYGRWLRSH
gammax12         ---QENYFERVAAYLKENKGKDSVDEAFREQWFKEHPRGFVEHRQHWHPEGQFGAWVASH
Leishmania       -------------------------------------------DVLLSNATDLGRYLRTR
Trypanosoma      -------------------------------------------RELLSNRTPLGIYLRTR
Brassica         --LGNKMKSLCHGLDK-VKDLYEGIPMSFPRAREECFEGMRARIAALRPEGPIAKRFLSK
Araxxx8          --LGNKMKTLCSGLDK-PKDPYEGIPMSFPRMRADCFEGIRARIAALRPDGPIAKRFLTK
Medicago         --QGNKMKNLCKGLEETVVDPLENVHVAFRGVREEFHDGFRARVAALRPNGPISKRFFTQ
Arabidopsis      QDWDKAFRNWIFESRQWKEDRRSSQTYWDQWNVVKRQKEVIARSVLLRPGGRLACELSRH
notresq          -------------------------------------------KKFYKPGNKMAISMTKL
Cryptococcus     -------------------------------------------QHALSAEGWLGQEWLAN
Flavobacterium   -------------FLK--ID----------------------YSQWYTPDTALGRWLRSK
Solibacter       -------------------------------------------VTVAACKGDLGAFLCSL
VIRUSx14         ---------------------------------------------CMETLDSYGVDIATL
                                                                      .      
Prim.cons.       QDLGNKMKNLC5GL2K4VKDPYEGIPMSFPR5REEC22GFRARRAALSPEGPLGKYLRT3

                        310       320       330       340       350       360
                          |         |         |         |         |         |
1Crypto          K---------LAIRVNDSLYVHAGLLPKYAKLG-----LDRLDKLSNDLLEGDFCDFYSS
2Crypto          K---------LAIRVNDSLYVHAGLLPKYAKLG-----LDKLDKLSNDLLEGDFCDFYSS
Stigmatella      P---------AVVRIDGTLFLHGGLHPEVPAKT-----LGALNRWTRQDLFPDAAPGG--
gammax12         N---------TVIRINRSLFVHGGIGPDYLKAS-----MEDINDTVREELRAPDGADRR-
Leishmania       K---------AVFHYGPFLFMHGGFSTATAGMITSLSKVEQFNSELTKALLNGTISPLAR
Trypanosoma      R---------AIFHHKEFLFMHGGLSTATGNMITGIKAVEKFNKALRDTLINNTLSPMGK
Brassica         NQ--------TVAVVGDSVFVHGGLLAEHVEYG-----LERMNEEVTSWINGFRGGRYAP
Araxxx8          NQ--------TVAVVGDSVFVHGGLLAEHIEYG-----LERINEEVRGWINGFKGGRYAP
Medicago         NV--------TVLVVGDSIFVHGGLLKEHVDYG-----LEKINGEVSDWYKGLFGNRFSP
Arabidopsis      G---------VILRVNNWLFCHGGLLPHHVAYG-----IERINREVSTWMRSPTNYEDSP
notresq          FK--------AIYKIGPWIFVHGGIRPYLSKKYT----IDYINKTMHEYLSGNINLEDSK
Cryptococcus     YSTTALVPISPYPSSPTFSFTHGSLRPSYANLTPYPAAINDLGHSLLTKALTPPMAPPYP
Flavobacterium   N---------SIVKIGDYLFVHGGVSPQLVAAG---LRLEQINSGIREGLDKTPNDQTKQ
Solibacter       P---------FAARVNGWFFSHAGNSGGRTMQQLIADLQSGFDREGYSTAQLTGANSLVQ
VIRUSx14         TCN-----IVEDVLRNDYDFIKNELNKLKESED-----YRKFKVFMTNCRRYYKKDKY--
                                    : :                   :                  
Prim.cons.       NQ2TALVPI22V3RV2DSLFVHGGLLPE22KYGT445ALE2INKEVRDWLRG22GD2YSP

                        370       380       390       400       410       420
                          |         |         |         |         |         |
1Crypto          L----------------FFVEDGPLWTRDISL-------GEEEKACKLVDETLQILGLSR
2Crypto          L----------------FFVEDGPLWTRDISL-------GEEEKACKLVDETLQILGLSR
Stigmatella      -----------------GTDAKGPLWFRGYAQ-------EEEALWSQGLDAVLERFGARR
gammax12         V----------------VEDEEGPLWYRGLIMG------EETDKIAAHVDALMERFDVDR
Leishmania       NGLDL--------TEDDVDDVANPILVRSILT-------VKCNALSKVLDKKFP--GIQS
Trypanosoma      VGVSL--------KENKVKEVANPILVRSILN-------VRCSELKKVLSKKFH--GIKS
Brassica         ---------------GYCRGGNSVVWLRKFSDER-----PHRCDCAALEHALSTIPGVKR
Araxxx8          ---------------AYCRGGNSVVWLRKFSEEM-----AHKCDCAALEHALSTIPGVKR
Medicago         ---------------PYCRGRNALVWLRKFSDG--------NCDCSSLEHVLSTIPGVKR
Arabidopsis      Q---M--------PFIATRGYDSVVWSRLYSRETSELEDYQIEQVNKILHDTLEAVGAKA
notresq          E------------FQELFLDENSILWYRGYAE------------------------G--T
Cryptococcus     PNPYSGLPKGTTHEEADLYAEGGPLWWRGLAE-------REEAQVCEWAKNLKQKIGARR
Flavobacterium   E--------------RLLLRTEGPLWYRGLAN---------ESLTAEEVSRILDAFDSTK
Solibacter       AR-------------LGEQGPGGKSWFENGDK-------TFLPDHTAALGVAHVVQGHQH
VIRUSx14         -----------------IFTHSGGVSWKPVEG--------QTVDQLMWSRDFQPRKDGFI
                                            .                            .   
Prim.cons.       3G33LGLPKGTTH5EAY3RGENGPLW2RGISEE3SELED3EE2DCAKLLDALL3I2GVKR

                        430       440       450       460       470       480
                          |         |         |         |         |         |
1Crypto          MVVGHTIQH--DNRINIKCDNKLVLADTGFSEAIYGKPCMLEILYHNDNPDPSNYKSFHP
2Crypto          MVVGHTIQH--DNRINIKCDNKLILADTGFSEAIYGKPCMLEILYHNDNPDPSNYKSFHP
Stigmatella      MVMGHTPTK--DGRIGVRFGGRAVLIDTGLSTYYGRHLAALEIRGDRLTALYPDGRVSLL
gammax12         IVMGHTPG---FGTVVPRYHGRVLAADSGIAEYYGGNLASVLIENGQAFTLQSGERVAIP
Leishmania       VVVGHVPHDPRDFDGWRLCGGRLIDIDFGMSRWKKGDPGHVAALEIEEATWHVQLIETST
Trypanosoma      VVVGHVPHDTDDFSDWRLCGGRLIAIDFGLSRWKKGDPGHVAALEWDDVSGHVQLMES-T
Brassica         MIMGHTIQ---EAGINGVCGDKAIRIDVGMSKGCSDGLPEVLEIRKDSGVRIVTSNPLYK
Araxxx8          MIMGHTIQ---DAGINGVCNDKAIRIDVGMSKGCADGLPEVLEIRRDSGVRIVTSNPLYK
Medicago         MIMGHTIQ---KEGINGVCENKAIRIDVGMSKGCGGGLPEVLEIDR-YGVRILTSNPLYN
Arabidopsis      MVVGHTPQ---LSGVNCEYGCGIWRVDVGMSSGVLDSRPEVLEIRGDKARVIRSNRDRLH
notresq          INC---------------------------------------------------------
Cryptococcus     IIGGHTPN---FEKIVARCNASVIIIDTGISSAYGGVLSALEIVYTLTPVDRRGRDHSQD
Flavobacterium   IIIGHSVFD----QVQTLYTDQVIGIDLKHAENQHVYGLLYNASGFHAINDQNTTRFLMK
Solibacter       AVLRLPDGKHRKLGQIYQWRGLLFLIDVGLSQDIDESHGAVLRMRPNEASAICPDGRVKK
VIRUSx14         HVCGHTPT----SSGQVEKHNDMLLCDVG-AVFRDIELPFIKLEELE-------------
                                                                             
Prim.cons.       MV2GHTPQD3RD6GING3CGGKLILIDVGMSEG3GGGLPEVLI2R3DD2VDIVTSRPLYK

                        490       500       510       520       530       540
                          |         |         |         |         |         |
1Crypto          YSINELQVKS----NPSKPNIKYIQVTSLFLDKTSTKDEL--------------------
2Crypto          YSINELQVKS----NPSKPNIKYIQVTSLFLDKTSTRDEL--------------------
Stigmatella      TP--------------------GVKQGAGKPAAGKAASGR--------------------
gammax12         DEDAGLLAYYRA-IDKISPDINNLKVLIQRLEAAEAAEGSAMLHELRDLQWTHRVLVIHE
Leishmania       ASSLVSGT------NAPFPDSVAAYYLVSFSVLIVVGVVGVLLA-VYCLPLCYAAACEPL
Trypanosoma      GKFLNYEP------DVHSPTEHNVKRLLPFIRLAGAVVFLVLLVGLLGRWICSRAINNPL
Brassica         EKPNSQLVP---------ESKTGLGLLVPVEH-VTKQVEVKA------------------
Araxxx8          ENLYSHVAP---------DSKTGLGLLVPVP----KQVEVKA------------------
Medicago         QMNKENVDIG--------KVEEGFGLLLNNQDGRPRQVEVKA------------------
Arabidopsis      ELQVADYI----------------------------------------------------
notresq          ------------------------------------------------------------
Cryptococcus     PLLLSTSSESIAGLKGRFVEREEVHAIYEHSRKWLALEEREVVLD---------------
Flavobacterium   AY----------------------------------------------------------
Solibacter       LWDAAKPE----------SS-KGARCGS--------------------------------
VIRUSx14         ------------------------------------------------------------
                                                                             
Prim.cons.       ES322L3V2S2AG2NPS3PSIKG222L2PFLDK2SAQVE2KAL43L33L33C3RA333PL

                        550       560       570       580       590       600
                          |         |         |         |         |         |
1Crypto          ------------------------------------------------------------
2Crypto          ------------------------------------------------------------
Stigmatella      ------------------------------------------------------------
gammax12         PEASDQLRDVFRDNREAFDERRLAWFMIADGELQSNLDGRVGRDLLENIRIQLNPAVNEV
Leishmania       LEAGTNQYGTV-------------------------------------------------
Trypanosoma      SEKQEGSYGSVAPCVTP-------------------------------------------
Brassica         ------------------------------------------------------------
Araxxx8          ------------------------------------------------------------
Medicago         ------------------------------------------------------------
Arabidopsis      ------------------------------------------------------------
notresq          ------------------------------------------------------------
Cryptococcus     ------------------------------------------------------------
Flavobacterium   ------------------------------------------------------------
Solibacter       ------------------------------------------------------------
VIRUSx14         ------------------------------------------------------------
                                                                             
Prim.cons.       3EA3333YG3V222222FDERRLAWFMIADGELQSNLDGRVGRDLLENIRIQLNPAVNEV

                        610       620       630
                          |         |         |
1Crypto          -------------------------------------
2Crypto          -------------------------------------
Stigmatella      -------------------------------------
gammax12         FLVGLDGGVKLRDKTLDLNALYATIDAMPMRRAEVRR
Leishmania       -------------------------------------
Trypanosoma      -------------------------------------
Brassica         -------------------------------------
Araxxx8          -------------------------------------
Medicago         -------------------------------------
Arabidopsis      -------------------------------------
notresq          -------------------------------------
Cryptococcus     -------------------------------------
Flavobacterium   -------------------------------------
Solibacter       -------------------------------------
VIRUSx14         -------------------------------------
                                                                             
Prim.cons.       FLVGLDGGVKLRDKTLDLNALYATIDAMPMRRAEVRR       

BLAST

BLAST against nr:

BLASTP 2.2.15 [Oct-15-2006]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman 
(1997), "Gapped BLAST and PSI-BLAST: a new generation of 
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei 
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and 
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST 
protein database searches with composition-based statistics 
and other refinements", Nucleic Acids Res. 29:2994-3005.

RID: 1164807440-25306-58504666540.BLASTQ2


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
           4,194,990 sequences; 1,443,877,211 total letters

Query= ORF_VL16600 traduction [156-851 sens indirect] Length=232 



Sequences producing significant alignments:                        (Bits)  Value

gi|68128007|emb|CAJ06778.1|  serine/threonine phosphatase, putati   105    1e-21
gi|71410022|ref|XP_807326.1|  serine/threonine protein phospha...   104    4e-21  Gene info
gi|71416592|ref|XP_810310.1|  serine/threonine protein phospha...   103    5e-21  Gene info
gi|67611611|ref|XP_667167.1|  hypothetical protein Chro.80040 ...   101    2e-20  Gene info
gi|66356628|ref|XP_625492.1|  serine-threonine protein phospha...   100    4e-20  Gene info
gi|19310495|gb|AAL84981.1|  At1g07010/F10K1_19 [Arabidopsis thali   100    4e-20  UniGene info
gi|22329383|ref|NP_172182.2|  hydrolase/ protein serine/threon...   100    5e-20  UniGene infoGene info
gi|58270466|ref|XP_572389.1|  hypothetical protein [Cryptococc...  96.7    8e-19  Gene info
gi|58743497|gb|AAW81738.1|  Putative [Brassica oleracea]           96.7    8e-19
gi|6714305|gb|AAF26001.1|AC013354_20  F15H18.4 [Arabidopsis thali  96.3    1e-18
gi|21536655|gb|AAM60987.1|  unknown [Arabidopsis thaliana]         95.9    1e-18  UniGene info
gi|18394613|ref|NP_564053.1|  hydrolase/ protein serine/threon...  95.9    1e-18  UniGene infoGene info
gi|92896768|gb|ABE93336.1|  Metallophosphoesterase [Medicago trun  95.9    1e-18
gi|23509882|ref|NP_702549.1|  hypothetical protein PF14_0660 [...  90.5    5e-17  GeoGene info
gi|82538812|ref|XP_723832.1|  serine/threonine protein phospha...  89.0    2e-16  Gene info
gi|68066879|ref|XP_675411.1|  hypothetical protein [Plasmodium...  88.6    2e-16  Gene info
gi|116057491|emb|CAL51918.1|  unnamed protein product [Ostreococc  87.0    6e-16
gi|70936765|ref|XP_739281.1|  hypothetical protein [Plasmodium...  86.7    8e-16  Gene info
gi|71407988|ref|XP_806425.1|  hypothetical protein [Trypanosom...  86.7    8e-16  Gene info
gi|71018465|ref|XP_759463.1|  hypothetical protein UM03316.1 [...  86.7    9e-16  Gene info
gi|71663684|ref|XP_818832.1|  hypothetical protein [Trypanosom...  85.1    2e-15  Gene info
gi|115376591|ref|ZP_01463822.1|  hypothetical protein STIAU_73...  85.1    3e-15
gi|115485009|ref|NP_001067648.1|  Os11g0261900 [Oryza sativa (...  84.7    3e-15  Gene info
gi|116500206|gb|EAU83101.1|  hypothetical protein CC1G_11185 [...  84.3    4e-15
gi|20503032|gb|AAM22720.1|AC092388_4  putative protein-tyrosin...  83.2    8e-15  Gene info
gi|116057822|emb|CAL54025.1|  unnamed protein product [Ostreococc  83.2    9e-15
gi|71852187|ref|XP_825406.1|  serine/threonine protein phospha...  83.2    9e-15  Gene info
gi|86143170|ref|ZP_01061572.1|  hypothetical protein MED217_08...  82.8    1e-14
gi|88706344|ref|ZP_01104050.1|  protein-tyrosine-phosphatase [...  80.9    4e-14
gi|68224184|emb|CAJ04811.1|  Serine-threonin protein phosphata...  80.5    6e-14
gi|73536556|ref|XP_847702.1|  hypothetical protein LMJ_0679 [L...  79.7    1e-13  Gene info
gi|116620563|ref|YP_822719.1|  metallophosphoesterase [Solibac...  79.3    1e-13  Gene info
gi|19076006|ref|NP_588506.1|  hypothetical protein SPCC1840.07...  79.0    2e-13  Gene info
gi|8954044|gb|AAF82218.1|AC067971_26  Contains similarity to s...  76.3    1e-12
gi|115481858|ref|NP_001064522.1|  Os10g0394100 [Oryza sativa (...  75.5    2e-12  Gene info
gi|71753577|ref|XP_826385.1|  serine/threonine protein phospha...  75.1    2e-12  Gene info
gi|108763826|ref|YP_633610.1|  metallophosphoesterase [Myxococ...  72.8    1e-11  Gene info
gi|71655876|ref|XP_816494.1|  serine/threonine protein phospha...  72.8    1e-11  Gene info
gi|71425796|ref|XP_813172.1|  serine/threonine protein phospha...  71.2    4e-11  Gene info
gi|68062724|ref|XP_673372.1|  hypothetical protein PB300296.00...  69.7    1e-10  Gene info
gi|86142572|ref|ZP_01061011.1|  hypothetical protein MED217_06...  68.9    2e-10
gi|117919535|ref|YP_868727.1|  metallophosphoesterase [Shewane...  66.6    9e-10  Gene info
gi|113969430|ref|YP_733223.1|  metallophosphoesterase [Shewane...  65.9    1e-09  Gene info
gi|24374986|ref|NP_719029.1|  hypothetical protein phosphatase...  65.1    2e-09  Gene info
gi|5596634|gb|AAD45611.1|AF164202_1  protein-tyrosine-phosphatase  64.3    4e-09
gi|78365734|ref|ZP_00836019.1|  Metallophosphoesterase [Shewan...  64.3    4e-09
gi|61680099|pdb|1V73|A  Chain A, Crystal Structure Of Cold-Act...  63.9    6e-09  Related structures
gi|114046658|ref|YP_737208.1|  metallophosphoesterase [Shewane...  63.5    7e-09  Gene info
gi|68543324|ref|ZP_00583033.1|  Metallophosphoesterase [Shewan...  63.2    8e-09
gi|68546842|ref|ZP_00586386.1|  Metallophosphoesterase [Shewan...  63.2    9e-09
gi|113948621|ref|ZP_01434279.1|  metallophosphoesterase [Shewa...  62.8    1e-08
gi|23508757|ref|NP_701425.1|  phosphoesterase, putative [Plasm...  62.0    2e-08  GeoGene info
gi|91218202|ref|ZP_01255150.1|  hypothetical protein P700755_1...  61.6    2e-08
gi|55819273|ref|YP_142752.1|  unknown [Acanthamoeba polyphaga ...  60.5    6e-08  Gene info
gi|88860845|ref|ZP_01135481.1|  hypothetical protein PTD2_0986...  60.1    9e-08
gi|118071565|ref|ZP_01539759.1|  metallophosphoesterase [Shewa...  58.2    3e-07
gi|57234343|ref|YP_181629.1|  Ser/Thr protein phosphatase fami...  58.2    3e-07  Gene info
gi|88857278|ref|ZP_01131921.1|  hypothetical protein PTD2_0192...  57.8    4e-07
gi|94985649|ref|YP_605013.1|  metallophosphoesterase [Deinococ...  51.2    3e-05  Gene info
gi|88932726|ref|ZP_01138408.1|  Metallophosphoesterase [Dehalo...  50.8    5e-05
gi|70952726|ref|XP_745512.1|  phosphoesterase [Plasmodium chab...  49.7    1e-04  Gene info
gi|50554273|ref|XP_504545.1|  hypothetical protein [Yarrowia l...  49.7    1e-04  Gene info
gi|82753511|ref|XP_727707.1|  serine/threonine protein phospha...  49.3    2e-04  Gene info
gi|116623122|ref|YP_825278.1|  metallophosphoesterase [Solibac...  48.5    2e-04  Gene info
gi|75910295|ref|YP_324591.1|  Metallophosphoesterase [Anabaena...  45.8    0.001  Gene info
gi|50304095|ref|XP_451997.1|  unnamed protein product [Kluyver...  45.8    0.002  Gene info
gi|17227869|ref|NP_484417.1|  serine/threonine protein phospha...  45.8    0.002  Gene info
gi|21222346|ref|NP_628125.1|  serine/threonineprotein kinase [...  45.1    0.003  Gene info
gi|29830806|ref|NP_825440.1|  serine/threonine protein phospha...  44.3    0.004  Gene info
gi|115523105|ref|YP_780016.1|  metallophosphoesterase [Rhodops...  44.3    0.004  Gene info
gi|45198468|ref|NP_985497.1|  AFL051Wp [Eremothecium gossypii]...  43.1    0.011  Gene info
gi|6324112|ref|NP_014182.1|  hypothetical protein; Ynl217wp [S...  43.1    0.011  Gene info
gi|85858209|ref|YP_460411.1|  calcineurin-like phosphoesterase...  42.4    0.017  Gene info
gi|83309876|ref|YP_420140.1|  Diadenosine tetraphosphatase and...  42.4    0.018  Gene info
gi|15805959|ref|NP_294659.1|  phosphatase, putative [Deinococc...  41.6    0.028  Gene info
gi|89360824|ref|ZP_01198641.1|  putative serine/threonine prot...  41.6    0.031
gi|67938067|ref|ZP_00530597.1|  Metallophosphoesterase [Chloro...  41.2    0.033
gi|6325078|ref|NP_015146.1|  Putative protein serine/threonine...  41.2    0.039  Gene info
gi|68551960|ref|ZP_00591353.1|  Metallophosphoesterase [Prosth...  41.2    0.039
gi|46201615|ref|ZP_00208173.1|  COG0639: Diadenosine tetraphos...  40.4    0.060
gi|71649430|ref|XP_813439.1|  phosphoprotein phosphatase [Tryp...  40.4    0.070  Gene info
gi|116872043|ref|YP_848824.1|  serine/threonine protein phosph...  40.0    0.095  Gene info
gi|71414044|ref|XP_809138.1|  phosphoprotein phosphatase [Tryp...  39.7    0.11   Gene info
gi|19114285|ref|NP_593373.1|  hypothetical protein SPAC57A7.08...  39.7    0.13   Gene info
gi|50419431|ref|XP_458242.1|  hypothetical protein DEHA0C13981...  39.3    0.13   Gene info
gi|111070459|gb|EAT91579.1|  hypothetical protein SNOG_00084 [Pha  38.5    0.26 
gi|71070657|ref|XP_767901.1|  phosphatase catalytic subunit [G...  38.5    0.26   Gene info
gi|118354944|ref|XP_001010733.1|  Ser/Thr protein phosphatase ...  38.5    0.27   Gene info
gi|16799733|ref|NP_470001.1|  hypothetical protein lin0658 [Li...  38.1    0.29   Gene info
gi|14600961|ref|NP_147487.1|  serine/threonine protein phospha...  38.1    0.33   Gene info
gi|110605901|ref|ZP_01393947.1|  Metallophosphoesterase [Therm...  38.1    0.35 
gi|115964570|ref|XP_001195791.1|  PREDICTED: similar to fibrop...  38.1    0.37   Gene info
gi|16802697|ref|NP_464182.1|  hypothetical protein lmo0655 [Li...  37.7    0.37   Gene info
gi|52548528|gb|AAU82377.1|  serine/threonine protein phosphata...  37.7    0.38 
gi|71416005|ref|XP_810049.1|  serine/threonine-protein phospha...  37.7    0.45   Gene info
gi|46107938|ref|XP_381028.1|  hypothetical protein FG00852.1 [...  37.7    0.46   Gene info
gi|68488001|ref|XP_712159.1|  hypothetical protein CaO19.6707 ...  37.7    0.46   Gene info
gi|27366499|ref|NP_762026.1|  Diadenosine tetraphosphatase [Vi...  37.4    0.50   Gene info
gi|118394669|ref|XP_001029699.1|  Ser/Thr protein phosphatase ...  37.4    0.51   Gene info
gi|116180828|ref|XP_001220263.1|  hypothetical protein CHGG_01...  37.4    0.52   Gene info
gi|50285623|ref|XP_445240.1|  unnamed protein product [Candida...  37.4    0.54   Gene info
gi|71401079|ref|XP_803255.1|  serine/threonine protein phospha...  37.4    0.54   Gene info
gi|37676207|ref|NP_936603.1|  diadenosine tetraphosphatase [Vi...  37.4    0.57   Gene info
gi|85116346|ref|XP_965036.1|  hypothetical protein ( (AF071751...  37.4    0.61   Gene info
gi|57867430|ref|YP_189120.1|  serine/threonine protein phospha...  37.0    0.63   Gene info
gi|47095217|ref|ZP_00232828.1|  serine/threonine protein phosp...  37.0    0.63 
gi|115668683|ref|XP_001199366.1|  PREDICTED: similar to serine...  37.0    0.65   Gene info
gi|71080141|ref|XP_779151.1|  protein phosphatase catalytic su...  37.0    0.68   Gene info
gi|113475889|ref|YP_721950.1|  metallophosphoesterase [Trichod...  37.0    0.75   Gene info
gi|23010306|ref|ZP_00051041.1|  COG0639: Diadenosine tetraphos...  37.0    0.76 
gi|89094175|ref|ZP_01167118.1|  bis(5'-nucleosyl)-tetraphospha...  36.6    0.87 
gi|4193367|gb|AAD09995.1|  protein phosphatase-Z-like serine/t...  36.6    0.97 
gi|67477408|ref|XP_654180.1|  protein phosphatase [Entamoeba h...  36.6    1.00   Gene info
gi|2760343|gb|AAC46052.1|  serine/threonine protein phosphatase 1  36.6    1.0  
gi|50303277|ref|XP_451580.1|  unnamed protein product [Kluyver...  36.6    1.1    Gene info
gi|114704992|ref|ZP_01437900.1|  serine/threonine protein phos...  36.2    1.1  
gi|2668561|gb|AAC46051.1|  serine/threonine protein phosphatase 1  36.2    1.2  
gi|84704623|ref|ZP_01018123.1|  serine/threonine protein phosp...  36.2    1.2  
gi|116253529|ref|YP_769367.1|  putative serine/threonine prote...  36.2    1.2    Gene info
gi|46906901|ref|YP_013290.1|  serine/threonine protein phospha...  36.2    1.3    Gene info
gi|67937368|ref|ZP_00530329.1|  Metallophosphoesterase [Chloro...  35.8    1.5  
gi|6323465|ref|NP_013537.1|  Calcineurin A; one isoform (the o...  35.8    1.5    Gene info
gi|47091614|ref|ZP_00229410.1|  serine/threonine protein phosp...  35.8    1.5  
gi|87311625|ref|ZP_01093742.1|  serine/threonine protein phosp...  35.8    1.5  
gi|71078053|ref|XP_771537.1|  serine/threonine protein phospha...  35.8    1.5    Gene info
gi|67526671|ref|XP_661397.1|  hypothetical protein AN3793.2 [A...  35.8    1.5    Gene info
gi|171149|gb|AAA34465.1|  calcineurin A1                           35.8    1.6  
gi|110668091|ref|YP_657902.1|  putative protein-tyrosine-phosp...  35.8    1.7    Gene info
gi|116059065|emb|CAL54772.1|  unnamed protein product [Ostreococc  35.8    1.7  
gi|81428657|ref|YP_395657.1|  hypothetical protein [Lactobacil...  35.4    2.0    Gene info
gi|74831040|emb|CAI39150.1|  calcineurin-A6-2 [Paramecium tetraur  35.4    2.2  
gi|68485979|ref|XP_713108.1|  hypothetical protein CaO19.8345 ...  35.0    2.4    Gene info
gi|68223697|emb|CAJ01926.1|  phosphoprotein phosphatase, putative  35.0    2.8  
gi|39595658|emb|CAE67160.1|  Hypothetical protein CBG12586 [Caeno  35.0    2.9  
gi|50422049|ref|XP_459586.1|  hypothetical protein DEHA0E06831...  35.0    2.9    Gene info
gi|67482970|ref|XP_656780.1|  protein phosphatase [Entamoeba h...  35.0    2.9    Gene info
gi|70725356|ref|YP_252270.1|  hypothetical protein SH0355 [Sta...  35.0    3.1    Gene info
gi|110802206|ref|YP_699754.1|  Ser/Thr protein phosphatase fam...  34.7    3.7    Gene info
gi|58262860|ref|XP_568840.1|  protein serine/threonine phospha...  34.7    3.7    Gene info
gi|71423457|ref|XP_812468.1|  phosphoprotein phosphatase [Tryp...  34.7    3.7    Gene info
gi|67467641|ref|XP_649912.1|  Ser/Thr protein phosphatase [Ent...  34.7    3.7    Gene info
gi|50259855|gb|EAL22523.1|  hypothetical protein CNBB4010 [Cry...  34.7    3.7  
gi|45190936|ref|NP_985190.1|  AER334Cp [Eremothecium gossypii]...  34.7    4.0    Gene info
gi|110632731|ref|YP_672939.1|  metallophosphoesterase [Mesorhi...  34.3    4.1    Gene info
gi|67475921|ref|XP_653591.1|  protein phosphatase [Entamoeba h...  34.3    4.1    Gene info
gi|73670645|ref|YP_306660.1|  serine/threonine specific protei...  34.3    4.4    Gene info
gi|66391303|ref|YP_238676.1|  ORF049 [Staphylococcus phage Two...  34.3    4.5  
gi|75909513|ref|YP_323809.1|  Metallophosphoesterase [Anabaena...  34.3    4.5    Gene info
gi|74834581|emb|CAI44588.1|  calcineurin-A6-1 [Paramecium tetraur  34.3    5.2  
gi|67473946|ref|XP_652722.1|  protein phosphatase [Entamoeba h...  33.9    5.3    Gene info
gi|15643505|ref|NP_228551.1|  serine/threonine protein phospha...  33.9    5.7    Gene info
gi|311119|gb|AAA34899.1|  type 1-related protein phosphatase       33.9    5.7  
gi|6320644|ref|NP_010724.1|  Serine/threonine protein phosphat...  33.9    5.7    Gene info
gi|20092075|ref|NP_618150.1|  serine/threonine specific protei...  33.9    6.1    Gene info
gi|67475468|ref|XP_653428.1|  protein phosphatase [Entamoeba h...  33.9    6.7    Gene info
gi|18311480|ref|NP_563414.1|  hypothetical protein CPE2498 [Cl...  33.5    7.2    Gene info
gi|110799445|ref|YP_697185.1|  Ser/Thr protein phosphatase fam...  33.5    7.3    Gene info
gi|67478175|ref|XP_654504.1|  protein phosphatase [Entamoeba h...  33.5    7.8    Gene info
gi|71075576|ref|XP_770312.1|  serine/threonine protein phospha...  33.5    8.0    Gene info
gi|84997333|ref|XP_953388.1|  ribosomal protein S27 [Theileria...  33.5    8.4    Gene info
gi|90590888|ref|ZP_01246534.1|  Metallophosphoesterase [Flavob...  33.5    8.5  
gi|67478229|ref|XP_654528.1|  protein phosphatase [Entamoeba h...  33.5    8.5    Gene info
gi|50287361|ref|XP_446110.1|  unnamed protein product [Candida...  33.5    8.9    Gene info
gi|71414288|ref|XP_809251.1|  protein phosphotase [Trypanosoma...  33.5    8.9    Gene info
gi|39975409|ref|XP_369095.1|  hypothetical protein MG00149.4 [Mag  33.1    9.3    Gene info
gi|66811004|ref|XP_639209.1|  hypothetical protein DDBDRAFT_01...  33.1    9.8    Gene info
gi|106884174|ref|ZP_01351560.1|  conserved hypothetical protei...  33.1    9.9  
gi|67471319|ref|XP_651611.1|  protein phosphatase [Entamoeba h...  33.1    9.9    Gene info
gi|116056038|emb|CAL58571.1|  unnamed protein product [Ostreococc  33.1    10.0 

>gi|68128007|emb|CAJ06778.1|  serine/threonine phosphatase, putative [Leishmania major]
Length=371

 Score =  105 bits (263),  Expect = 1e-21, Method: Composition-based stats.
 Identities = 54/148 (36%), Positives = 87/148 (58%), Gaps = 10/148 (6%)

Query  28   QRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDEDS  87
            +R+VA+GD+HGDY  T+  L+L  +I+K  +WIG++ +L+Q+GDILD G  D        
Sbjct  31   RRLVAVGDLHGDYEQTVSVLRLTRLIDKRNHWIGEDALLVQLGDILDVGPDD--------  82

Query  88   EIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKKKRKKFYK  147
             I I+ L  RL+++A    G+VI ++GNHE+ N +G Y    K  +   G +K R     
Sbjct  83   -ILIVRLLMRLQQEAHAKGGDVIELLGNHELRNFRGDYKAVDKASLAASGGQKGRDVLLS  141

Query  148  PGNKMAISMTKLFKAIYKIGPWIFVHGG  175
                +   + +  KA++  GP++F+HGG
Sbjct  142  NATDLGRYL-RTRKAVFHYGPFLFMHGG  168


>gi|71410022|ref|XP_807326.1| serine/threonine protein phosphatase [Trypanosoma cruzi strain 
CL Brener]
 gi|70871303|gb|EAN85475.1| serine/threonine protein phosphatase, putative [Trypanosoma cruzi]
Length=369

 Score =  104 bits (259),  Expect = 4e-21, Method: Composition-based stats.
 Identities = 61/184 (33%), Positives = 97/184 (52%), Gaps = 14/184 (7%)

Query  21   AEKKTLKQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDV  80
            A    L +RIVA+GD+HGD   T+  L L  ++NK  +WIGK+T  +Q+GDILD G  D 
Sbjct  16   ATNHGLGRRIVAVGDLHGDLNQTLSILHLAGLVNKRQHWIGKDTYFVQLGDILDVGPDD-  74

Query  81   TYGDEDSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKK  140
                    + I+ L  RL+K+A    G+VI I+GNHE+ NL G +     + + + G K 
Sbjct  75   --------LMIVRLLMRLEKEAQAEGGDVIQILGNHEIRNLLGDFSAVDPVSLAQSGGKA  126

Query  141  KRKKFYKPGNKMAISMTKLFKAIYKIGPWIFVHGGIRPYLSKKYT----IDYINKTMHEY  196
             R++       + I + +  +AI+    ++F+HGG+        T    ++  NK + + 
Sbjct  127  GRRELLSNRTPLGIYL-RTRRAIFHHKEFLFMHGGLSTATGNMITGIKAVEKFNKALRDT  185

Query  197  LSGN  200
            L  N
Sbjct  186  LINN  189


>gi|71416592|ref|XP_810310.1| serine/threonine protein phosphatase [Trypanosoma cruzi strain 
CL Brener]
 gi|70874822|gb|EAN88459.1| serine/threonine protein phosphatase, putative [Trypanosoma cruzi]
Length=369

 Score =  103 bits (258),  Expect = 5e-21, Method: Composition-based stats.
 Identities = 60/179 (33%), Positives = 96/179 (53%), Gaps = 14/179 (7%)

Query  26   LKQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDE  85
            L +RIVA+GD+HGD   T+  L L  ++NK  +WIGK+T  +Q+GDILD G  D      
Sbjct  21   LGRRIVAVGDLHGDLNQTLSILHLAGLVNKRQHWIGKDTYFVQLGDILDVGPDD------  74

Query  86   DSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKKKRKKF  145
               + I+ L  RL+K+A    G+VI I+GNHE+ NL G +     + + + G K  R++ 
Sbjct  75   ---LMIVRLLMRLEKEAQAEGGDVIQILGNHEIRNLLGDFSAVDPVSLAQSGGKAGRREL  131

Query  146  YKPGNKMAISMTKLFKAIYKIGPWIFVHGGIRPYLSKKYT----IDYINKTMHEYLSGN  200
                  + I + +  +AI+    ++F+HGG+        T    ++  NK + + L  N
Sbjct  132  LSNRTPLGIYL-RTRRAIFHHKEFLFMHGGLSTATGNMITGIKAVEEFNKALRDTLINN  189


>gi|67611611|ref|XP_667167.1| hypothetical protein Chro.80040 [Cryptosporidium hominis TU502]
 gi|54658271|gb|EAL36934.1| hypothetical protein Chro.80040 [Cryptosporidium hominis]
Length=385

 Score =  101 bits (252),  Expect = 2e-20, Method: Composition-based stats.
 Identities = 64/202 (31%), Positives = 102/202 (50%), Gaps = 15/202 (7%)

Query  27   KQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDED  86
            K R++ IGD+HGD  + I SL L  +IN NL+WI KNT+LIQ+GD++DRG          
Sbjct  85   KGRVLVIGDIHGDLKSLITSLFLSGVINSNLDWIAKNTLLIQLGDVVDRGSH--------  136

Query  87   SEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKKKRKKFY  146
              ++I  LF +LK QA       + ++GNHE++NL G   Y +    + +G +  R  F 
Sbjct  137  -ALQIYKLFNKLKSQAPSLGSKFVGLLGNHEVMNLCGQLHYVTDEDFQTYGGRDNR-TFE  194

Query  147  KPGNKMAISMTKLFKAIYKIGPWIFVHGGIRPYLSKKYTIDYINKTMHEYLSGNINLEDS  206
                       +  K   ++   ++VH G+ P  + K  +D ++K  ++ L G+      
Sbjct  195  WSKEGFVGKYLRTMKLAIRVNDSLYVHAGLLPKYA-KLGLDRLDKLSNDLLEGDF----C  249

Query  207  KEFQELFLDENSILWYRGYAEG  228
              +  LF  E+  LW R  + G
Sbjct  250  DFYSSLFFVEDGPLWTRDISLG  271


>gi|66356628|ref|XP_625492.1| serine-threonine protein phosphatase [Cryptosporidium parvum 
Iowa II]
 gi|46226486|gb|EAK87480.1| serine-threonine protein phosphatase [Cryptosporidium parvum]
Length=385

 Score =  100 bits (250),  Expect = 4e-20, Method: Composition-based stats.
 Identities = 64/202 (31%), Positives = 102/202 (50%), Gaps = 15/202 (7%)

Query  27   KQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDED  86
            K R++ IGD+HGD  + I SL L  +IN NL+WI KNT+LIQ+GD++DRG          
Sbjct  85   KGRVLVIGDIHGDLKSLITSLFLSGVINSNLDWIAKNTLLIQLGDVVDRGSH--------  136

Query  87   SEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKKKRKKFY  146
              ++I  LF +LK QA       + ++GNHE++NL G   Y +    + +G +  R  F 
Sbjct  137  -ALQIYKLFNKLKSQAPSLGSKFVGLLGNHEVMNLCGQLHYVTDEDFQTYGGRDNR-TFE  194

Query  147  KPGNKMAISMTKLFKAIYKIGPWIFVHGGIRPYLSKKYTIDYINKTMHEYLSGNINLEDS  206
                       +  K   ++   ++VH G+ P  + K  +D ++K  ++ L G+      
Sbjct  195  WSKEGFVGKYLRTMKLAIRVNDSLYVHAGLLPKYA-KLGLDKLDKLSNDLLEGDF----C  249

Query  207  KEFQELFLDENSILWYRGYAEG  228
              +  LF  E+  LW R  + G
Sbjct  250  DFYSSLFFVEDGPLWTRDISLG  271


>gi|19310495|gb|AAL84981.1| At1g07010/F10K1_19 [Arabidopsis thaliana]
Length=389

 Score =  100 bits (250),  Expect = 4e-20, Method: Composition-based stats.
 Identities = 69/253 (27%), Positives = 117/253 (46%), Gaps = 62/253 (24%)

Query  28   QRIVAIGDVHGDYVATIKSLQLVNIINKN--LNWIGKNTILIQMGDILDRGGRDVTYGDE  85
            +RIVA+GD+HGD      +LQL  +++ +    W+G++T+L+Q+GDILDRG         
Sbjct  57   RRIVAVGDLHGDLGKARDALQLAGVLSSDGRDQWVGQDTVLVQVGDILDRG---------  107

Query  86   DSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKK----------  135
            D EI I+ L + L  QA  N G V  + GNHE +N++G + Y       +          
Sbjct  108  DEEIAILSLLRSLDDQAKANGGAVFQVNGNHETMNVEGDFRYVDARAFDECTDFLDYLED  167

Query  136  -------------FGTKK------------------KRKK-------FYKPGNKMAISMT  157
                         F +++                  KR+K         +PG ++A  ++
Sbjct  168  YAQDWDKAFRNWIFESRQWKEDRRSSQTYWDQWNVVKRQKEVIARSVLLRPGGRLACELS  227

Query  158  KLFKAIYKIGPWIFVHGGIRPYLSKKYTIDYINKTMHEYLSGNINLEDSKEFQELFL-DE  216
            +    I ++  W+F HGG+ P+    Y I+ IN+ +  ++    N EDS +   +     
Sbjct  228  R-HGVILRVNNWLFCHGGLLPH-HVAYGIERINREVSTWMRSPTNYEDSPQMPFIATRGY  285

Query  217  NSILWYRGYAEGT  229
            +S++W R Y+  T
Sbjct  286  DSVVWSRLYSRET  298


>gi|22329383|ref|NP_172182.2| hydrolase/ protein serine/threonine phosphatase [Arabidopsis 
thaliana]
 gi|22531180|gb|AAM97094.1| unknown protein [Arabidopsis thaliana]
 gi|30725626|gb|AAP37835.1| At1g07010 [Arabidopsis thaliana]
Length=389

 Score =  100 bits (250),  Expect = 5e-20, Method: Composition-based stats.
 Identities = 69/253 (27%), Positives = 117/253 (46%), Gaps = 62/253 (24%)

Query  28   QRIVAIGDVHGDYVATIKSLQLVNIINKN--LNWIGKNTILIQMGDILDRGGRDVTYGDE  85
            +RIVA+GD+HGD      +LQL  +++ +    W+G++T+L+Q+GDILDRG         
Sbjct  57   RRIVAVGDLHGDLGKARDALQLAGVLSSDGRDQWVGQDTVLVQVGDILDRG---------  107

Query  86   DSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKK----------  135
            D EI I+ L + L  QA  N G V  + GNHE +N++G + Y       +          
Sbjct  108  DDEIAILSLLRSLDDQAKANGGAVFQVNGNHETMNVEGDFRYVDARAFDECTDFLDYLED  167

Query  136  -------------FGTKK------------------KRKK-------FYKPGNKMAISMT  157
                         F +++                  KR+K         +PG ++A  ++
Sbjct  168  YAQDWDKAFRNWIFESRQWKEDRRSSQTYWDQWNVVKRQKGVIARSVLLRPGGRLACELS  227

Query  158  KLFKAIYKIGPWIFVHGGIRPYLSKKYTIDYINKTMHEYLSGNINLEDSKEFQELFL-DE  216
            +    I ++  W+F HGG+ P+    Y I+ IN+ +  ++    N EDS +   +     
Sbjct  228  R-HGVILRVNNWLFCHGGLLPH-HVAYGIERINREVSTWMRSPTNYEDSPQMPFIATRGY  285

Query  217  NSILWYRGYAEGT  229
            +S++W R Y+  T
Sbjct  286  DSVVWSRLYSRET  298


>gi|58270466|ref|XP_572389.1| hypothetical protein [Cryptococcus neoformans var. neoformans 
JEC21]
 gi|50254959|gb|EAL17699.1| hypothetical protein CNBL2140 [Cryptococcus neoformans var. neoformans 
B-3501A]
 gi|57228647|gb|AAW45082.1| conserved hypothetical protein [Cryptococcus neoformans var. 
neoformans JEC21]
Length=385

 Score = 96.7 bits (239),  Expect = 8e-19, Method: Composition-based stats.
 Identities = 70/232 (30%), Positives = 105/232 (45%), Gaps = 35/232 (15%)

Query  21   AEKKTLKQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDV  80
            A +   +QR+VA+GD+HGD     K+LQ+  II+ +  W+    IL+Q GDI+DRG    
Sbjct  41   AGEAAYRQRLVAVGDLHGDIDNAKKTLQMARIIDDDSKWVASTDILVQTGDIVDRGA---  97

Query  81   TYGDEDSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKK  140
             Y D+     I  L + L+ QA    G V+ I+GNHE++N  G + Y +K  I +FG  K
Sbjct  98   -YADD-----IYRLMQSLRGQAASQGGKVVSILGNHEVMNAIGDWRYVTKGDIARFGGTK  151

Query  141  KRKKFYKPGNKMAISMTKLFK--AIYKIGPW------IFVHGGIRP-YLSKKYTIDYINK  191
             R+        +       +   A+  I P+       F HG +RP Y +       IN 
Sbjct  152  SRQHALSAEGWLGQEWLANYSTTALVPISPYPSSPTFSFTHGSLRPSYANLTPYPAAIND  211

Query  192  TMHEYLSGNINLE----------------DSKEFQELFLDENSILWYRGYAE  227
              H  L+  +                    + E  +L+  E   LW+RG AE
Sbjct  212  LGHSLLTKALTPPMAPPYPPNPYSGLPKGTTHEEADLYA-EGGPLWWRGLAE  262


>gi|58743497|gb|AAW81738.1|  Putative [Brassica oleracea]
Length=394

 Score = 96.7 bits (239),  Expect = 8e-19, Method: Composition-based stats.
 Identities = 44/124 (35%), Positives = 81/124 (65%), Gaps = 12/124 (9%)

Query  29   RIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDEDSE  88
            R+VAIGD+HGD   + ++ ++  +I+ +  W G +T+++Q+GD+LDRGG          E
Sbjct  55   RLVAIGDLHGDLEKSKEAFRIAGLIDSSDRWTGGSTVVVQVGDLLDRGG---------DE  105

Query  89   IKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKFGTKKKRKKFYKP  148
            +KI++  +RLK++A +  G V+ + GNHE++N++G + + +K G+++F   +    +Y  
Sbjct  106  LKILFFLERLKREAEREGGKVVTMNGNHEIMNVEGDFRFVTKEGLEEF---RVWSDWYSL  162

Query  149  GNKM  152
            GNKM
Sbjct  163  GNKM  166


>gi|6714305|gb|AAF26001.1|AC013354_20  F15H18.4 [Arabidopsis thaliana]
Length=1702

 Score = 96.3 bits (238),  Expect = 1e-18, Method: Composition-based stats.
 Identities = 61/244 (25%), Positives = 120/244 (49%), Gaps = 59/244 (24%)

Query  28   QRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDEDS  87
            +R+VAIGD+HGD   + ++ ++  +I+ +  W G +T+++Q+GD+LDRGG          
Sbjct  54   ERLVAIGDLHGDLEKSREAFKIAGLIDSSDRWTGGSTMVVQVGDVLDRGGE---------  104

Query  88   EIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSKLGIKKF---------GT  138
            E+KI+Y  ++LK++A +  G ++ + GNHE++N++G + Y +K G+++F         G 
Sbjct  105  ELKILYFLEKLKREAERAGGKILTMNGNHEIMNIEGDFRYVTKKGLEEFQIWADWYCLGN  164

Query  139  KKK---------------------------------RKKFYKPGNKMAISMTKLFKAIYK  165
            K K                                 R    +P   +A       + +  
Sbjct  165  KMKTLCSGLDKPKDPYEGIPMSFPRMRADCFEGIRARIAALRPDGPIAKRFLTKNQTVAV  224

Query  166  IGPWIFVHGGIRPYLSK--KYTIDYINKTMHEYLSGNINLEDSKEFQELFLDENSILWYR  223
            +G  +FVHGG+   L++  +Y ++ IN+ +  +++G    +  +         NS++W R
Sbjct  225  VGDSVFVHGGL---LAEHIEYGLERINEEVRGWING---FKGGRYAPAYCRGGNSVVWLR  278

Query  224  GYAE  227
             ++E
Sbjct  279  KFSE  282



BLAST against Swissprot:

BLASTP 2.2.15 [Oct-15-2006]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman 
(1997), "Gapped BLAST and PSI-BLAST: a new generation of 
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei 
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and 
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST 
protein database searches with composition-based statistics 
and other refinements", Nucleic Acids Res. 29:2994-3005.

RID: 1164807707-17229-185737545769.BLASTQ4


Database: Non-redundant SwissProt sequences
           217,875 sequences; 82,042,039 total letters

Query= ORF_VL16600 translation [156-851 indirect strand] Length=232 


                                                                   Score     E
Sequences producing significant alignments:                        (Bits)  Value

gi|82000198|sp|Q5UQJ8|YR398_MIMIV  Uncharacterized protein R398    45.8    1e-04
gi|732181|sp|P40152|YNV7_YEAST  Putative metallophosphoesterase Y  43.5    6e-04  Gene info
gi|417746|sp|P32945|PPQ1_YEAST  Serine/threonine-protein phosphat  37.0    0.059  Gene info
gi|2499734|sp|P78968|PPZ_SCHPO  Serine/threonine-protein phosphat  34.7    0.26 
gi|34921744|sp|Q8D3I0|APAH_WIGBR  Bis(5'-nucleosyl)-tetraphosp...  33.1    0.77   Gene info
gi|76800646|sp|Q9Y4K1|AIM1_HUMAN  Absent in melanoma 1 protein     32.7    1.2  
gi|1723220|sp|Q10145|YAS9_SCHPO  Hypothetical RNA-binding protein  32.3    1.2  
gi|130792|sp|P03772|PP_LAMBD  Serine/threonine-protein phosphatas  32.3    1.4    Gene info
gi|135820|sp|P20178|THS1_ARAHY  Stilbene synthase 1 (Resveratr...  32.3    1.4  
gi|135821|sp|P20077|THS2_ARAHY  Stilbene synthase 2 (Resveratr...  32.3    1.5  
gi|417439|sp|P23287|PP2B1_YEAST  Serine/threonine-protein phos...  30.8    3.5    Gene info
gi|1729958|sp|P51069|THS3_ARAHY  Stilbene synthase 3 (Resverat...  30.8    3.8  
gi|2499738|sp|P55799|PRP2_ECOLI  Serine/threonine-protein phospha  30.0    5.8  
gi|12644128|sp|P15795|TOXR_VIBCH  Cholera toxin transcriptional a  30.0    6.3  
gi|76363299|sp|Q8ZMH3|PRP2_SALTY  Serine/threonine-protein phosph  30.0    6.8  
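
The E values in this table depend on the bit score and on the size of the search. As a rough sanity check, the uncorrected Karlin-Altschul relation E ≈ m · n · 2^(−S') can be evaluated with the query length (232) and the database size (82,042,039 letters) printed in the search header. This is only a sketch under those assumptions: BLAST itself uses effective (edge-corrected) lengths and composition-based statistics, so its reported values differ, and the function name `approx_evalue` is illustrative, not part of any BLAST tool.

```python
def approx_evalue(bit_score, query_len, db_letters):
    """Uncorrected Karlin-Altschul estimate: E = m * n * 2**(-S')."""
    return query_len * db_letters * 2.0 ** (-bit_score)


QUERY_LEN = 232            # "Length=232" in the search header
DB_LETTERS = 82_042_039    # "82,042,039 total letters" in the database line

# Bit scores copied from the first three rows of the hit table.
for name, bits in [("YR398_MIMIV", 45.8), ("YNV7_YEAST", 43.5), ("PPQ1_YEAST", 37.0)]:
    estimate = approx_evalue(bits, QUERY_LEN, DB_LETTERS)
    print(f"{name}: {bits} bits -> E ~ {estimate:.1e}")
```

The estimates come out within a factor of a few of the reported values and are somewhat larger, which is the expected direction since the effective search space BLAST uses is smaller than the raw m · n product.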


>gi|82000198|sp|Q5UQJ8|YR398_MIMIV  Uncharacterized protein R398
Length=252

 Score = 45.8 bits (107),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 37/158 (23%), Positives = 70/158 (44%), Gaps = 29/158 (18%)

Query  94   LFKRLKKQAIKNKGNVICIIGNHEMLNLQGVYDYCSklgikkfgtkkkrkkfykpgnkMA  153
             F  +  +A K+ G V  ++GNHE++N QG +DY S      F       + Y      +
Sbjct  3    FFDMMHNKASKHGGAVYSLLGNHELMNTQGNFDYVSYENYHNFDYDSPSGEKYTG----S  58

Query  154  ISMTKLFK--------------AIYKIGPWIFVHGGIRPYLSKKY---------TIDYIN  190
            +    +FK              ++  IG  +F H G+ P L++K           ++Y+N
Sbjct  59   LGRQNVFKPGSNFVKKMACNRLSVLVIGSTMFTHAGVLPVLARKLDKLDLDSNKKLEYLN  118

Query  191  KTMHEYLSGNINLEDSKEFQELFLDENSI--LWYRGYA  226
              + ++L   ++ +  +E++ LF+++  I   W R Y 
Sbjct  119  MIVRKWLLNKLSGKQDEEYKSLFINDTKISPFWNRIYG  156


>gi|732181|sp|P40152|YNV7_YEAST  Putative metallophosphoesterase YNL217W precursor
Length=326

 Score = 43.5 bits (101),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 26/94 (27%), Positives = 48/94 (51%), Gaps = 22/94 (23%)

Query  26   LKQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDE  85
            L +  V +GDVHG+Y       + + +I+  +  +G+N  +I +GD + +G         
Sbjct  59   LNKEYVFVGDVHGNYD------EFIELIDDKIGGLGENITMILLGDFIHKG--------P  104

Query  86   DSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEML  119
            DS+  + Y+        + +K  V C++GNHE+L
Sbjct  105  DSDKVVSYI--------LNHKDQVKCVLGNHEIL  130


>gi|417746|sp|P32945|PPQ1_YEAST  Serine/threonine-protein phosphatase PPQ
Length=549

 Score = 37.0 bits (84),  Expect = 0.059, Method: Composition-based stats.
 Identities = 43/177 (24%), Positives = 71/177 (40%), Gaps = 51/177 (28%)

Query  26   LKQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDE  85
            L+  I  +GDVHG +   ++ L+        L+ +  +T  + +GD +DRG   +     
Sbjct  292  LQAPIKVVGDVHGQFNDLLRILK--------LSGVPSDTNYLFLGDYVDRGKNSL-----  338

Query  86   DSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLN---LQGVYDYCSklgikkfgtkkkr  142
               I ++  +K      IK K N   + GNHE  N   + G YD C              
Sbjct  339  -ETILLLLCYK------IKYKDNFFMLRGNHESANVTKMYGFYDECK-------------  378

Query  143  kkfykpgnkMAISMTKLFKAIYKIGPW-------IF-VHGGIRPYLSKKYTIDYINK  191
                    +++  + K+F  ++   P        IF VHGGI P L     I+ + +
Sbjct  379  -------RRLSSKVWKMFVDVFNTLPLAAIIQDKIFCVHGGISPDLHDMKQIEKVAR  428


>gi|2499734|sp|P78968|PPZ_SCHPO  Serine/threonine-protein phosphatase PP-Z
Length=515

 Score = 34.7 bits (78),  Expect = 0.26, Method: Composition-based stats.
 Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 23/99 (23%)

Query  33   IGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDEDSEIKII  92
            +GDVHG Y   I+  ++             ++  + +GD +DRG + +        I ++
Sbjct  246  VGDVHGQYSDLIRLFEMCG--------FPPSSNYLFLGDYVDRGKQSL------ETILLL  291

Query  93   YLFKRLKKQAIKNKGNVICIIGNHEMLNLQ---GVYDYC  128
            +L+K      I+   N   + GNHE  N+    G YD C
Sbjct  292  FLYK------IRYPENFFLLRGNHECANITRVYGFYDEC  324


>gi|34921744|sp|Q8D3I0|APAH_WIGBR  Bis(5'-nucleosyl)-tetraphosphatase, symmetrical (Diadenosine 
tetraphosphatase) (Ap4A hydrolase) (Diadenosine 5',5'''-P1,P4-tetraphosphate 
pyrophosphohydrolase)
Length=272

 Score = 33.1 bits (74),  Expect = 0.77, Method: Composition-based stats.
 Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 23/86 (26%)

Query  33   IGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDVTYGDEDSEIKII  92
            IGD+HG Y +  KS  ++++IN NL    KN I+   GD + RG            +K++
Sbjct  6    IGDIHGCY-SEFKS--MLDLINFNL----KNDIIWIAGDFIGRG---------PDSLKVL  49

Query  93   YLFKRLKKQAIKNKGNVICIIGNHEM  118
             L  +LK+       N+  ++GNHE+
Sbjct  50   RLIYKLKR-------NIFVVLGNHEI  68


>gi|76800646|sp|Q9Y4K1|AIM1_HUMAN  Absent in melanoma 1 protein
Length=1723

 Score = 32.7 bits (73),  Expect = 1.2, Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 5/62 (8%)

Query  174   GGIRPYLSKKYTIDYINKTMHEYLSGNINLEDSKEFQELFLDE---NSILWYRGYAEGTI  230
             G +RP++ K+      NK    ++S N NLED K  +   +++   +  +W   Y EG I
Sbjct  1578  GSLRPFVQKRIYFRLRNKATGLFMSTNGNLEDLKLLRIQVMEDVGADDQIWI--YQEGCI  1635

Query  231   NC  232
              C
Sbjct  1636  KC  1637


>gi|1723220|sp|Q10145|YAS9_SCHPO  Hypothetical RNA-binding protein C3H8.09c in chromosome I
Length=738

 Score = 32.3 bits (72),  Expect = 1.2, Method: Composition-based stats.
 Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 9/39 (23%)

Query  195  EYLSGN-------INLEDSKEFQELFLDENSIL--WYRG  224
            EYL  N       +NLED + FQ+   DE SI+  WY G
Sbjct  319  EYLGNNAAEKSLQMNLEDEQRFQQFLKDEESIMSNWYPG  357


>gi|130792|sp|P03772|PP_LAMBD  Serine/threonine-protein phosphatase
Length=221

 Score = 32.3 bits (72),  Expect = 1.4, Method: Composition-based stats.
 Identities = 17/53 (32%), Positives = 27/53 (50%), Gaps = 7/53 (13%)

Query  28  QRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDV  80
           + I  +GD+HG Y   +  L  +   NK         +LI +GD++DRG  +V
Sbjct  13  RNIWVVGDLHGCYTNLMNKLDTIGFDNKK-------DLLISVGDLVDRGAENV  58
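
The "Identities" and "Gaps" figures in each alignment header are plain column counts over the aligned rows. The sketch below recounts them for the lambda phosphatase alignment just above, using the Query and Sbjct rows copied verbatim; the helper name `alignment_counts` is hypothetical. "Positives" is left out because it needs the substitution matrix. Percentages use integer division because the reports on this page appear to truncate rather than round (for example 87/148 is printed as 58%).

```python
def alignment_counts(query_row, sbjct_row):
    """Return (identities, gaps, columns) for two equal-length aligned rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(query_row.upper(), sbjct_row.upper()))
    # A column is an identity when both residues match and neither is a gap.
    identities = sum(q == s and q != "-" for q, s in pairs)
    # A column is a gap when either row holds a '-' character.
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return identities, gaps, len(pairs)


# Rows copied from the P03772 (PP_LAMBD) alignment.
QUERY = "QRIVAIGDVHGDYVATIKSLQLVNIINKNLNWIGKNTILIQMGDILDRGGRDV"
SBJCT = "RNIWVVGDLHGCYTNLMNKLDTIGFDNKK-------DLLISVGDLVDRGAENV"

ident, gaps, cols = alignment_counts(QUERY, SBJCT)
print(f"Identities = {ident}/{cols} ({100 * ident // cols}%), "
      f"Gaps = {gaps}/{cols} ({100 * gaps // cols}%)")
```

Comparison is case-insensitive because some reports on this page show low-complexity query residues in lower case.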


>gi|135820|sp|P20178|THS1_ARAHY  Stilbene synthase 1 (Resveratrol synthase 1) (RS1) (Trihydroxystilbene 
synthase 1)
Length=389

 Score = 32.3 bits (72),  Expect = 1.4, Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 1/50 (2%)

Query  173  HGGIRPYLSKKYTIDYINKTMHEYLSGNINLEDSKEFQELFL-DENSILW  221
            HG I   L +     Y+NK++ + +S NIN   SK F  L + D NSI W
Sbjct  251  HGAIGGLLREVGLTFYLNKSVPDIISQNINGALSKAFDPLGISDYNSIFW  300


>gi|135821|sp|P20077|THS2_ARAHY  Stilbene synthase 2 (Resveratrol synthase 2) (RS2) (Trihydroxystilbene 
synthase 2)
Length=313

 Score = 32.3 bits (72),  Expect = 1.5, Method: Composition-based stats.
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 1/50 (2%)

Query  173  HGGIRPYLSKKYTIDYINKTMHEYLSGNINLEDSKEFQELFL-DENSILW  221
            HG I   L +     Y+NK++ + +S NIN   SK F  L + D NSI W
Sbjct  175  HGAIGGLLREVGLTFYLNKSVPDIISQNINDALSKAFDPLGISDYNSIFW  224

ORF finding

Using the SMS ORF finder with the "any codon" start option, we found no open 
reading frame on the direct strand.

Indirect strand: 

No ORFs were found in reading frame 1.

No ORFs were found in reading frame 2.

>ORF number 1 in reading frame 3 on the reverse strand extends from base 156 to base 851.
ACTATAATATATATCTATGATATTATATATATTATAGATATGTCAAAAGTTATCAATGTA
GCCGAGAAAAAAACTCTAAAGCAACGAATTGTAGCTATTGGCGATGTACATGGAGACTAT
GTAGCCACTATAAAATCTTTACAATTAGTAAATATAATCAATAAAAATTTAAATTGGATT
GGTAAAAATACCATTCTTATTCAAATGGGTGATATTTTAGATAGAGGAGGCCGTGATGTT
ACATATGGTGATGAAGACTCGGAAATAAAAATAATTTATTTATTCAAAAGATTAAAAAAA
CAAGCGATAAAAAATAAGGGTAATGTAATATGTATTATTGGTAATCATGAAATGTTAAAT
TTACAAGGCGTATATGATTATTGTAGTAAATTAGGCATCAAAAAATTTGGTACAAAAAAA
AAAAGAAAGAAATTTTATAAACCAGGTAATAAGATGGCTATTTCAATGACTAAACTTTTT
AAAGCAATCTACAAGATTGGGCCATGGATTTTTGTACATGGTGGAATAAGACCTTATCTT
TCTAAAAAATATACCATAGACTATATCAATAAAACTATGCATGAATATTTGTCTGGAAAT
ATAAATCTTGAAGATAGTAAAGAGTTTCAGGAATTATTTTTAGATGAAAATAGTATATTA
TGGTACAGAGGATATGCAGAGGGAACTATAAATTGT

>Translation of ORF number 1 in reading frame 3 on the reverse strand.
TIIYIYDIIYIIDMSKVINVAEKKTLKQRIVAIGDVHGDYVATIKSLQLVNIINKNLNWI
GKNTILIQMGDILDRGGRDVTYGDEDSEIKIIYLFKRLKKQAIKNKGNVICIIGNHEMLN
LQGVYDYCSKLGIKKFGTKKKRKKFYKPGNKMAISMTKLFKAIYKIGPWIFVHGGIRPYL
SKKYTIDYINKTMHEYLSGNINLEDSKEFQELFLDENSILWYRGYAEGTINC
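
The search reported above can be reproduced in outline with a few lines of Python. This is a minimal sketch, not the SMS implementation: it assumes the standard genetic code and treats every stop-free stretch as an open frame (the "any codon" start option). Coordinates are counted on the reverse-complemented sequence, which is consistent with base 156 falling in frame 3 (156 = 3 + 3 × 51). The function names and the toy sequence are illustrative; the toy is the first 60 bases of the ORF with two padding bases and a stop codon added, whereas the real ORF is incomplete at both ends.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def open_frames(strand, min_codons=30):
    """List (frame, start, end, protein) for stop-free stretches, 1-based."""
    strand = strand.upper()
    found = []
    for frame in range(3):
        protein = "".join(CODON_TABLE.get(strand[i:i + 3], "X")
                          for i in range(frame, len(strand) - 2, 3))
        codon_index = 0
        for segment in protein.split("*"):
            if len(segment) >= min_codons:
                start = frame + 3 * codon_index + 1
                end = start + 3 * len(segment) - 1
                found.append((frame + 1, start, end, segment))
            codon_index += len(segment) + 1  # skip the stop codon as well
    return found


# First 60 bases of the ORF listed above; padding and stop are added for the demo.
ORF_START = "ACTATAATATATATCTATGATATTATATATATTATAGATATGTCAAAAGTTATCAATGTA"
toy_reverse = "GG" + ORF_START + "TAA"
toy_genomic = reverse_complement(toy_reverse)  # what the direct strand would look like

for frame, start, end, protein in open_frames(reverse_complement(toy_genomic), min_codons=20):
    print(f"frame {frame} on the reverse strand, bases {start}-{end}: {protein}")
```

Running the same two functions on the full genomic sequence with a suitable `min_codons` threshold should list the frame 3 stretch from base 156 to base 851 among the reverse-strand results.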