GOS 1509010
From Metagenes
| Warning: this metagenomic sequence has been carefully annotated by students during bioinformatics assignments. These quality annotations are therefore the result of a teaching exercise that you are most welcome to amend and extend if necessary! |
| Sequence | |||
|---|---|---|---|
| CAMERA AccNum : | JCVI_READ_1091143176892 | ||
| Annotathon code: | GOS_1509010 | ||
| Sample : |
| ||
| Authors | |||
| Team : | Algarve | ||
| Username : | BioinfCMJ1 | ||
| Annotated on : | 2010-07-06 17:10:15
| ||
Contents |
Synopsis
- Gene symbol: Unknown gene symbol
- Biological Process: unknown biological process
- Molecular Function: unknown molecular function
- Taxonomy: unknown organism (NCBI info)
Genomic Sequence
>JCVI_READ_1091143176892 GOS_1509010 Genomic DNA TGAGCCAGATTCTACACCCTTGGCGGAGATCGCATCGGAAGCAATGCGCAATGCACCATATATTAGGCATGACTCGGCCTTACATACGCTACCAATAGTA ACGAAGAAAGAAATTGGAAAACTATTAAAAGACAAAGAATGTTTGGACTTGGATCCAGTAGATCTTCTACCATTTTTTGTAAAACTATTTCGTGTGAATC GGAACAAGGCATTACATTATATTAGGGACGAGGAAGAGGCAAGTCTTAGTGCACAGGATATTGGTGGTTTGTTGCAACTTTTAACTCAATACTCGGAAGC CTCTATTGCTAAATTGGTAAAATTTTTGGATGTAAATTGTGATGAACAGATCAGTCTAGACGAATCACTTCATTTTTTCCTGGAAGATTCAAGGCTAAAA CTTAGAACAGAAATCAATCGTACTTTCCACATGAAATTAATGATGTCAAAAAAAGTTGGTAAGTCAGTTTATTCCAGAGCGTTTTCCATCAAACCATTAA GCAAAAATTCAACCATACCTTTTGGGATTGCTACGGCGAGCAATGACGGGGTTGTTCGTCTGTGGAGCAGTGGACTGTTACTAAAGTTTAGGCTGCTGGT TACTAGGTCTAAAGACTTGTTTGGGTATCAATACGAATCTAGTGGTGCAAAACTAAAGAGAGTTCTGCCAACTGATGTCGCAGAAATAAAATTAGATGTT AATGGTATGAAATCATCAATTGTTATTGGTTACACAGATCGTAAAATTAGAATATTTCATCTTCCTAATTCTTTTGGAGTTGCAGCCACAGTGACCCCAA GCAAAAAACTGGCAGTAAGAGGAATACCTATGGTGATTGATAGTCATGAGGACTTACTATTTATTGGAGACAATGATGGCTTCATTGAAGTATATAATAC CAAAAATTGTCTGGAAAAATTGCATACCGTTCAAGCGCACATACATACTCCCGTTAGACAATTAAAGTTTGTGCCGTTCCTGAATGCAATAGTGTCTCAG ATTAGACGGAGTTTAATTATAA
Translation
[2 - 1021/1022] direct strand
>GOS_1509010 Translation [2-1021 direct strand] EPDSTPLAEIASEAMRNAPYIRHDSALHTLPIVTKKEIGKLLKDKECLDLDPVDLLPFFVKLFRVNRNKALHYIRDEEEASLSAQDIGGLLQLLTQYSEA SIAKLVKFLDVNCDEQISLDESLHFFLEDSRLKLRTEINRTFHMKLMMSKKVGKSVYSRAFSIKPLSKNSTIPFGIATASNDGVVRLWSSGLLLKFRLLV TRSKDLFGYQYESSGAKLKRVLPTDVAEIKLDVNGMKSSIVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYNT KNCLEKLHTVQAHIHTPVRQLKFVPFLNAIVSQIRRSLII[ Warning ] 5' incomplete: does not start with a Methionine
[ Warning ] 3' incomplete: following codon is not a STOP
Annotator commentaries
It was not found any reliable homology with any known protein domain, so we can't correlate the sequence to any known organism or family of organisms neither deduce anything about the protein molecular function or biological process.
ORF finding
PROTOCOL
a) SMS ORFinder / forward strand / frames 1, 2 & 3 / min 60 AA / 'any codon' initiation / 'standard' genetic code
b) SMS ORFinder / reverse strand / frames 1, 2 3 / min 60 AA / 'any codon' initiation / 'standard' genetic code
RESULTS ANALYSIS
In the forward strand, there are 3 frames. No ORFs were found in reading frame 1 and 3. In the reading frame 2 an ORF were found on the direct strand extends from base 2 to base 1021, this ORF starts with a GAG Glutamic acid (E) and it ends with ATA Isoleucine (I).
In the reverse strand, there are 3 frames. No ORFs were found in reading frame 1 and 2. In the reading frame 3 an ORF were found on the direct strand extends from base 807 to base 1022, this ORF starts with TGT Cysteine (C) and it ends with TCA Serine (S).
The ORF selected for further studies was the one in reading frame 2 on the forward strand.
RAW RESULTS a) forward strand No ORFs were found in reading frame 1. >ORF number 1 in reading frame 2 on the direct strand extends from base 2 to base 1021. GAGCCAGATTCTACACCCTTGGCGGAGATCGCATCGGAAGCAATGCGCAATGCACCATAT ATTAGGCATGACTCGGCCTTACATACGCTACCAATAGTAACGAAGAAAGAAATTGGAAAA CTATTAAAAGACAAAGAATGTTTGGACTTGGATCCAGTAGATCTTCTACCATTTTTTGTA AAACTATTTCGTGTGAATCGGAACAAGGCATTACATTATATTAGGGACGAGGAAGAGGCA AGTCTTAGTGCACAGGATATTGGTGGTTTGTTGCAACTTTTAACTCAATACTCGGAAGCC TCTATTGCTAAATTGGTAAAATTTTTGGATGTAAATTGTGATGAACAGATCAGTCTAGAC GAATCACTTCATTTTTTCCTGGAAGATTCAAGGCTAAAACTTAGAACAGAAATCAATCGT ACTTTCCACATGAAATTAATGATGTCAAAAAAAGTTGGTAAGTCAGTTTATTCCAGAGCG TTTTCCATCAAACCATTAAGCAAAAATTCAACCATACCTTTTGGGATTGCTACGGCGAGC AATGACGGGGTTGTTCGTCTGTGGAGCAGTGGACTGTTACTAAAGTTTAGGCTGCTGGTT ACTAGGTCTAAAGACTTGTTTGGGTATCAATACGAATCTAGTGGTGCAAAACTAAAGAGA GTTCTGCCAACTGATGTCGCAGAAATAAAATTAGATGTTAATGGTATGAAATCATCAATT GTTATTGGTTACACAGATCGTAAAATTAGAATATTTCATCTTCCTAATTCTTTTGGAGTT GCAGCCACAGTGACCCCAAGCAAAAAACTGGCAGTAAGAGGAATACCTATGGTGATTGAT AGTCATGAGGACTTACTATTTATTGGAGACAATGATGGCTTCATTGAAGTATATAATACC AAAAATTGTCTGGAAAAATTGCATACCGTTCAAGCGCACATACATACTCCCGTTAGACAA TTAAAGTTTGTGCCGTTCCTGAATGCAATAGTGTCTCAGATTAGACGGAGTTTAATTATA >Translation of ORF number 1 in reading frame 2 on the direct strand. EPDSTPLAEIASEAMRNAPYIRHDSALHTLPIVTKKEIGKLLKDKECLDLDPVDLLPFFV KLFRVNRNKALHYIRDEEEASLSAQDIGGLLQLLTQYSEASIAKLVKFLDVNCDEQISLD ESLHFFLEDSRLKLRTEINRTFHMKLMMSKKVGKSVYSRAFSIKPLSKNSTIPFGIATAS NDGVVRLWSSGLLLKFRLLVTRSKDLFGYQYESSGAKLKRVLPTDVAEIKLDVNGMKSSI VIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYNT KNCLEKLHTVQAHIHTPVRQLKFVPFLNAIVSQIRRSLII No ORFs were found in reading frame 3. b) reverse strand No ORFs were found in reading frame 1. No ORFs were found in reading frame 2. >ORF number 1 in reading frame 3 on the reverse strand extends from base 807 to base 1022. TGTAATGCCTTGTTCCGATTCACACGAAATAGTTTTACAAAAAATGGTAGAAGATCTACT GGATCCAAGTCCAAACATTCTTTGTCTTTTAATAGTTTTCCAATTTCTTTCTTCGTTACT ATTGGTAGCGTATGTAAGGCCGAGTCATGCCTAATATATGGTGCATTGCGCATTGCTTCC GATGCGATCTCCGCCAAGGGTGTAGAATCTGGCTCA >Translation of ORF number 1 in reading frame 3 on the reverse strand. CNALFRFTRNSFTKNGRRSTGSKSKHSLSFNSFPISFFVTIGSVCKAESCLIYGALRIAS DAISAKGVESGS
Multiple Alignement
PROTOCOL
RESULTS ANALYSIS
We can't make the multiple alignement because the E-value is very high (0,008), which will give us inconclusive results.
RAW RESULTS
Protein Domains
PROTOCOL
InterProScan, default parameters at EBI
RESULTS ANALYSIS
The InterPro Scan analysis gaves only one result for the translation of ORF number 1, WD40 repeat-like-containing domain and the superfamily is WD40 repeat-like.
WD40 are known to serve as mediator or plattaform for assembly for protein-protein interaction.
RAW RESULTS Sequence_1 8E90BE6F38F8DB2B 340 superfamily SSF50978 WD40 repeat-like 176 332 2.7e-07 T 15-Mar-2010 IPR011046 WD40 repeat-like-containing domain
Phylogeny
PROTOCOL
RESULTS ANALYSIS
We can't make the taxonomy report or the multiple alignement because the E-value is very high (0,008), which makes the results inconclusive. So we don't have any results to build the phylogenetic tree.
RAW RESULTS
Taxonomy report
PROTOCOL
BLASTp versus NR, NCBI default parameters apart from "Number of descriptions_1000"
RESULTS ANALYSIS
We can say that is a Fungus, but we can't make the taxonomy report because the E-value is very high (0,008) which make the results inconclusive.
RAW RESULTS Lineage Report root . cellular organisms . . Eukaryota [eukaryotes] . . . Fungi/Metazoa group [eukaryotes] . . . . Fungi [fungi] . . . . . Dikarya [fungi] . . . . . . Ascomycota [ascomycetes] . . . . . . . Saccharomyceta [ascomycetes] . . . . . . . . Leotiomyceta [ascomycetes] . . . . . . . . . Eurotiomycetidae [ascomycetes] . . . . . . . . . . Uncinocarpus reesii 1704 ---------------- 45 2 hits [ascomycetes] chromatin assembly factor 1 subunit C [Uncinocarpus reesii . . . . . . . . . . Aspergillus terreus NIH2624 ............. 37 2 hits [ascomycetes] conserved hypothetical protein [Aspergillus terreus NIH2624 . . . . . . . . . . Neosartorya fischeri NRRL 181 ........... 36 2 hits [ascomycetes] F-box and WD40 domain protein, putative [Neosartorya fische . . . . . . . . . Podospora anserina DSM 980 ---------------- 36 1 hit [ascomycetes] unnamed protein product [Podospora anserina] >gi|170937597| . . . . . . . . . Podospora anserina ........................ 36 1 hit [ascomycetes] unnamed protein product [Podospora anserina] >gi|170937597| . . . . . . . . Vanderwaltozyma polyspora DSM 70294 --------- 36 2 hits [ascomycetes] hypothetical protein Kpol_1048p59 [Vanderwaltozyma polyspor . . . . . . . . Zygosaccharomyces rouxii CBS 732 ............ 35 1 hit [ascomycetes] ZYRO0F07282p [Zygosaccharomyces rouxii] >gi|238940405|emb|C . . . . . . . . Zygosaccharomyces rouxii .................... 35 1 hit [ascomycetes] ZYRO0F07282p [Zygosaccharomyces rouxii] >gi|238940405|emb|C . . . . . . . Schizosaccharomyces pombe --------------------- 38 6 hits [ascomycetes] WD repeat protein Swd3 [Schizosaccharomyces pombe] >gi|7467 . . . . . . Cryptococcus neoformans var. neoformans B-3501A - 36 2 hits [basidiomycetes] hypothetical protein CNBA0610 [Cryptococcus neoformans var. . . . . . . Cryptococcus neoformans var. neoformans JEC21 ... 36 2 hits [basidiomycetes] nuclear mRNA splicing protein [Cryptococcus neoformans var. . . . . . Nosema ceranae BRL01 ------------------------------ 36 1 hit [microsporidians] hypothetical protein NCER_102316 [Nosema ceranae BRL01] . . . . Monosiga brevicollis MX1 ---------------------------- 41 2 hits [choanoflagellates] hypothetical protein [Monosiga brevicollis MX1] >gi|1637749 . . . . Caenorhabditis briggsae ............................. 41 3 hits [nematodes] RecName: Full=Serine/threonine kinase NLK; AltName: Full=Ne . . . . Drosophila sechellia ................................ 37 4 hits [flies] GM21242 [Drosophila sechellia] >gi|194125277|gb|EDW47320.1| . . . . Drosophila melanogaster ............................. 37 4 hits [flies] RE21021p [Drosophila melanogaster] . . . . Bos taurus (cow) .................................... 36 1 hit [even-toed ungulates] PREDICTED: similar to raptor [Bos taurus] . . . . Drosophila erecta ................................... 36 2 hits [flies] GG17057 [Drosophila erecta] >gi|190651980|gb|EDV49235.1| GG . . . . Homo sapiens (man) .................................. 36 10 hits [primates] regulatory-associated protein of mTOR isoform 2 [Homo sapie . . . . Equus caballus (equine) ............................. 36 1 hit [odd-toed ungulates] PREDICTED: similar to raptor [Equus caballus] . . . . Pan troglodytes ..................................... 36 2 hits [primates] PREDICTED: raptor isoform 1 [Pan troglodytes] . . . . Ailuropoda melanoleuca .............................. 36 1 hit [carnivores] hypothetical protein PANDA_010765 [Ailuropoda melanoleuca] . . . . Macaca mulatta (rhesus macaque) ..................... 36 2 hits [primates] PREDICTED: similar to raptor isoform 2 [Macaca mulatta] . . . . Canis lupus familiaris (dogs) ....................... 36 1 hit [carnivores] PREDICTED: similar to raptor [Canis familiaris] . . . . Mus musculus (mouse) ................................ 36 12 hits [rodents] unnamed protein product [Mus musculus] . . . . Pediculus humanus corporis (human body lice) ........ 36 2 hits [lice] WD-repeat protein, putative [Pediculus humanus corporis] >g . . . . Monodelphis domestica ............................... 36 1 hit [marsupials] PREDICTED: hypothetical protein [Monodelphis domestica] . . . . Drosophila simulans ................................. 36 4 hits [flies] GD20503 [Drosophila simulans] >gi|194199499|gb|EDX13075.1| . . . . Gallus gallus (bantam) .............................. 36 1 hit [birds] PREDICTED: similar to p150 target of rapamycin (TOR)-scaffo . . . . Drosophila grimshawi ................................ 35 2 hits [flies] GH24936 [Drosophila grimshawi] >gi|193893598|gb|EDV92464.1| . . . . Rattus norvegicus (brown rat) ....................... 35 1 hit [rodents] similar to p150 target of rapamycin (TOR)-scaffold protein . . . Trypanosoma brucei gambiense DAL972 ------------------- 43 1 hit [kinetoplastids] hypothetical protein, conserved [Trypanosoma brucei gambien . . . Trypanosoma brucei TREU927 ............................ 43 1 hit [kinetoplastids] hypothetical protein [Trypanosoma brucei TREU927] >gi|70834 . . . Trypanosoma brucei .................................... 43 1 hit [kinetoplastids] hypothetical protein [Trypanosoma brucei TREU927] >gi|70834 . . . Paramecium tetraurelia strain d4-2 .................... 39 3 hits [ciliates] hypothetical protein [Paramecium tetraurelia strain d4-2] > . . . Paramecium tetraurelia ................................ 39 3 hits [ciliates] hypothetical protein [Paramecium tetraurelia strain d4-2] > . . . Micromonas pusilla CCMP1545 ........................... 38 1 hit [green algae] predicted protein [Micromonas pusilla CCMP1545] . . . Ostreococcus tauri .................................... 38 1 hit [green algae] putative WD-repeat membrane protein (ISS) [Ostreococcus tau . . . Cryptosporidium muris RN66 ............................ 37 2 hits [apicomplexans] hypothetical protein [Cryptosporidium muris RN66] >gi|20955 . . . Dictyostelium discoideum AX4 .......................... 36 2 hits [cellular slime molds] WD40 repeat-containing protein [Dictyostelium discoideum AX . . . Dictyostelium discoideum .............................. 36 1 hit [cellular slime molds] WD40 repeat-containing protein [Dictyostelium discoideum AX . . . Ostreococcus lucimarinus CCE9901 ...................... 36 2 hits [green algae] predicted protein [Ostreococcus lucimarinus CCE9901] >gi|14 . . Cyanothece sp. CCY0110 ---------------------------------- 37 2 hits [cyanobacteria] beta transducin-like protein [Cyanothece sp. CCY0110] >gi|1 . . Hoeflea phototrophica DFL-43 ............................ 36 2 hits [a-proteobacteria] adenylosuccinate synthetase protein [Hoeflea phototrophica . . Listeria monocytogenes FSL J1-208 ....................... 35 1 hit [firmicutes] 6-phospho-beta-glucosidase [Listeria monocytogenes FSL J1-2 . synthetic construct --------------------------------------- 35 1 hit [other sequences] regulatory-associated protein of mTOR [Mus musculus] >gi|12
BLAST
PROTOCOL
a) BLASTp versus NR, NCBI default parameters apart from "Number of descriptions_1000"
b) BLASTx versus NR, NCBI default parameters apart from "Number of descriptions_1000"
RESULTS ANALYSIS
Since the E values from the BLASTp (0.008) and BLASTx (0.56) are very high, the analysis made from this ORF will have no reliability. The first hit on BLASTp says that it is a chromatin assembly factor 1 subunit C protein so the function of the protein homologous with ours have not been discovered.
RAW RESULTS
a) BLASTp
Score E
Sequences producing significant alignments: (Bits) Value
ref|XP_002544394.1| chromatin assembly factor 1 subunit C [Un... 45.8 0.008
emb|CBH17669.1| hypothetical protein, conserved [Trypanosoma ... 43.5 0.041
ref|XP_828768.1| hypothetical protein [Trypanosoma brucei TRE... 43.5 0.042
ref|XP_001746689.1| hypothetical protein [Monosiga brevicolli... 41.6 0.18
sp|A8XSC1.2|NLK_CAEBR RecName: Full=Serine/threonine kinase N... 41.2 0.22
ref|XP_002642294.1| C. briggsae CBR-LIT-1 protein [Caenorhabd... 40.4 0.33
ref|XP_001432665.1| hypothetical protein [Paramecium tetraure... 39.7 0.58
ref|XP_001434313.1| hypothetical protein [Paramecium tetraure... 39.3 0.77
gb|EEH53906.1| predicted protein [Micromonas pusilla CCMP1545] 38.9 0.95
emb|CAL57758.1| putative WD-repeat membrane protein (ISS) [Os... 38.5 1.3
ref|NP_595227.1| WD repeat protein Swd3 [Schizosaccharomyces ... 38.1 1.6
ref|XP_001213911.1| conserved hypothetical protein [Aspergill... 37.7 2.3
ref|XP_001445784.1| hypothetical protein [Paramecium tetraure... 37.7 2.5
ref|XP_002033307.1| GM21242 [Drosophila sechellia] >gb|EDW473... 37.7 2.6
ref|XP_002139707.1| hypothetical protein [Cryptosporidium mur... 37.4 2.8
ref|ZP_01728004.1| beta transducin-like protein [Cyanothece s... 37.4 3.1
gb|AAL68278.1| RE21021p [Drosophila melanogaster] 37.4 3.4
gb|ACI16532.1| FI03249p [Drosophila melanogaster] 37.4 3.4
ref|NP_610623.1| CG6751 [Drosophila melanogaster] >gb|AAF5873... 37.4 3.5
ref|XP_001646486.1| hypothetical protein Kpol_1048p59 [Vander... 37.0 3.9
gb|EEQ81313.1| hypothetical protein NCER_102316 [Nosema ceran... 37.0 4.3
ref|NP_596478.1| WD repeat protein Lub1 [Schizosaccharomyces ... 37.0 4.3
ref|XP_583606.4| PREDICTED: similar to raptor [Bos taurus] 37.0 4.3
ref|XP_001980277.1| GG17057 [Drosophila erecta] >gb|EDV49235.... 37.0 4.4
ref|XP_001904474.1| unnamed protein product [Podospora anseri... 37.0 4.5
ref|XP_640793.1| WD40 repeat-containing protein [Dictyosteliu... 36.6 4.7
ref|XP_001421820.1| predicted protein [Ostreococcus lucimarin... 36.6 4.9
ref|XP_001263278.1| F-box and WD40 domain protein, putative [... 36.6 4.9
ref|NP_001156506.1| regulatory-associated protein of mTOR iso... 36.6 5.2
ref|XP_001489653.1| PREDICTED: similar to raptor [Equus cabal... 36.6 5.3
ref|XP_001161716.1| PREDICTED: raptor isoform 1 [Pan troglody... 36.6 5.4
ref|XP_778058.1| hypothetical protein CNBA0610 [Cryptococcus ... 36.6 5.4
gb|EFB20820.1| hypothetical protein PANDA_010765 [Ailuropoda ... 36.6 5.6
ref|NP_065812.1| regulatory-associated protein of mTOR isofor... 36.6 5.6
ref|XP_566487.1| nuclear mRNA splicing protein [Cryptococcus ... 36.6 5.8
ref|XP_001110471.1| PREDICTED: similar to raptor isoform 2 [M... 36.6 5.9
ref|XP_850487.1| PREDICTED: similar to raptor [Canis familiaris] 36.6 6.0
dbj|BAB29789.1| unnamed protein product [Mus musculus] 36.2 6.1
ref|XP_002425511.1| WD-repeat protein, putative [Pediculus hu... 36.2 6.1
gb|EDL05868.1| RIKEN cDNA 4930434E21 [Mus musculus] 36.2 6.2
ref|NP_083716.2| hypothetical protein LOC381693 [Mus musculus] 36.2 6.2
gb|EAW89619.1| raptor, isoform CRA_b [Homo sapiens] 36.2 6.4
ref|XP_001370890.1| PREDICTED: hypothetical protein [Monodelp... 36.2 6.6
ref|ZP_02165863.1| adenylosuccinate synthetase protein [Hoefl... 36.2 6.7
ref|XP_001110429.1| PREDICTED: similar to raptor isoform 1 [M... 36.2 6.8
dbj|BAA92541.1| KIAA1303 protein [Homo sapiens] 36.2 6.9
ref|XP_002103572.1| GD20503 [Drosophila simulans] >gb|EDX1307... 36.2 7.0
ref|XP_002080964.1| GD10760 [Drosophila simulans] >gb|EDX0654... 36.2 7.1
ref|XP_511729.2| PREDICTED: raptor isoform 2 [Pan troglodytes] 36.2 7.2
ref|XP_002031336.1| GM25942 [Drosophila sechellia] >gb|EDW423... 36.2 7.4
ref|XP_426232.2| PREDICTED: similar to p150 target of rapamyc... 36.2 7.6
dbj|BAC39857.1| unnamed protein product [Mus musculus] 35.8 8.2
ref|XP_001992757.1| GH24936 [Drosophila grimshawi] >gb|EDV924... 35.8 8.6
ref|ZP_05294624.1| 6-phospho-beta-glucosidase [Listeria monoc... 35.8 8.6
ref|XP_002497512.1| ZYRO0F07282p [Zygosaccharomyces rouxii] >... 35.8 9.1
gb|AAI18972.1| 4932417H02Rik protein [Mus musculus] 35.8 9.5
gb|EDM06799.1| similar to p150 target of rapamycin (TOR)-scaf... 35.8 9.6
gb|EDL34713.1| RIKEN cDNA 4932417H02, isoform CRA_a [Mus musc... 35.8 9.8
ref|NP_083174.2| regulatory-associated protein of mTOR [Mus m... 35.8 9.8
ALIGNMENTS
>ref|XP_002544394.1| chromatin assembly factor 1 subunit C [Uncinocarpus reesii 1704]
gb|EEP79065.1| chromatin assembly factor 1 subunit C [Uncinocarpus reesii 1704]
Length=470
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 78/163 (47%), Gaps = 9/163 (5%)
Query 229 IATASNDGVVRLW-SSGLLLKFRLLVTRSKDLFGYQYESSGAKLKRVLPTDVAEIKLDV- 286
+AT S D VRLW +LL + ++T S+DL Y + K R T + I DV
Sbjct 222 LATGSEDETVRLWFVRSMLLSSKRVLTPSRDLTQYTKGNRALKPVRTY-THHSSIVNDVQ 280
Query 287 -NGMKSSIVIGYTDR---KIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIG 342
+ + SS++ +D +I P++ AA+ T K A+ I + E +L G
Sbjct 281 YHPLHSSLIGTVSDDITLQILDIREPDTSRSAASATGQHKDAINSI-AFNPAAETVLATG 339
Query 343 DNDGFIEVYNTKNCLEKLHTVQAHIHTPVRQLKFVPFLNAIVS 385
D I +++ +N KLH ++ H + V L + PF A+++
Sbjct 340 SADKSIGLWDLRNLKSKLHALECHQDS-VTTLAWHPFEEAVLA 381
>emb|CBH17669.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length=541
Score = 43.5 bits (101), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 7/62 (11%)
Query 293 IVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYN 352
IV G DR IR + + G A PS G+P+ IDS D L +G DG + +++
Sbjct 273 IVTGGADRMIRYW--DSGSGAALQCHPSD-----GVPLCIDSVGDRLIVGCTDGVVRIWD 325
Query 353 TK 354
TK
Sbjct 326 TK 327
>ref|XP_828768.1| hypothetical protein [Trypanosoma brucei TREU927]
gb|EAN79656.1| hypothetical protein, conserved [Trypanosoma brucei]
Length=541
Score = 43.5 bits (101), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 7/62 (11%)
Query 293 IVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYN 352
IV G DR IR + + G A PS G+P+ IDS D L +G DG + +++
Sbjct 273 IVTGGADRMIRYW--DSGSGAALQCHPSD-----GVPLCIDSVGDRLIVGCTDGVVRIWD 325
Query 353 TK 354
TK
Sbjct 326 TK 327
>ref|XP_001746689.1| hypothetical protein [Monosiga brevicollis MX1]
gb|EDQ88585.1| predicted protein [Monosiga brevicollis MX1]
Length=1053
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 61/141 (43%), Gaps = 25/141 (17%)
Query 229 IATASNDGVVRLW---SSGLLLKFRLLVTRS--KDLFGYQYESSGAKLK-RVLPTDVAEI 282
IA+AS DG VRLW ++ LL +L R FG S GA + + P VA
Sbjct 401 IASASLDGHVRLWDLDTNRLLAALKLRSARDDPSGSFGTLRASGGASARPSIDPASVAR- 459
Query 283 KLDVNGMKSSIVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIG 342
+ G+ ++ LP G A + + +P ID ED++ IG
Sbjct 460 AMSGGGLAAA--------------LPTGRGAAPSSSEGNTDESSLVPWCIDIAEDIVAIG 505
Query 343 DNDGFIEVYNTKN----CLEK 359
+DG + V+ T++ CL +
Sbjct 506 CSDGSVMVWETESGALQCLTQ 526
>sp|A8XSC1.2|NLK_CAEBR RecName: Full=Serine/threonine kinase NLK; AltName: Full=Nemo-like
kinase; AltName: Full=Loss of intestine protein 1
Length=657
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 14/117 (11%)
Query 105 PVDLLPFFVKLFRVNRNKALHYIRDEEE-----ASLSAQDIGGLLQLLTQYSEASIAKLV 159
P++ L + L +A+ Y + + A A ++ L +L Q ++ ++ LV
Sbjct 473 PIEQLQMIIDLLGTPSQEAMKYACEGAKNHVLRAGPRAPNLQSLYRLSQQTTDDAVDLLV 532
Query 160 KFLDVNCDEQISLDESL-HFFLEDSRLKLRTEINRTFHMKLMMSKKVGKSVYSRAFS 215
K L N DE+IS++E+L H +LE+ RL+ FH + +V SR FS
Sbjct 533 KLLKFNPDERISVEEALSHPYLEEGRLR--------FHSCMCSCCYTKANVPSRIFS 581
>ref|XP_002642294.1| C. briggsae CBR-LIT-1 protein [Caenorhabditis briggsae]
emb|CAP35763.1| C. briggsae CBR-LIT-1 protein [Caenorhabditis briggsae]
Length=435
Score = 40.4 bits (93), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 14/117 (11%)
Query 105 PVDLLPFFVKLFRVNRNKALHYIRDEEE-----ASLSAQDIGGLLQLLTQYSEASIAKLV 159
P++ L + L +A+ Y + + A A ++ L +L Q ++ ++ LV
Sbjct 251 PIEQLQMIIDLLGTPSQEAMKYACEGAKNHVLRAGPRAPNLQSLYRLSQQTTDDAVDLLV 310
Query 160 KFLDVNCDEQISLDESL-HFFLEDSRLKLRTEINRTFHMKLMMSKKVGKSVYSRAFS 215
K L N DE+IS++E+L H +LE+ RL+ FH + +V SR FS
Sbjct 311 KLLKFNPDERISVEEALSHPYLEEGRLR--------FHSCMCSCCYTKANVPSRIFS 359
>ref|XP_001432665.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
emb|CAK65268.1| unnamed protein product [Paramecium tetraurelia]
Length=975
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query 244 GLLLKFRLLVTRSKDLFGYQYESSGAKLKRVLPTDVAEIKLDVNGMKSSIVIGYTDRKIR 303
G + L+ R + ++ QYE LK+V+ D + V+ + + IG + K +
Sbjct 596 GAIFIIDLVSYRQELVWKGQYE-----LKKVMFLDTHNCIVSVDSIGNVYFIGVLESKFK 650
Query 304 IFHLPNSFGVAATVTPSKKLAVRGIPMV-IDSHEDLLFIGDNDGFIEVYNTKNCLEK--L 360
L A ++T ++ P+ I+ H+DLL++GD G ++++N K L+K L
Sbjct 651 SKLLLQKTYKAISLTNQEET----FPVTSINYHDDLLYLGDELGNLKIWNIKQVLDKVDL 706
Query 361 HTVQAHIHT 369
H V+ I T
Sbjct 707 HQVEQKIKT 715
>ref|XP_001434313.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
emb|CAK66916.1| unnamed protein product [Paramecium tetraurelia]
Length=623
Score = 39.3 bits (90), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query 290 KSSIVIGYTDRKIRIFHLPNSFGVAA--TVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGF 347
+++ ++ D+ IR + SF + + P + ++I+ +EDLLF G D
Sbjct 374 QNTFILSSDDKTIRCWQ---SFNLQNWNSSQPYNQHTHYVQCLIINENEDLLFSGSFDNS 430
Query 348 IEVYN---TKNCLEKLHTVQAHIHTPVRQLKFVPFLNAIVS 385
I+V+N KNCL +T+ H + PV L P +VS
Sbjct 431 IKVWNVDFNKNCLTYQYTLNKHTN-PVNGLSLSPSEKVLVS 470
>gb|EEH53906.1| predicted protein [Micromonas pusilla CCMP1545]
Length=1142
Score = 38.9 bits (89), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 91/215 (42%), Gaps = 37/215 (17%)
Query 151 SEASIAKLVKFLDVNCDEQISLDE-SLHFFLEDSRLKLRTEINRTFHMKLMMSKKVGKSV 209
++A + +L +D N D + DE S + LE + ++ ++VG +
Sbjct 108 NDARLRQLFNRVDANADGAVDWDEFSTYVLLEGQAARELRALDAVRKYLPEKEQRVGVAG 167
Query 210 YSRAFS-----IKPLSKNSTIPFGIATASNDGVVRLW--SSGLLLKFRLLVTRSKDLFGY 262
S + + L+K +++ AT S+DG VRLW SS +K+ R DL
Sbjct 168 QSDPEAKHRDVVTSLTKMTSLRDTYATTSHDGTVRLWQLSSDGGVKY----VRKVDLRSR 223
Query 263 QYESSGAKLKRVLPTDVAEIKLDVNGMKSSIVIGYTDRKIRIFHLPNSFGVAATVT---- 318
Y ++G LD +G + + DRK++I L + V ++
Sbjct 224 AYLTAG-------------CHLDTSGR---LAVASHDRKVKI--LDKGWKVCGQLSTFQY 265
Query 319 -PSKKLAVRGIPMVIDSHEDLLFI--GDNDGFIEV 350
P + RG + H+D+ +I GD+ G++ V
Sbjct 266 APLCMTSWRGGAKIAPGHKDMDWIAVGDDGGYVHV 300
>emb|CAL57758.1| putative WD-repeat membrane protein (ISS) [Ostreococcus tauri]
Length=1124
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 8/88 (9%)
Query 293 IVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYN 352
+ +G D ++ + +L V TVTP L + + D +D+L IGD G + V++
Sbjct 200 VAVGLEDGRVLLVNLLED-SVLFTVTPDHGLKITALAFRTDDQDDVLCIGDESGRVTVWD 258
Query 353 TKNCLEK--LHTVQAHIH-TPVRQLKFV 377
LEK L TV + H PV LKF+
Sbjct 259 ----LEKRSLRTVISQCHEGPVVALKFL 282
>ref|NP_595227.1| WD repeat protein Swd3 [Schizosaccharomyces pombe]
sp|O43017.1|SWD3_SCHPO RecName: Full=Set1 complex component swd3; Short=Set1C component
swd3; AltName: Full=COMPASS component swd3; AltName: Full=Complex
proteins associated with set1 protein swd3
emb|CAA17803.1| WD repeat protein Swd3 [Schizosaccharomyces pombe]
Length=380
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 82/190 (43%), Gaps = 40/190 (21%)
Query 229 IATASNDGVVRLWSSGLLLKFRLLVT---------RSKDLFGYQYESSGAKLKRVLPTDV 279
IAT+S+DG +++WS+ L FRL T + K G +Y +S + K + D
Sbjct 69 IATSSSDGTIKIWSA---LTFRLECTLFGHYRGISQVKWATGSKYLASASDDKTIRIWDF 125
Query 280 AE--------------IKLDVNGMKSSIVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAV 325
+ +D N + + +V G D +RI++L + G + P+ +
Sbjct 126 EKRCSVRCLKGHTNYVSSIDFNPLGTLLVSGSWDETVRIWNLQD--GTCLRMLPAHSEPI 183
Query 326 RGIPMVIDSHEDLLFIGDNDGFIEVYN--TKNCLEKLHTVQAHIHTPVRQLKFVP----- 378
I + I + L DG +++ + CL+ T+ I+ P+ L+F
Sbjct 184 --ISVSISADGTLCATASYDGMARIWDVLSGQCLK---TLVEPINVPLSNLQFTENRKYL 238
Query 379 FLNAIVSQIR 388
++ + SQIR
Sbjct 239 LVSNLNSQIR 248
>ref|XP_001213911.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gb|EAU35180.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length=661
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 45/97 (46%), Gaps = 20/97 (20%)
Query 229 IATASNDGVVRLW--SSGLLLK-----------------FRLLVTRSKDLFGYQYESSGA 269
I +AS DGV +LW +SGL +K R ++T D YQ++++
Sbjct 510 IVSASGDGVAKLWNITSGLCVKEFPSKDRGLACVEFSDDARTILTGGNDKVIYQFDANTG 569
Query 270 KLKRVLPTDVAEIK-LDVNGMKSSIVIGYTDRKIRIF 305
+ R L V ++ L ++ M IV G D +++F
Sbjct 570 DMVRELKGQVGLVRSLHLDSMNQRIVSGSYDMSVKVF 606
b) BLASTx
Score E
Sequences producing significant alignments: (Bits) Value
ref|XP_001432665.1| hypothetical protein [Paramecium tetraure... 39.7 0.56
ref|XP_001434313.1| hypothetical protein [Paramecium tetraure... 38.9 0.95
emb|CAL57758.1| putative WD-repeat membrane protein (ISS) [Os... 38.9 0.95
ref|XP_002139707.1| hypothetical protein [Cryptosporidium mur... 37.4 2.8
gb|EEQ81313.1| hypothetical protein NCER_102316 [Nosema ceran... 37.0 3.6
emb|CBH17669.1| hypothetical protein, conserved [Trypanosoma ... 36.6 4.7
ref|ZP_05294624.1| 6-phospho-beta-glucosidase [Listeria monoc... 36.6 4.7
sp|A8XSC1.2|NLK_CAEBR RecName: Full=Serine/threonine kinase N... 36.6 4.7
gb|EEH53906.1| predicted protein [Micromonas pusilla CCMP1545] 36.6 4.7
ref|XP_583606.4| PREDICTED: similar to raptor [Bos taurus] 36.6 4.7
ref|XP_001980277.1| GG17057 [Drosophila erecta] >gb|EDV49235.... 36.6 4.7
ref|XP_001421820.1| predicted protein [Ostreococcus lucimarin... 36.6 4.7
ref|XP_002642294.1| C. briggsae CBR-LIT-1 protein [Caenorhabd... 36.6 4.7
ref|XP_828768.1| hypothetical protein [Trypanosoma brucei TRE... 36.6 4.7
ref|XP_640793.1| WD40 repeat-containing protein [Dictyosteliu... 36.6 4.7
gb|EFB20820.1| hypothetical protein PANDA_010765 [Ailuropoda ... 36.2 6.2
ref|NP_001156506.1| regulatory-associated protein of mTOR iso... 36.2 6.2
ref|XP_001746689.1| hypothetical protein [Monosiga brevicolli... 36.2 6.2
ref|XP_001370890.1| PREDICTED: hypothetical protein [Monodelp... 36.2 6.2
gb|EAW89619.1| raptor, isoform CRA_b [Homo sapiens] 36.2 6.2
ref|XP_511729.2| PREDICTED: raptor isoform 2 [Pan troglodytes] 36.2 6.2
ref|XP_001161716.1| PREDICTED: raptor isoform 1 [Pan troglody... 36.2 6.2
ref|XP_001213911.1| conserved hypothetical protein [Aspergill... 36.2 6.2
ref|XP_001110429.1| PREDICTED: similar to raptor isoform 1 [M... 36.2 6.2
ref|XP_001110471.1| PREDICTED: similar to raptor isoform 2 [M... 36.2 6.2
ref|XP_566487.1| nuclear mRNA splicing protein [Cryptococcus ... 36.2 6.2
gb|AAH33258.1| Unknown (protein for IMAGE:5457801) [Homo sapi... 36.2 6.2
ref|XP_850487.1| PREDICTED: similar to raptor [Canis familiaris] 36.2 6.2
dbj|BAA92541.1| KIAA1303 protein [Homo sapiens] 36.2 6.2
ref|NP_065812.1| regulatory-associated protein of mTOR isofor... 36.2 6.2
ref|XP_778058.1| hypothetical protein CNBA0610 [Cryptococcus ... 36.2 6.2
gb|EEU47028.1| hypothetical protein NECHADRAFT_36093 [Nectria... 35.8 8.1
ref|XP_001489653.1| PREDICTED: similar to raptor [Equus cabal... 35.8 8.1
ref|XP_001455319.1| hypothetical protein [Paramecium tetraure... 35.8 8.1
ref|XP_001263278.1| F-box and WD40 domain protein, putative [... 35.8 8.1
ref|XP_426232.2| PREDICTED: similar to p150 target of rapamyc... 35.8 8.1
ALIGNMENTS
>ref|XP_001432665.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
emb|CAK65268.1| unnamed protein product [Paramecium tetraurelia]
Length=975
Score = 39.7 bits (91), Expect = 0.56
Identities = 33/118 (27%), Positives = 58/118 (49%), Gaps = 10/118 (8%)
Frame = +1
Query 613 DLFGYQYE---SSGAKLKRVLPTDVAEIKLDVNGMKSSIVIGYTDRKIRIFHLPNSFGVA 783
DL Y+ E +LK+V+ D + V+ + + IG + K + L A
Sbjct 602 DLVSYRQELVWKGQYELKKVMFLDTHNCIVSVDSIGNVYFIGVLESKFKSKLLLQKTYKA 661
Query 784 ATVTPSKKLAVRGIPMV-IDSHEDLLFIGDNDGFIEVYNTKNCLEK--LHTVQAHIHT 948
++T ++ P+ I+ H+DLL++GD G ++++N K L+K LH V+ I T
Sbjct 662 ISLTNQEET----FPVTSINYHDDLLYLGDELGNLKIWNIKQVLDKVDLHQVEQKIKT 715
>ref|XP_001434313.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
emb|CAK66916.1| unnamed protein product [Paramecium tetraurelia]
Length=623
Score = 38.9 bits (89), Expect = 0.95
Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Frame = +1
Query 829 MVIDSHEDLLFIGDNDGFIEVYN---TKNCLEKLHTVQAHIHTPVRQLKFVPFLNAIVS 996
++I+ +EDLLF G D I+V+N KNCL +T+ H + PV L P +VS
Sbjct 413 LIINENEDLLFSGSFDNSIKVWNVDFNKNCLTYQYTLNKHTN-PVNGLSLSPSEKVLVS 470
>emb|CAL57758.1| putative WD-repeat membrane protein (ISS) [Ostreococcus tauri]
Length=1124
Score = 38.9 bits (89), Expect = 0.95
Identities = 26/86 (30%), Positives = 42/86 (48%), Gaps = 4/86 (4%)
Frame = +1
Query 718 IVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYN 897
+ +G D ++ + +L V TVTP L + + D +D+L IGD G + V++
Sbjct 200 VAVGLEDGRVLLVNLLED-SVLFTVTPDHGLKITALAFRTDDQDDVLCIGDESGRVTVWD 258
Query 898 TKNCLEKLHTVQAHIHT-PVRQLKFV 972
+ L TV + H PV LKF+
Sbjct 259 LEK--RSLRTVISQCHEGPVVALKFL 282
>ref|XP_002139707.1| hypothetical protein [Cryptosporidium muris RN66]
gb|EEA05358.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length=956
Score = 37.4 bits (85), Expect = 2.8
Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Frame = +1
Query 709 KSSIVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIE 888
++ + + TD IRIF L N+ + P ++ + + ED +F G +DG I
Sbjct 185 ETQLAVACTDGSIRIFSLLNNMVTYSYGLPRHSSSILSLTFMT---EDYIFAGSSDGCIL 241
Query 889 VYNTKN--CLEKLHTVQA 936
YN K+ C+E++ TVQA
Sbjct 242 QYNLKSKICVERM-TVQA 258
>gb|EEQ81313.1| hypothetical protein NCER_102316 [Nosema ceranae BRL01]
Length=323
Score = 37.0 bits (84), Expect = 3.6
Identities = 27/121 (22%), Positives = 56/121 (46%), Gaps = 5/121 (4%)
Frame = +1
Query 583 KFRLLVTRSKDLFGYQYESSGAKLKRVLPT-DVAEIKLDVNGMKSSIVIGYTDRKIRIFH 759
+F +LV + D+F Y E++G K + L T + + N + + +G D + ++
Sbjct 161 RFLILVGDTNDVFVYTIENNGYKFFKKLKTVNDGGFSVTWNNLSNKFAVGTQDGFVCVWD 220
Query 760 LPNS---FGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYNTKNCLEKLHTV 930
+ + + + + S + A+R + DLLF + ++ VY+T+ K H V
Sbjct 221 IRSDEKLYTLCSKQQGSHRGAIRNVFFSTKKSLDLLFFTEQSSYLSVYDTRT-FTKRHVV 279
Query 931 Q 933
+
Sbjct 280 K 280
>emb|CBH17669.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length=541
Score = 36.6 bits (83), Expect = 4.7
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 7/62 (11%)
Frame = +1
Query 718 IVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYN 897
IV G DR IR + + G A PS G+P+ IDS D L +G DG + +++
Sbjct 273 IVTGGADRMIRYWD--SGSGAALQCHPSD-----GVPLCIDSVGDRLIVGCTDGVVRIWD 325
Query 898 TK 903
TK
Sbjct 326 TK 327
>ref|ZP_05294624.1| 6-phospho-beta-glucosidase [Listeria monocytogenes FSL J1-208]
Length=438
Score = 36.6 bits (83), Expect = 4.7
Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
Frame = +1
Query 634 ESSGAKLKRVLPTDVAEIKLDVNGMKSSIVIGYTDRKIRIFHLPNSFGVAATVTPSKKLA 813
E +G +K L D E D + + + + +G D +++ +PNS+GV T +
Sbjct 60 EKAGVDMKVHLTLDREEALKDADFVTTQLRVGLLDARVKDERIPNSYGVVGQETNGQAAC 119
Query 814 VRG---IPMVIDSHEDL 855
+G IP+++D +D+
Sbjct 120 SKGLRTIPVILDICKDM 136
>sp|A8XSC1.2|NLK_CAEBR RecName: Full=Serine/threonine kinase NLK; AltName: Full=Nemo-like
kinase; AltName: Full=Loss of intestine protein 1
Length=657
Score = 36.6 bits (83), Expect = 4.7
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 9/80 (11%)
Frame = +1
Query 250 AQDIGGLLQLLTQYSEASIAKLVKFLDVNCDEQISLDESL-HFFLEDSRLKLRTEINRTF 426
A ++ L +L Q ++ ++ LVK L N DE+IS++E+L H +LE+ RL+ F
Sbjct 510 APNLQSLYRLSQQTTDDAVDLLVKLLKFNPDERISVEEALSHPYLEEGRLR--------F 561
Query 427 HMKLMMSKKVGKSVYSRAFS 486
H + +V SR FS
Sbjct 562 HSCMCSCCYTKANVPSRIFS 581
>gb|EEH53906.1| predicted protein [Micromonas pusilla CCMP1545]
Length=1142
Score = 36.6 bits (83), Expect = 4.7
Identities = 50/215 (23%), Positives = 91/215 (42%), Gaps = 37/215 (17%)
Frame = +1
Query 292 SEASIAKLVKFLDVNCDEQISLDE-SLHFFLEDSRLKLRTEINRTFHMKLMMSKKVGKSV 468
++A + +L +D N D + DE S + LE + ++ ++VG +
Sbjct 108 NDARLRQLFNRVDANADGAVDWDEFSTYVLLEGQAARELRALDAVRKYLPEKEQRVGVAG 167
Query 469 YSRAFS-----IKPLSKNSTIPFGIATASNDGVVRLW--SSGLLLKFRLLVTRSKDLFGY 627
S + + L+K +++ AT S+DG VRLW SS +K+ R DL
Sbjct 168 QSDPEAKHRDVVTSLTKMTSLRDTYATTSHDGTVRLWQLSSDGGVKY----VRKVDLRSR 223
Query 628 QYESSGAKLKRVLPTDVAEIKLDVNGMKSSIVIGYTDRKIRIFHLPNSFGVAATVT---- 795
Y ++G LD +G + + DRK++I L + V ++
Sbjct 224 AYLTAGC-------------HLDTSG---RLAVASHDRKVKI--LDKGWKVCGQLSTFQY 265
Query 796 -PSKKLAVRGIPMVIDSHEDL--LFIGDNDGFIEV 891
P + RG + H+D+ + +GD+ G++ V
Sbjct 266 APLCMTSWRGGAKIAPGHKDMDWIAVGDDGGYVHV 300
>ref|XP_583606.4| PREDICTED: similar to raptor [Bos taurus]
Length=1335
Score = 36.6 bits (83), Expect = 4.7
Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 15/142 (10%)
Frame = +1
Query 526 IATASNDGVVRLWSSGL-LLKFRLLVTRSKDLFGYQYESSGAKLKRVLPTDVAEIKLDVN 702
+ TA++DG +R+W + L K +VT + L + GA + V+ + L +
Sbjct 1084 LLTATDDGAIRVWKNFADLEKNPEMVTAWQGLSDMLPTTRGAGM--VVDWEQETGLLMSS 1141
Query 703 GMKSSIVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGF 882
G I I TDR++++ +P G + VT + DSH L+ G DG
Sbjct 1142 GDVRIIRIWDTDREMKVQDIPT--GADSCVTS----------LSCDSHRSLIVAGLGDGS 1189
Query 883 IEVYNTKNCLEKLHTVQAHIHT 948
I VY+ + L + + HT
Sbjct 1190 IRVYDRRMALSECRVMTYREHT 1211
>ref|XP_001980277.1| GG17057 [Drosophila erecta]
gb|EDV49235.1| GG17057 [Drosophila erecta]
Length=921
Score = 36.6 bits (83), Expect = 4.7
Identities = 19/67 (28%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Frame = +1
Query 718 IVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYN 897
++IGY+ I F++ + F A+ TP K+AVRG + D+ + G ++G ++ ++
Sbjct 469 VIIGYSSGDIERFNIQSGFRRASYGTPGHKMAVRG--LASDNLNQTVISGCSEGLLKFWS 526
Query 898 TKNCLEK 918
K ++K
Sbjct 527 FKGKVDK 533
>ref|XP_001421820.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gb|ABP00114.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length=934
Score = 36.6 bits (83), Expect = 4.7
Identities = 24/93 (25%), Positives = 45/93 (48%), Gaps = 2/93 (2%)
Frame = +1
Query 718 IVIGYTDRKIRIFHLPNSFGVAATVTPSKKLAVRGIPMVIDSHEDLLFIGDNDGFIEVYN 897
+ +G D ++ + ++ V T+TP + + V + D +D+L +GD G + V++
Sbjct 214 VAVGLADGRVLLVNVLED-KVLFTLTPERGVKVTALAFRTDDQDDVLCVGDETGRVTVWD 272
Query 898 TKNCLEKLHTVQAHIHTPVRQLKFVPFLNAIVS 996
+ + VQ H PV LKF+ +VS
Sbjct 273 LEKRSLRTLIVQCH-EGPVVSLKFLDGQPVMVS 304

