When the genomic sequence is run in the ORF finder, 6 ORFs are obtained with the longest length being 864 and shortest is 114. The longest ORF Frame +1 ORF is composed of 864 nucleotides base pairs with 287 amino acids on the 5’ to 3’ strand. It is an incomplete sequence as it does not end with a short codon. From further findings, the unknown sequence may come from the Synechococcus species.
The Synechococcus species is a gram-negative bacterium and photoautotrophs that could be found in oligotrophic, pelagic sea. The lowest E-value result obtained from Frame +1 by the BLAST program is a putative short tail fibre of the Synechococcus phage. The function of the tail fibre is to provide a passage of the phase genome to the host cell and also aids in determination of host specificity for host recognition. These could be the possibe function of the gene. It also contains a DUF32 conserved domain which is still of unknown function. The result predicted from pSORTb shows that the protein location is evenly distributed among the cytoplasmic membrane, periplasmic, outer membrane and extracellular with a value of 2.50 each. These may indicate that the protein is a resident of more than one site. In addition, a signal peptide is present indicating that there is a cleavable N-terminal signal peptide.