Hypothetical Protein 644029933 (Olivia Ho-Shing)
I randomly chose a protein with no predicted function: 644029933 hypothetical protein.
To look for a possible function, I tried:
- BLASTn
- BLASTp
- Kyte-Doolittle Plot
- PREDATOR
- Search for a Shine-Dalgarno sequence within 50 bp upstream
BLASTn and BLASTp
The nucleotide sequence for this gene has a defined start codon (ATG) and stop codon (TGA). Other than a perfect alignment with itself (Halomicrobium mukohataei complete genome), BLASTn returned only hits with poor query coverage (<10%) and high E-values (>0.05), so none of them are reliable.
BLASTp returned several hits to monooxygenase proteins, but their E-values were too high for the matches to be considered reliable.
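The start/stop codon check mentioned above can be sketched in a few lines of Python. This is only an illustration: the function name `check_orf` and the short test sequence are made up for the example and are not the actual gene sequence.

```python
# Standard stop codons (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def check_orf(dna):
    """Report basic open-reading-frame properties of a coding sequence."""
    dna = dna.upper()
    # Split into complete codons, ignoring any trailing 1-2 leftover bases.
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    in_frame = len(dna) % 3 == 0
    # If the sequence is in frame, the last codon is the expected stop,
    # so only the codons before it count as "internal".
    internal = codons[:-1] if in_frame else codons
    return {
        "length_multiple_of_3": in_frame,
        "starts_with_ATG": dna.startswith("ATG"),
        "ends_with_stop": dna[-3:] in STOP_CODONS,
        "internal_stop": any(c in STOP_CODONS for c in internal),
    }

# Hypothetical toy sequence (ATG GCC AAG TGA), NOT the real gene.
print(check_orf("ATGGCCAAGTGA"))
```

A sequence that starts with ATG, ends with a stop codon, and has no internal stop is at least consistent with being a real gene, although that alone says nothing about its function.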
Kyte-Doolittle Plot
Kyte-Doolittle hydropathy plots use an amino acid sequence to predict whether a protein has hydrophobic stretches long enough to cross the plasma membrane, as in an integral membrane protein. Knowing whether this hypothetical protein is an integral membrane protein would give some clue to its function. When I submitted the amino acid sequence, the plot indicated that this protein probably does not cross the plasma membrane: if it did, there would be peaks rising above the red line at 1.8.
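For anyone who wants to see what the plot is computing, here is a minimal sketch of a Kyte-Doolittle sliding-window average in Python. The hydropathy values are the standard Kyte-Doolittle scale. The window size of 19 is an assumption (it is a common choice for spotting membrane-spanning segments, and the web tool I used may have a different default), and the 1.8 cutoff is just the red line from the plot. The function names are made up for this example.

```python
# Kyte-Doolittle hydropathy scale (one-letter amino acid codes).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydropathy_profile(protein, window=19):
    """Average hydropathy of every full window along the sequence."""
    scores = [KD[aa] for aa in protein.upper()]
    if len(scores) < window:
        return []
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]

def has_candidate_tm_segment(protein, window=19, threshold=1.8):
    """True if any window average rises above the threshold line."""
    return any(s > threshold for s in hydropathy_profile(protein, window))

# Toy sequences for illustration only (not the hypothetical protein):
print(has_candidate_tm_segment("L" * 25))  # very hydrophobic stretch
print(has_candidate_tm_segment("D" * 25))  # very hydrophilic stretch
```

Each point on the plot is one of these window averages, so a membrane-spanning segment shows up as a run of windows above the line rather than a single hydrophobic residue.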
PREDATOR
PREDATOR predicts the secondary structure (helices, strands, and coils) of an amino acid sequence, which could also help in predicting the function. Here are the results I got from submitting the hypothetical protein sequence:
I also wanted to see what results I got from a monooxygenase from the BLASTp hits:
The hypothetical protein looks like it could be similar to a portion of the monooxygenase, or it could be something completely different.
Shine-Dalgarno
The 50 bp upstream of the start codon are: TCTGGAGATCGACCGCAAGGCCGTGACCGTCCGAGCGGAGGACTGACAGC. I found what looks like a Shine-Dalgarno sequence (CGGAGGA) 9 bp upstream of the start codon, which is around the average separator length for our genome.
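The position of the candidate motif can be double-checked with a short Python sketch. The upstream sequence and the motif are taken from the text above; the function name `find_motif` is made up for this example, and the search is an exact string match, which is a simplification (real Shine-Dalgarno sites can vary).

```python
# The 50 bp immediately upstream of the start codon (from the text above).
UPSTREAM = "TCTGGAGATCGACCGCAAGGCCGTGACCGTCCGAGCGGAGGACTGACAGC"

def find_motif(upstream, motif):
    """Return (0-based start index, spacer length) for each exact match.

    The spacer is the number of bases between the end of the motif and
    the start codon, which begins right after the upstream sequence.
    """
    hits = []
    start = upstream.find(motif)
    while start != -1:
        end = start + len(motif)
        hits.append((start, len(upstream) - end))
        start = upstream.find(motif, start + 1)
    return hits

print(find_motif(UPSTREAM, "CGGAGGA"))  # -> [(35, 8)]
```

A spacer of 8 bases means the last base of the motif sits at position -9 relative to the A of the ATG, which matches the 9 bp distance given above.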