Hypothetical Protein 644029933 (Olivia Ho-Shing)

From GcatWiki
Revision as of 14:59, 9 September 2009 by Olhoshing (talk | contribs)

I randomly chose a protein with no predicted function: hypothetical protein 644029933.

To look for a possible function, I tried:

  1. BLASTn
  2. BLASTp
  3. Kyte-Doolittle Plot
  4. PREDATOR
  5. Search for Shine Dalgarno sequence within 50 bp upstream

BLASTn and BLASTp

The nucleotide sequence for this gene has a defined start codon (ATG) and stop codon (TGA). Other than a perfect alignment with itself (the Halomicrobium mukohataei complete genome), the BLASTn hits had poor query coverage (<10%) and unreliable E-values (>0.05).

Hyp-blastn.png
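The two cutoffs mentioned above (query coverage below 10% and E-values above 0.05) can be applied as a simple filter. This is only a minimal sketch: the `BlastHit` record, the `reliable_hits` function, and the example hits are hypothetical placeholders, not the actual BLAST output shown in the screenshot.

```python
from dataclasses import dataclass


@dataclass
class BlastHit:
    """Hypothetical record holding the two columns used for filtering."""
    subject: str
    query_coverage: float  # percent of the query covered by the alignment
    e_value: float


def reliable_hits(hits, min_coverage=10.0, max_e_value=0.05):
    """Keep hits with coverage >= min_coverage and E-value <= max_e_value."""
    return [
        h for h in hits
        if h.query_coverage >= min_coverage and h.e_value <= max_e_value
    ]


# Placeholder values for illustration only; these are NOT real results.
example_hits = [
    BlastHit("self match (placeholder)", 100.0, 0.0),
    BlastHit("weak hit A (placeholder)", 6.0, 0.31),
    BlastHit("weak hit B (placeholder)", 4.0, 2.5),
]

kept = reliable_hits(example_hits)
print([h.subject for h in kept])
```

With cutoffs like these, only the self match would survive, which matches the situation described for this gene.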

BLASTp returned several hits to monooxygenase proteins, but their E-values were too high to be reliable.


Hyp-blastp.png

Hyp-blastp-align.png

Kyte-Doolittle Plot

Kyte-Doolittle hydropathy plots use an amino acid sequence to predict whether a protein has membrane-spanning regions, as in an integral membrane protein. Knowing whether this hypothetical protein is an integral membrane protein would give some clue to its function. When I submitted the amino acid sequence, the plot indicated that the protein probably does not span the membrane: if it did, there would be peaks rising above the red line at 1.8.


Kyte-doolittle.png
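For reference, a hydropathy profile of this kind can be computed by averaging Kyte-Doolittle hydropathy values over a sliding window. The sketch below is not the web tool's code. The function names are my own, and the window size of 19 is an assumption (a common choice for finding transmembrane regions). The 1.8 threshold is the red line mentioned above.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KD_SCALE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=19):
    """Return the mean hydropathy of every full window along the sequence."""
    scores = [KD_SCALE[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def has_transmembrane_peak(protein, window=19, threshold=1.8):
    """True if any window average rises above the threshold."""
    return any(s > threshold for s in hydropathy_profile(protein, window))


# Made-up toy sequence (not the hypothetical protein from this page).
toy = "MKTDEERNQSGD" + "LLIVALLVAILLGAVLIAL" + "KRDEESNQTGDK"
print(has_transmembrane_peak(toy))
```

A sequence shorter than the window yields an empty profile, so it is reported as having no peak.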

PREDATOR

PREDATOR predicts the secondary structure (helix, strand, and coil regions) of an amino acid sequence, which could also help in predicting the function. Here are the results from submitting the hypothetical protein sequence:


Hyp-predator.png

For comparison, I also ran PREDATOR on one of the monooxygenases from the BLASTp hits:


Mono-predator.png

The predicted structure of the hypothetical protein could resemble a portion of the monooxygenase, but the comparison is inconclusive and it could just as well be something completely different.
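One rough way to make the "similar to a portion" comparison less subjective would be to slide the shorter prediction along the longer one and record the best fraction of matching states. This is only a sketch under assumptions: `best_window_match` is a hypothetical helper, and it assumes each prediction has been copied out as a string of per-residue states such as H (helix), E (strand), and C (coil). It ignores gaps and is no substitute for a real structural alignment.

```python
def best_window_match(short_ss, long_ss):
    """Slide short_ss along long_ss; return (offset, fraction) of the best match.

    Both arguments are strings of per-residue states, e.g. 'H', 'E', 'C'.
    """
    if not short_ss or len(short_ss) > len(long_ss):
        raise ValueError("short_ss must be non-empty and no longer than long_ss")

    n = len(short_ss)
    best_offset, best_fraction = 0, -1.0
    for offset in range(len(long_ss) - n + 1):
        window = long_ss[offset:offset + n]
        # Count positions where the predicted states agree.
        matches = sum(a == b for a, b in zip(short_ss, window))
        fraction = matches / n
        if fraction > best_fraction:
            best_offset, best_fraction = offset, fraction
    return best_offset, best_fraction


# Made-up state strings for illustration; NOT the PREDATOR output above.
offset, fraction = best_window_match("HHHEE", "CCCHHHEECC")
print(offset, fraction)
```

A high fraction only says the pattern of helices and strands lines up. Unrelated proteins can share such patterns, so this would be weak evidence at best.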

Shine Dalgarno

The 50 bp upstream of the start codon are:

TCTGGAGATCGACCGCAAGGCCGTGACCGTCCGAGCGGAGGACTGACAGC

I found what looks like a Shine Dalgarno sequence (CGGAGGA) 9 bp upstream of the start codon, which is around the average spacer length for our genome.
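The position of the candidate motif can be checked with a few lines of code. In this sketch the function name and the way the result is reported are my own choices. The upstream sequence and the motif are the ones given above. The function reports both the number of bases between the motif and the start codon and the position of the motif's last base, since "bp upstream" can be counted either way.

```python
UPSTREAM = "TCTGGAGATCGACCGCAAGGCCGTGACCGTCCGAGCGGAGGACTGACAGC"
MOTIF = "CGGAGGA"


def find_motif_spacing(upstream, motif):
    """Locate the motif occurrence closest to the start codon.

    Returns (start_index, spacer_length, last_base_position), or None if the
    motif is absent. start_index is 0-based within `upstream`; spacer_length
    is the number of bases between the motif and the start codon;
    last_base_position is the motif's final base relative to the start codon,
    where the base immediately before ATG is -1.
    """
    start = upstream.rfind(motif)
    if start == -1:
        return None
    end = start + len(motif)          # index just past the motif
    spacer = len(upstream) - end      # bases between motif and ATG
    last_base_position = -(spacer + 1)
    return start, spacer, last_base_position


print(find_motif_spacing(UPSTREAM, MOTIF))
```

For the sequence on this page, the motif's last base falls at position -9, with 8 bases between it and the ATG.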