Gene Annotation Template
Gene Annotation Log - Template
Basic Information:
DNA Coordinates:
DNA Sequence (FASTA format):
Protein Sequence (FASTA format):
Isoelectric Point:
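Note on FASTA formatting: the two sequence fields above expect a header line beginning with ">" followed by the sequence. The sketch below shows one way to produce that layout. The function name to_fasta, the example header, and the 60-character line width are assumptions chosen for illustration, not requirements of this template.

```python
def to_fasta(header: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as a FASTA record.

    The 60-character line width is a common convention, assumed here.
    """
    # Remove spaces and line breaks that often come along when pasting,
    # then normalize to upper case.
    cleaned = "".join(sequence.split()).upper()
    # Split the sequence into fixed-width lines.
    body = [cleaned[i:i + width] for i in range(0, len(cleaned), width)]
    return "\n".join([">" + header] + body)


if __name__ == "__main__":
    # Hypothetical header and made-up sequence, for illustration only.
    print(to_fasta("example_gene", "atggcg ccgtta gcaggc taa"))
```

The same function can be used for both the DNA and the protein entry, since it does not check which alphabet the sequence uses.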
Similarity Data (Sequence-Based):
BLAST Data:
- Gene Product Name:
- Top hit – organism:
- Length, Score, E-value, Identity, Positives and Gaps
- Alignment of Top Hit and Query Sequence
CDD:
- Significant COG Hits:
- Names of COGs:
- Score:
- E-value:
PDB:
- Significant Structure Hits:
o Length
o Score
o E-value
o Identities
o Positives
o Gaps
o Alignment
T-Coffee:
- Multi-Sequence Alignment
Cellular Localization Data:
TMHMM:
- Number of Predicted TMHs:
- Transmembrane Topology graph and comment
SignalP:
- Signal Peptide Probability
- Signal Peptide Graph
PSORT:
- Cytoplasmic Score:
- Cytoplasmic Membrane Score:
- Periplasmic Score:
- Outer Membrane Score:
- Extracellular Score:
- Final Prediction for Protein Location (of the above listed):
Phobius:
- Enter Graph:
Final Hypothesis: Where do you expect to find this protein?
Alternative Open Reading Frames:
Proposed DNA Coordinates:
Reasoning:
Structure-Based Evidence of Function:
Pfam-A:
- Significant Matches:
- Pfam Name:
- Pairwise Alignment:
- HMM logo:
- Key Functional Residues:
PDB:
- Significant Structure Hits:
o Length
o Score
o E-value
o Identities
o Positives
o Gaps
- Alignment:
Pathways:
KEGG – Map:
EcoCyc – Pathway:
E.C. Number:
Duplication and Degradation:
Paralog:
- Length
- Score
- E-value
- Identity
- Positives
- Gaps
Alignment of Top Hit and Query Sequence:
Evidence of Horizontal Gene Transfer:
Phylogenetic Tree Diagram:
Gene Context:
- Ortholog Neighborhood Region of Organism:
- Examples of Similarities or Differences:
- Comment:
Chromosome Viewer GC Heat Map:
- Characteristic GC% of genome:
- Average GC% of gene:
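Note on the GC% fields: if the chromosome viewer does not report a value directly, the gene's GC% can be estimated from the DNA sequence recorded under Basic Information. The sketch below is a minimal example. The function name gc_percent, the example sequence, and the genome value are all made up for illustration, and ambiguous bases such as N are assumed to be excluded from the calculation.

```python
def gc_percent(seq: str) -> float:
    """Return GC content as a percentage of the unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at  # ambiguous bases such as N are ignored (assumption)
    if total == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return 100.0 * gc / total


if __name__ == "__main__":
    gene_seq = "ATGGCGCCGTTAGCAGGCTAA"  # made-up sequence, illustration only
    genome_gc = 50.0                    # hypothetical genome-wide GC%
    gene_gc = gc_percent(gene_seq)
    print(f"Gene GC%: {gene_gc:.1f}")
    print(f"Difference from genome: {gene_gc - genome_gc:+.1f}")
```

A large difference between the gene's GC% and the genome's characteristic GC% is one line of evidence to weigh alongside the phylogenetic tree and gene context. It does not establish horizontal transfer on its own.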
RNA (Rfam):
RNA Family:
Bit Score: