February 18, 2016
Classwork
Dr. Campbell and Dr. Heyer made significant progress on coding work for correlation. They changed the p-value from 0.05 to 0.01 so that R-Studio generates a shorter list of genes. (FIND DOCUMENT WITH THE CODE ON IT). Dr. Campbell and Dr. Heyer also mentioned a code that allows us to find gene names, "write.csv(colnames(carp)..." We want to use this code to find a seed gene, and then find genes most correlated to that seed gene.
Other groups in the class are working formatting sequence lists. One group is using Blast 2Go while another group is using gene names from a file and pairing them with gene ontology terms to format sequences with a sequence-based method or name-based method, respectively.
Coding Notes:
- "t" in code = transforming/transposing x and y axis.
Gene Search
All six liver samples are viable. However, we still need to verify intestine samples 3 and 6.
After attaining a list of genes from a previous blast of over represent genes, we tried to match these genes with genes in intestine samples 3 and 6. We googled the names of housekeeping genes from Kathryn's list of housekeeping genes to see if we can decode the differences in gene identifications and match them to genes that would verify our tissue samples. We also attempted to find housekeeping genes given the list of genes in Castoe et al. (2013). As we continued our gene search, we remembered that transcription factors are not always transcribed in high quantities so we must look for change in expression (on vs. off) instead of only looking at quantity of expression.
Identified Genes:
- Housekeeping genes in Python:
- NHE3: a sodium transporter in the intestinal membrane.
- Reference NHE3 article for more information.
Note: Ashlyn left class early this day to travel for Track & Field competition.