Lab Module 8 - Reactome
This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. This means that you are able to copy, share and modify the work, as long as the result is distributed under the same license.
Module 8 Practical Lab: Reactome
By Robin Haw
Goal: Analyze gene lists and somatic mutation data to identify biology that contributes to GBM and ovarian cancer.
Example 1: Network-based analysis of GBM gene-sample data
o Open up Cytoscape.
o Go to Apps>Reactome FI and Select “Gene Set/Mutational Analysis”.
o Choose “2016” (Latest) Version.
o Upload/Browse GBM_genesample.txt file.
o Select “Gene/sample number pair” and Choose sample cutoff value of 4.
o Select “Fetch FI annotations”.
o Click OK.
- Describe the size and composition of the GBM sub-network?
- What are the driver mutations?
- Describe the TP53-PEG3 interaction, and the source information to support this interaction?
- Describe the data sources for the TAF1-TAF7L FI?
- After clustering, how many modules are there?
- How many pathway gene sets are there in Module 2 when the FDR Filter is set to 1.0E-4 and Module Size Filter to 10?
o Hint: Analyze Module Functions>Pathway Enrichment. Select appropriate filters at each step. - What are the most significant pathway gene sets in Module 0, 1, and 3?
o Hint: You don’t need to list them all!
Example 2: Network-based analysis of OvCa somatic mutation
o Open up Cytoscape.
o Go to Apps>Reactome FI and Select “Gene Set/Mutational Analysis”.
o Choose “2016” (Latest) Version.
o Upload/Browse OVCA_TCGA_MAF.txt file.
o Select “NCI MAF” (Mutation Annotation File) and Choose sample cutoff value of 4.
o Do not select “Fetch FI annotations”.
o Click OK.
- Describe the size and composition of the OvCa network?
- What are the most frequently mutated genes?
- After clustering, how many modules are there?
- How many pathway gene sets are there in Module 0 when the FDR Filter is set to 0.005 and Module Size Filter to 10?
o Hint: Analyze Module Functions>Pathway Enrichment. Select appropriate filters at each step. - What are the most significant pathway gene sets in Module 1, 2, 3 and 4?
- Do the GO Biological Process annotations correlate with the significant pathway annotations for Module 2?
o Hint: Analyze Module Functions>GO Biological Process. Select appropriate filters at each step. - What are the most significant GO Cell Component gene sets in Module 3 when the FDR Filter is set to 0.005 and Module Size Filter to 10? [Optional]
o Hint: Analyze Module Functions>GO Cell Component. Select appropriate filters at each step. - Are any of the modules annotated with the NCI Disease term: “Stage_IV_Breast_Cancer” [malignant cancer]?
o Hint: Load Cancer Gene Index>Neoplasm>Neoplasm_by_Site>Breast Neoplasm>……. - How many modules are statistically significant in the CoxPH analysis?
o Hint: Analyze Module Functions>Survival Analysis>Upload/Browse OVCA_TCGA_Clinical.txt. Click OK. - What does the Kaplan-Meyer plot show for the most clinically significant modules?
o Hint: Click the most statistically significant module link [blue line] from the CoxPH results panel. Click OK. Click #_plot.pdf to display Kaplan-Meyer plot. Repeat this for the other significant module links. KM plot: samples having genes mutated in a module (red line), and samples having no genes mutated in the module (green line). - Taking into what you have learned about module 4, what is your hypothesis?