Lab Module 8 - Reactome

This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. This means that you are able to copy, share and modify the work, as long as the result is distributed under the same license.

Module 8 Practical Lab: Reactome

By Robin Haw

Goal: Analyze gene lists and somatic mutation data to identify biology that contributes to GBM and ovarian cancer.

Example 1: Network-based analysis of GBM gene-sample data

o Open up Cytoscape.
o Go to Apps>Reactome FI and Select “Gene Set/Mutational Analysis”.
o Choose “2016” (Latest) Version.
o Upload/Browse GBM_genesample.txt file.
o Select “Gene/sample number pair” and Choose sample cutoff value of 4.
o Select “Fetch FI annotations”.
o Click OK.

  1. Describe the size and composition of the GBM sub-network?
  2. What are the driver mutations?
  3. Describe the TP53-PEG3 interaction, and the source information to support this interaction?
  4. Describe the data sources for the TAF1-TAF7L FI?
  5. After clustering, how many modules are there?
  6. How many pathway gene sets are there in Module 2 when the FDR Filter is set to 1.0E-4 and Module Size Filter to 10?
    o Hint: Analyze Module Functions>Pathway Enrichment. Select appropriate filters at each step.
  7. What are the most significant pathway gene sets in Module 0, 1, and 3?
    o Hint: You don’t need to list them all!

Example 2: Network-based analysis of OvCa somatic mutation

o Open up Cytoscape.
o Go to Apps>Reactome FI and Select “Gene Set/Mutational Analysis”.
o Choose “2016” (Latest) Version.
o Upload/Browse OVCA_TCGA_MAF.txt file.
o Select “NCI MAF” (Mutation Annotation File) and Choose sample cutoff value of 4.
o Do not select “Fetch FI annotations”.
o Click OK.

  1. Describe the size and composition of the OvCa network?
  2. What are the most frequently mutated genes?
  3. After clustering, how many modules are there?
  4. How many pathway gene sets are there in Module 0 when the FDR Filter is set to 0.005 and Module Size Filter to 10?
    o Hint: Analyze Module Functions>Pathway Enrichment. Select appropriate filters at each step.
  5. What are the most significant pathway gene sets in Module 1, 2, 3 and 4?
  6. Do the GO Biological Process annotations correlate with the significant pathway annotations for Module 2?
    o Hint: Analyze Module Functions>GO Biological Process. Select appropriate filters at each step.
  7. What are the most significant GO Cell Component gene sets in Module 3 when the FDR Filter is set to 0.005 and Module Size Filter to 10? [Optional]
    o Hint: Analyze Module Functions>GO Cell Component. Select appropriate filters at each step.
  8. Are any of the modules annotated with the NCI Disease term: “Stage_IV_Breast_Cancer” [malignant cancer]?
    o Hint: Load Cancer Gene Index>Neoplasm>Neoplasm_by_Site>Breast Neoplasm>…….
  9. How many modules are statistically significant in the CoxPH analysis?
    o Hint: Analyze Module Functions>Survival Analysis>Upload/Browse OVCA_TCGA_Clinical.txt. Click OK.
  10. What does the Kaplan-Meyer plot show for the most clinically significant modules?
    o Hint: Click the most statistically significant module link [blue line] from the CoxPH results panel. Click OK. Click #_plot.pdf to display Kaplan-Meyer plot. Repeat this for the other significant module links. KM plot: samples having genes mutated in a module (red line), and samples having no genes mutated in the module (green line).
  11. Taking into what you have learned about module 4, what is your hypothesis?