High-Throughput Biology - From Sequence to Networks Module 15 Lab
This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. This means that you are able to copy, share and modify the work, as long as the result is distributed under the same license.
Module 15 Practical Lab: Reactome
By Robin Haw
Goal: Analyze gene lists and somatic mutation data to identify biology that contributes to GBM and ovarian cancer.
Example 1: Pathway-based analysis of GBM genelist
o Open up Cytoscape.
o Go to Apps >Reactome FI>Reactome Pathways.
o Unfurl the “Signal Transduction” events, by clicking the triangle to the left of the event name, in the “Reactome” tab on the left.
o Click on “Signaling by EGFR” or your favourite pathway.
o Right-click on highlighted pathway name to display drop-down menu, select “Show Diagram” to display Signaling by EGFR pathway.
o Right-click on highlighted pathway name to display drop-down menu, select “Analyze Pathway Enrichment”
o Upload/Browse GBM_genelist.txt into Reactome Pathway Enrichment Analysis, and click “OK”.
- What are the most significant biological pathways when the FDR Filter is set to 0.05? o Hint: Right-click on selected pathway in Table Panel, and click “View in Diagram”. Purple-coloured nodes reflect hits in the dataset. Right-click on highlighted nodes to invoke additional features.
Example 2: Network-based analysis of GBM gene-sample data
o Open up Cytoscape.
o Go to Apps>Reactome FI and Select “Gene Set/Mutational Analysis”.
o Choose “2015 (Latest)” Version.
o Upload/Browse GBM_genesample.txt file.
o Select “Gene/sample number pair” and Choose sample cutoff value of 4.
o Select “Fetch FI annotations”.
o Click OK.
- Describe the size and composition of the GBM sub-network?
- What are the driver mutations?
- Describe the TP53-PEG3 interaction, and the source information to support this interaction?
- Describe the data sources for the TAF1-TAF7L FI?
- After clustering, how many modules are there?
- How many pathway gene sets are there in Module 2 when the FDR Filter is set to 0.0001 and Module Size Filter to 10?
o Hint: Analyze Module Functions>Pathway Enrichment. Select appropriate filters at each step. - What are the most significant pathway gene sets in Module 1, 2, 3?
o Hint: You don’t need to list them all!
Example 3: Network-based analysis of OvCa somatic mutation
o Open up Cytoscape.
o Go to Apps>Reactome FI and Select “Gene Set/Mutational Analysis”.
o Choose “2015 (Latest)” Version.
o Upload/Browse OVCA_TCGA_MAF.txt file.
o Select “NCI MAF” (Mutation Annotation File) and Choose sample cutoff value of 4.
o Do not select “Fetch FI annotations”.
o Click OK.
- Describe the size and composition of the OvCa network?
- What are the most frequently mutated genes?
- After clustering, how many modules are there?
- How many pathway gene sets are there in Module 0 when the FDR Filter is set to 0.005 and Module Size Filter to 10?
o Hint: Analyze Module Functions>Pathway Enrichment. Select appropriate filters at each step. - What are the most significant pathway gene sets in Module 0, 2, 3 and 5?
- Do the GO Biological Process annotations correlate with the significant pathway annotations for Module 0?
o Hint: Analyze Module Functions>GO Biological Process. Select appropriate filters at each step. - What are the most significant GO Cell Component gene sets in Module 3 when the FDR Filter is set to 0.005 and Module Size Filter to 10? [Optional]
o Hint: Analyze Module Functions>GO Cell Component. Select appropriate filters at each step. - Are any of the modules annotated with the NCI Disease term: “Stage_IV_Breast_Cancer” [malignant cancer]?
o Hint: Load Cancer Gene Index>Neoplasm>Neoplasm_by_Site>Breast Neoplasm>……. - How many modules are statistically significant in the CoxPH analysis?
o Hint: Analyze Module Functions>Survival Analysis>Upload/Browse OVCA_TCGA_Clinical.txt. Click OK. - What does the Kaplan-Meyer plot show for the most clinically significant modules?
o Hint: Click the most statistically significant module link [blue line] from the CoxPH results panel. Click OK. Click #_plot.pdf to display Kaplan-Meyer plot. Repeat this for the other significant module links. KM plot: samples having genes mutated in a module (red line), and samples having no genes mutated in the module (green line). - Taking into what you have learned about module 2, what is your hypothesis?