L20 worksheet ¶
Breast cancer is one of the leading causes of cancer-related deaths worldwide, with its prognosis and treatment response varying significantly among patients. Advances in transcriptomics allow us to study gene expression levels and identify patterns that contribute to these differences. By analyzing this data, we can uncover valuable insights into the genetic underpinnings of breast cancer outcomes.
In this activity, we will explore a processed subset of the METABRIC dataset , focusing on patients with invasive ductal carcinoma who underwent mastectomy without chemotherapy or radiotherapy. The data includes gene expression levels for 331 genes and survival outcomes for 278 patients (139 survivors and 139 non-survivors).
The data is provided in three formats: Google Sheets , Excel , and NumPy files.