Credits to: Haohan Wang, Jinglin Jian from the University of Illinois Urbana-Champaign
Understanding the genetic basis of diseases can open doors to tailored treatments and personalized medicine. Furthermore, the interplay between genes, diseases, and external factors, such as demographics and other traits, can offer deeper insights into disease progression and outcomes.
This project focuses on deciphering the intricate relationships between specific genes, cancer development, and various conditions through a comprehensive analysis of genetic data. Utilizing the Xena dataset, which encompasses clinical and genetic information about 36 types of cancers, this project employs advanced data analysis and regression modeling techniques to explore gene-trait pairs in the context of cancer genetics.
The objective is to understand how certain genes contribute to cancer risk and progression under different demographic and environmental conditions. This project aims not only to deepen our understanding of cancer genetics but also to contribute to personalized medicine approaches in oncology.
Data Analysis: Python Data Visualization: Matplotlib, Seaborn Version Control: GitHub
To achieve the project objective, I explored 48 gene-trait pairs and conditions, which form the unit of each research question. Each of the 48 research questions roughly follows the following methodology.