# Project data overview

The project consists of the following data files:

'./data_rppa.tsv': Details protein expression (RPPA) across samples. The 'Sample_ID' column identifies each sample, and each subsequent column holds the expression level of one protein.

'./data_clinical.tsv': Contains clinical data with fields such as:
    'Sample_ID': (string), Identifier that uniquely specifies a patient.
    'Subtype': (string), Disease subtype.
    'Age': (number), Age at which the condition or disease was first diagnosed.
    'Sex': (string), Sex/gender, either "Male" or "Female".
    'Grade': (string), Neoplasm histologic grade.
    'Stage': (string), Extent of the cancer, in particular whether the disease has spread from the original site to other parts of the body, based on AJCC staging criteria. One of "STAGE I", "STAGE II", "STAGE III", or "STAGE IV".
    'Overall_Survival_Status': (number), Overall patient survival status: 0 for LIVING and 1 for DECEASED.
    'Overall_Survival_In_Months': (number), Overall survival in months since initial diagnosis.
    'Progress_Free_Survival_Status': (number), Progression-free survival status: 0 for CENSORED and 1 for PROGRESSION.
    'Progress_Free_Survival_In_Months': (number), Progression-free survival in months.
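
The field descriptions above can be turned into a quick sanity check. The following is a minimal sketch, not part of the project itself: the function name `find_clinical_problems` is hypothetical, it assumes the file is tab-separated with a header row, and it assumes an empty cell means "missing" (the description does not say how missing values are encoded). 'Subtype' and 'Grade' are not checked because their allowed values are not listed.

```python
import csv

# Allowed values copied from the field descriptions.
CATEGORICAL = {
    "Sex": {"Male", "Female"},
    "Stage": {"STAGE I", "STAGE II", "STAGE III", "STAGE IV"},
}
BINARY = ("Overall_Survival_Status", "Progress_Free_Survival_Status")
NON_NEGATIVE = (
    "Age",
    "Overall_Survival_In_Months",
    "Progress_Free_Survival_In_Months",
)
CHECKED_COLUMNS = tuple(CATEGORICAL) + BINARY + NON_NEGATIVE


def _as_float(text):
    """Parse text as a float, returning None when it is not numeric."""
    try:
        return float(text)
    except ValueError:
        return None


def _is_valid(column, value):
    if column in CATEGORICAL:
        return value in CATEGORICAL[column]
    number = _as_float(value)
    if number is None:
        return False
    if column in BINARY:
        return number in (0.0, 1.0)
    return number >= 0  # remaining columns are ages or durations


def find_clinical_problems(handle):
    """Return (Sample_ID, column, value) for every value that breaks the schema."""
    problems = []
    for row in csv.DictReader(handle, delimiter="\t"):
        for column in CHECKED_COLUMNS:
            value = (row.get(column) or "").strip()
            if not value:
                continue  # assumption: an empty or absent cell means "missing"
            if not _is_valid(column, value):
                problems.append((row.get("Sample_ID"), column, value))
    return problems


# Usage, assuming the file sits next to the script:
# with open("./data_clinical.tsv", newline="") as handle:
#     for problem in find_clinical_problems(handle):
#         print(problem)
```

An empty result only means that the checked columns agree with the descriptions above; it says nothing about whether the clinical values themselves are plausible.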

'./data_mrna.tsv': Lists gene expression at the mRNA level. The 'Sample_ID' column identifies each sample, and each subsequent column holds the expression level of one gene.

'./data_cna.tsv': Shows gene copy number alterations (CNA). The 'Sample_ID' column identifies each sample, and each gene column is marked 1 for a duplication, 0 for no alteration, and -1 for a deletion.
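
To illustrate the 1 / 0 / -1 encoding, the sketch below tallies the calls per gene. It is an illustration under assumptions rather than project code: the function name `summarize_cna` and the label names are hypothetical, the file is assumed to be tab-separated with a header row, and any cell that is not 1, 0, or -1 (for example an empty cell) is counted as "other".

```python
import csv
from collections import Counter

# Meaning of each code, as described for data_cna.tsv.
CNA_LABELS = {1: "duplication", 0: "no_change", -1: "deletion"}


def _label(value):
    """Map a raw cell to a label; anything unexpected becomes 'other'."""
    try:
        number = float(value)
    except (TypeError, ValueError):
        return "other"
    return CNA_LABELS.get(number, "other")


def summarize_cna(handle):
    """Count duplications, unchanged calls, and deletions for each gene column."""
    counts = {}
    for row in csv.DictReader(handle, delimiter="\t"):
        for gene, value in row.items():
            if gene is None or gene == "Sample_ID":
                continue  # skip the identifier and any surplus cells
            counts.setdefault(gene, Counter())[_label(value)] += 1
    return counts


# Usage, assuming the file sits next to the script:
# with open("./data_cna.tsv", newline="") as handle:
#     for gene, tally in summarize_cna(handle).items():
#         print(gene, dict(tally))
```

The same loop would work for './data_mutations.tsv' if the label table were reduced to 1 and 0.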

'./data_mutations.tsv': Details gene mutations. The 'Sample_ID' column identifies each sample, and each gene column is marked 1 if the gene is mutated and 0 if it is not.

'./data_methylation.tsv': Outlines gene methylation levels. The 'Sample_ID' column identifies each sample, and each subsequent column holds the methylation level of one gene.
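
Since every file is keyed by 'Sample_ID', a common first step is to find the samples that appear in all of them. The sketch below shows one way to do that with the standard library. The function names `read_sample_ids` and `shared_sample_ids` are hypothetical, and the code assumes each file is tab-separated with a header row containing 'Sample_ID'. Nothing in the description guarantees that the files cover the same samples, which is why the check may be worth running.

```python
import csv


def read_sample_ids(handle):
    """Collect the non-empty Sample_ID values from one tab-separated table."""
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader, [])
    index = header.index("Sample_ID")  # raises ValueError if the column is absent
    return {row[index] for row in reader if len(row) > index and row[index]}


def shared_sample_ids(handles):
    """Return the sorted Sample_ID values that are present in every table."""
    id_sets = [read_sample_ids(handle) for handle in handles]
    if not id_sets:
        return []
    return sorted(set.intersection(*id_sets))


# Usage, assuming the files sit next to the script:
# paths = ["./data_clinical.tsv", "./data_rppa.tsv", "./data_mrna.tsv",
#          "./data_cna.tsv", "./data_mutations.tsv", "./data_methylation.tsv"]
# handles = [open(path, newline="") for path in paths]
# try:
#     print(len(shared_sample_ids(handles)), "samples appear in all files")
# finally:
#     for handle in handles:
#         handle.close()
```

Reading only the identifier column keeps the check cheap even when a table has one column per gene.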