Built with Tableau; data from Kaggle’s Breast Cancer Wisconsin (Diagnostic) dataset
Each sample in the dataset is described by 30 numeric features computed from cell nuclei, covering size, shape, smoothness, and texture. For example, malignant tumors often have larger radii, more irregular edges (high concavity), and higher boundary complexity (fractal dimension). Taken together, these measurements help distinguish benign from malignant tumors.
The 6-feature correlation heatmap is generated in Python and imported into Tableau:
Run notebooks/correlation_mini.ipynb (or a Colab notebook) to produce data/correlation_matrix_mini.csv.
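
The notebook itself is not reproduced here, so the following is only a minimal sketch of what such a step might look like, assuming pandas is available. The six feature names are a hypothetical selection (the source does not say which six are used), although they follow the column naming of the Kaggle CSV. The helper names correlation_matrix and export_for_tableau are also illustrative, and writing the result in long form (one row per feature pair) is an assumption made because that shape is convenient for a Tableau heatmap.

```python
from pathlib import Path

import pandas as pd

# Hypothetical subset: the source does not name the six features.
# The spellings follow the column names in the Kaggle CSV.
FEATURES = [
    "radius_mean",
    "texture_mean",
    "smoothness_mean",
    "concavity_mean",
    "symmetry_mean",
    "fractal_dimension_mean",
]


def correlation_matrix(df: pd.DataFrame, features=FEATURES) -> pd.DataFrame:
    """Return the Pearson correlation matrix for the selected columns."""
    return df[list(features)].corr(method="pearson")


def export_for_tableau(corr: pd.DataFrame, out_path) -> pd.DataFrame:
    """Write the matrix in long form (one row per feature pair) and return it.

    Long form is an assumption here; the original notebook may write
    the square matrix directly instead.
    """
    long_form = (
        corr.rename_axis("feature_x")
        .reset_index()
        .melt(id_vars="feature_x", var_name="feature_y", value_name="correlation")
    )
    out_path = Path(out_path)
    # Create the target folder (e.g. data/) if it does not exist yet.
    out_path.parent.mkdir(parents=True, exist_ok=True)
    long_form.to_csv(out_path, index=False)
    return long_form
```

With the Kaggle file downloaded locally (the input path below is an assumption), usage could look like: df = pd.read_csv("data/data.csv"), then export_for_tableau(correlation_matrix(df), "data/correlation_matrix_mini.csv"). In Tableau, feature_x and feature_y would then go on rows and columns, with correlation on color.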