minerva.analysis.clustering_analysis¶
Classes¶
Perform a clustering analysis on the embeddings generated by some model, |
Module Contents¶
- class minerva.analysis.clustering_analysis.ClusteringAnalysis(data_split='predict')[source]¶
Bases:
minerva.analysis.model_analysis._ModelAnalysisPerform a clustering analysis on the embeddings generated by some model, using the Silhouette score and Davies-Bouldin score, functions implemented in sklearn. The results are returned in a dictionary.
Initialize the analysis with the specified data split.
Parameters¶
- data_splitstr, optional
The data split to use for the analysis, by default “predict”. This specifies which part of the dataset to analyze. Can be one of: [“train”, “validation”, “test”, “predict”].
- compute(model, data)[source]¶
Compute the clustering analysis metrics.
Parameters¶
- modelL.LightningModule
The trained model from which to extract embeddings.
- dataL.LightningDataModule
The data module containing the dataset to analyze.
Returns¶
- dict
A dictionary containing the Silhouette score and Davies-Bouldin score.
- Parameters:
model (lightning.LightningModule)
data (lightning.LightningDataModule)
- data_split = 'predict'¶
- Parameters:
data_split (str)