
PR4: Add deep_mmd_loss files #170

Open · sanaAyrml wants to merge 20 commits into main

Conversation

sanaAyrml (Collaborator)

PR Type

[Feature]

Short Description

This is a tentative implementation of the deep MMD loss.

Tests Added

No tests added yet.

sanaAyrml requested a review from emersodb on June 7, 2024 at 06:38
EvaluationLosses: an instance of EvaluationLosses containing checkpoint loss and additional losses
indexed by name.
"""
for layer in self.flatten_feature_extraction_layers.keys():
@emersodb (Collaborator) · Jun 19, 2024

If you're going to be indexing into self.deep_mmd_losses anyway, could we simply do

for layer_loss_module in self.deep_mmd_losses.values():
    layer_loss_module.training = False

For Ditto, we do this process in validate and train_by_steps/train_by_epochs for the global model, maybe we can just do this there?

@emersodb (Collaborator)

I think it's still worth overriding compute_evaluation_loss and compute_training_loss and asserting that all layer_loss_module.training == False or vice versa though to be safe 🙂
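
A rough sketch of what such overrides could look like (the signatures below are illustrative and may not match the exact fl4health client API):

def compute_evaluation_loss(self, preds, features, target):
    # Sanity check: all deep MMD kernel modules should be frozen during evaluation.
    for layer_loss_module in self.deep_mmd_losses.values():
        assert not layer_loss_module.training
    return super().compute_evaluation_loss(preds, features, target)

def compute_training_loss(self, preds, features, target):
    # Conversely, the deep kernels should be in training mode while training.
    for layer_loss_module in self.deep_mmd_losses.values():
        assert layer_loss_module.training
    return super().compute_training_loss(preds, features, target)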

@emersodb (Collaborator)

I also might be missing this, but I don't see where we set layer_loss_module.training to True in the client. Based on the loss code, this would mean that we won't run training of the deep kernels after the first server round, which I think we want to keep doing?

@sanaAyrml (Collaborator, Author)

Good catch! The True setting was indeed missing, so I added it to the update_before_train function. Following your suggestion, I moved the False setting to the validate function. I kept the assertions in both compute_evaluation_loss and compute_training_loss functions for consistency.
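
In other words, roughly the following (a sketch; the actual hook signatures in the client may differ slightly):

def update_before_train(self, current_server_round):
    # Turn deep kernel optimization back on at the start of each round of local training.
    for layer_loss_module in self.deep_mmd_losses.values():
        layer_loss_module.training = True
    super().update_before_train(current_server_round)

def validate(self):
    # Freeze the deep kernels so that validation never updates them.
    for layer_loss_module in self.deep_mmd_losses.values():
        layer_loss_module.training = False
    return super().validate()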

@emersodb (Collaborator)

We can just iterate through the self.deep_mmd_losses values and do assertions I think?

for layer_loss_module in self.deep_mmd_losses.values():
    assert not layer_loss_module.training

list(self.featurizer.parameters()) + [self.epsilonOPT] + [self.sigmaOPT] + [self.sigma0OPT], lr=self.lr
)

def Pdist2(self, x: torch.Tensor, y: Optional[torch.Tensor]) -> torch.Tensor:
@emersodb (Collaborator)

maybe expand this to pairwise_distance_squared?

@emersodb (Collaborator)

It looks like we don't leverage the fact that y can be None to get the distances of x with itself. Maybe we just drop that option and require y to be passed, to simplify this function.
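
For reference, a sketch of the simplified helper with y required, using the expanded name suggested above (illustrative only; it assumes torch is already imported in the module):

def pairwise_distance_squared(self, x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    # Squared Euclidean distance between each row of x and each row of y:
    # ||x_i||^2 - 2 * x_i . y_j + ||y_j||^2, clamped at zero to guard against
    # small negative values caused by floating point error.
    x_norm = (x**2).sum(dim=1, keepdim=True)
    y_norm = (y**2).sum(dim=1, keepdim=True)
    distances = x_norm - 2.0 * torch.mm(x, y.t()) + y_norm.t()
    return torch.clamp(distances, min=0.0)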

# Compute output of deep network
model_output = self.featurizer(features)
# Compute epsilon, sigma and sigma_0
ep = torch.exp(self.epsilonOPT) / (1 + torch.exp(self.epsilonOPT))
@emersodb (Collaborator)

rename epsilon and note that it is the epsilon in $\kappa_w(x, y)$ in the paper

@emersodb (Collaborator)

It doesn't look like we did this? I think both the rename and comment are worthwhile

@sanaAyrml (Collaborator, Author)

Yeah I missed this
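
If it helps, the rename and comment might look roughly like this (epsilon_opt is an illustrative attribute name, not the current one):

# epsilon is the mixing weight epsilon in the deep kernel kappa_w(x, y) from the paper;
# the logistic transform maps the unconstrained parameter into (0, 1).
epsilon = torch.exp(self.epsilon_opt) / (1 + torch.exp(self.epsilon_opt))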

# Compute epsilon, sigma and sigma_0
ep = torch.exp(self.epsilonOPT) / (1 + torch.exp(self.epsilonOPT))
sigma = self.sigmaOPT**2
sigma0_u = self.sigma0OPT**2
@emersodb (Collaborator) · Jun 19, 2024

Based on the implementation of MMDu, I would suggest renaming sigma0 to sigma_phi, sigma0OPT to sigma_phi_opt, and sigma0_u to sigma_phi (since there doesn't seem to be any reason to have _u in there anyway). Similarly, anything that is sigma or sigmaOPT can be sigma_q or sigma_q_opt to match the notation of the paper.
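
With that rename, the block above would read roughly as follows (attribute names are illustrative):

# Bandwidths of the two Gaussian components of the deep kernel, following the paper's notation:
# sigma_q for the kernel on the raw inputs, sigma_phi for the kernel on the featurizer outputs.
sigma_q = self.sigma_q_opt**2
sigma_phi = self.sigma_phi_opt**2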

sanaAyrml changed the base branch from add_mkmmd_loss to sa_update_mkmmd_loss on September 16, 2024
sanaAyrml changed the base branch from sa_update_mkmmd_loss to sa_add_cifar10_experiments on September 17, 2024
sanaAyrml changed the title from "PR3: Add deep_mmd_loss files" to "PR4: Add deep_mmd_loss files" on September 17, 2024
Base automatically changed from sa_add_cifar10_experiments to main on October 7, 2024
@emersodb (Collaborator) left a comment

Really nice changes. Just added a few small comments and reminders of a few pieces you might have overlooked in my comments. Very close to ready to go!

for layer, layer_deep_mmd_loss in self.deep_mmd_losses.items():
    deep_mmd_loss = layer_deep_mmd_loss(features[layer], features[" ".join(["init_global", layer])])
    additional_losses["_".join(["deep_mmd_loss", layer])] = deep_mmd_loss
    total_deep_mmd_loss += deep_mmd_loss
total_loss += self.deep_mmd_loss_weight * total_deep_mmd_loss
additional_losses["deep_mmd_loss_total"] = total_deep_mmd_loss
@emersodb (Collaborator)

Just to be safe, maybe we can clone total_deep_mmd_loss here?

@sanaAyrml (Collaborator, Author)

I added that, but I am checking a bunch of other Ditto versions and we don't do this anywhere else. I am wondering whether I should update them as well or not.
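
For concreteness, the suggested safeguard amounts to something like this (a sketch, assuming total_deep_mmd_loss is a tensor that continues to be used afterwards):

# Clone the reported value so that later in-place updates to total_deep_mmd_loss
# cannot change the logged entry.
additional_losses["deep_mmd_loss_total"] = total_deep_mmd_loss.clone()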

ep = torch.exp(self.epsilonOPT) / (1 + torch.exp(self.epsilonOPT))
sigma = self.sigmaOPT**2
sigma0_u = self.sigma0OPT**2
# Compute J (STAT_u)
@emersodb (Collaborator)

I'd include the notation mention in your comment as well if you're alright with it (\hat{J}_{\lambda})
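
For context, in the deep-kernel MMD two-sample test literature this statistic is typically the regularized ratio of the unbiased MMD estimate to its estimated standard deviation; assuming STAT_u follows that convention, the quantity would be

$$\hat{J}_{\lambda}(S_P, S_Q) = \frac{\widehat{\mathrm{MMD}}_u^2(S_P, S_Q; k_\omega)}{\sqrt{\hat{\sigma}^2_{\mathcal{H}_1}(S_P, S_Q; k_\omega) + \lambda}}$$

where $\lambda$ is a small regularizer that keeps the denominator away from zero.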
