
Add pFedHN baseline #2377

Open · wants to merge 101 commits into base: main
Conversation

achiverram28
Contributor

Description

Creating a new baseline called pFedHN by reproducing the original paper.

Checklist

  • Implement proposed change
  • Write tests
  • Update documentation
  • Update changelog
  • Make CI checks pass
  • Ping maintainers on Slack (channel #contributions)

Any other comments?

Signed-off-by: achiverram28 <[email protected]>
@jafermarq jafermarq added the summer-of-reproducibility About a baseline for Summer of Reproducibility label Sep 15, 2023
This reverts commit 7372b8d.
@jafermarq jafermarq changed the title pFedHN baseline Add pFedHN baseline Oct 31, 2023
Contributor

@jafermarq jafermarq left a comment


Hi @achiverram28,

Thanks for adding all the new content to the PR. You'll see I made a commit just before my review. It mostly addressed some formatting issues in the README.md.

Please find below a few comments. They are mostly about how to set the device for clients and server. I wasn't able to run the code (`python3 -m pFedHN.main`) due to a device mismatch. My suggestion is to let the client device be set in a simpler way, and to let the user specify the server device from the config.

Please ping me when you have a chance to implement the changes. They shouldn't take long.

Also, as discussed earlier with you, for this baseline the preferable way of showing the results is by means of tables (as in the paper). Including line plots is welcome.
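The device split suggested above could be sketched roughly like this (the helper names and the `server_device` config key are illustrative, not the PR's actual code):

```python
# Hypothetical helpers for the suggested split: each client picks its own
# device from what its machine offers, while the server's device is read
# from the config ("cpu" by default). In the real code these strings
# would be wrapped in torch.device(...).

def resolve_client_device(cuda_available: bool) -> str:
    """Pick the client device based on local hardware."""
    return "cuda:0" if cuda_available else "cpu"

def resolve_server_device(cfg: dict) -> str:
    """Read the server device from the config, defaulting to CPU."""
    return cfg.get("server_device", "cpu")
```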

baselines/pFedHN/pyproject.toml Outdated Show resolved Hide resolved
baselines/pFedHN/pFedHN/client.py Outdated Show resolved Hide resolved
baselines/pFedHN/pFedHN/main.py Outdated Show resolved Hide resolved
baselines/pFedHN/pFedHN/utils.py Outdated Show resolved Hide resolved
```yaml
fraction_fit: 0.1
min_fit_clients: 5
min_available_clients: 5
```

Contributor

Suggested change:

```yaml
server_device: cpu
```

Contributor

Other baselines introduce this to set the device used by the server. Use it in your server.py instead of utils.get_client() (which I recommend removing).

Contributor

@jafermarq jafermarq Oct 31, 2023


The code should be ready to support the server running on either CPU or GPU, regardless of the device the clients use. Currently the code doesn't work if clients don't use the GPU (it fails in the strategy), and when a client does use the GPU, my attempt to run `python3 -m pFedHN.main` failed on the client side.

Contributor Author


Here I have set the device as `torch.device("cuda:0" if torch.cuda.is_available() else "cpu")`, and I have handled the rest of server.py accordingly.

baselines/pFedHN/pFedHN/main.py Outdated Show resolved Hide resolved
Signed-off-by: achiverram28 <[email protected]>
Contributor

@jafermarq jafermarq left a comment


Hi @achiverram28.

You'll see I've pushed some changes. They are mostly quite minor. This is the summary of changes:

  • Moved num_rounds and num_nodes outside of client in the config, then renamed num_nodes to num_clients. This primarily affects main.py and server.py.
  • It's better to set the periodicity at which federated evaluation (i.e. configure_evaluate() and aggregate_evaluate()) takes place via the config. Please see the new evaluate_every passed as an input argument to your custom strategy. Note that it's redundant to have this type of check in aggregate_evaluate(), since it's never called if configure_evaluate() doesn't return instructions, so I have removed that if statement from inside aggregate_evaluate().
  • Formatting fixes
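The evaluate_every gating described above could look roughly like this (a sketch with illustrative names; in the PR this logic would live in the custom strategy's configure_evaluate()):

```python
# Illustrative gating: skip federated evaluation except every
# `evaluate_every` rounds. Returning an empty list from
# configure_evaluate() means aggregate_evaluate() is never called,
# which is why the extra check there was redundant.

def configure_evaluate_instructions(server_round: int, evaluate_every: int, clients: list) -> list:
    if server_round % evaluate_every != 0:
        return []  # no evaluation this round
    # Otherwise build (client, instructions) pairs, as Flower strategies do.
    return [(c, {"round": server_round}) for c in clients]
```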

Please double check that it all looks good.

For the next review could you:

  • Please ensure all types are defined in public methods and functions.
  • I understand you need a custom server class for your baseline. Could you document pFedHNServer better? Especially the additional lines added to fit_round() and evaluate_round()?
  • Please include the FedAvg results when you have some time. It should be as simple as the steps below, but could you do this by reusing your top-level main.py, instead of having it all (effectively a new project) in comparison_experiments/?
    • defining a new config file (maybe conf/fedavg.yaml?)
    • launch simulation using the default server.
    • the idea (as with other baselines already merged) would be to run the FedAvg experiments as:
```shell
python -m pFedHN.main --config-name fedavg  # which will point to a file in conf/fedavg
```
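For illustration only, a hypothetical conf/fedavg.yaml could combine the existing strategy keys with the new top-level ones (all values are placeholders, not the PR's actual config):

```yaml
# Hypothetical conf/fedavg.yaml for the FedAvg comparison run.
# Keys mirror the main config discussed in this review; values are placeholders.
num_rounds: 5000
num_clients: 50
strategy:
  fraction_fit: 0.1
  min_fit_clients: 5
  min_available_clients: 5
server_device: cpu
```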

Also, I've run the CIFAR-10 experiment four times:

```shell
python -m pFedHN.main model.local=True model.variant=1 server.lr=5e-2
```

and sometimes observed quite different results (attempts 1 and 3 ran on one machine, 2 and 4 on another):
  • attempt 1: best seen avg_acc is 0.8926 @ round 4380
  • attempt 2: best seen avg_acc is 0.7406 @ round 1350 -- later rounds go as low as ~0.3
  • attempt 3: best seen avg_acc is 0.8831 @ round 4530
  • attempt 4: best seen avg_acc is 0.8841 @ round 4920

| Setting | Value |
| --- | --- |
| Batch size | 64 |
| Classes per client | 2 |
| Total clients | 50 |
| Total epochs (client-side) | 50 |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it's worth clarifying that this actually means "number of batches". Looking at your train() function in trainer.py, each "epoch" uses a single batch of the trainloader. This seems to be exactly what Algorithm 1 in the paper indicates. Maybe it would be better to rename it to "local step"? (And make that change in the code and config as well.)
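The one-batch-per-"epoch" behaviour described here can be sketched in plain Python (the cycling list stands in for the trainloader; all names are illustrative):

```python
from itertools import cycle

# Each "local step" consumes exactly one batch, matching the per-step
# inner update of Algorithm 1 rather than a full pass over the data.
def run_local_steps(batches, num_steps):
    loader = cycle(batches)      # stand-in for iterating a DataLoader
    seen = []
    for _step in range(num_steps):
        batch = next(loader)     # one batch per step
        seen.append(batch)       # a real trainer would update the model here
    return seen
```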


```python
# inner updates -> obtaining theta_tilda
for _i in range(epochs):
    net.train()
```
Contributor


You can move this outside the for loop.
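Applied to the snippet being discussed, the suggestion is simply to hoist the mode switch out of the loop (a sketch with a dummy stand-in for the PR's `net`):

```python
# Sketch of the hoisted call: net.train() sets training mode once,
# instead of being re-invoked on every inner step.
class DummyNet:
    """Minimal stand-in for a torch.nn.Module's train() toggle."""
    def __init__(self):
        self.train_calls = 0
    def train(self):
        self.train_calls += 1

def inner_updates(net, epochs):
    net.train()              # moved outside the loop
    for _i in range(epochs):
        pass                 # one inner update per local step goes here

net = DummyNet()
inner_updates(net, 50)       # train() is called exactly once
```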

Contributor Author


Yup, I'll do that.

@WilliamLindskog
Contributor

Hi @achiverram28,

Just checking in here. Are you still eager to complete this baseline?

Best regards
William
