
Why is the accuracy so low? #6

XueBaolu opened this issue Jun 4, 2023 · 9 comments
XueBaolu commented Jun 4, 2023

Excuse me, I copied your code into my experimental environment.
First, the virtual data is generated by upsampling. Then it is loaded and normalized with mean=(0.5, 0.5, 0.5), std=(0.25, 0.25, 0.25) into a Dataset. Finally, the virtual dataloader is created from the virtual dataset. The batch_size and batch_num_per_class are 64 and 20, respectively.
The real CIFAR-10 dataset is normalized with mean = [0.5071, 0.4865, 0.4409], std = [0.2673, 0.2564, 0.2762], as in your code, with batch_size = 128.
I set the weight of the virtual-data loss to 1 and the weight of the proxy-align loss to 0.5.
Why is the accuracy only 0.2 at round 100? Did I miss something? Could you help me debug?
[Screenshot 2023-06-04 10 09 54]
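For reference, the per-channel normalization described above can be sketched in NumPy. The shapes and random data here are placeholders standing in for the virtual (noise) images and a CIFAR-10 batch; the real pipeline uses PyTorch Datasets:

```python
import numpy as np

# Placeholder virtual data: 200 images (10 classes x batch_num_per_class=20), 32x32 RGB
rng = np.random.default_rng(0)
virtual = rng.random((200, 3, 32, 32)).astype(np.float32)

# Normalize the virtual (noise) data with the stats from the comment above
v_mean = np.array([0.5, 0.5, 0.5], dtype=np.float32).reshape(1, 3, 1, 1)
v_std = np.array([0.25, 0.25, 0.25], dtype=np.float32).reshape(1, 3, 1, 1)
virtual_norm = (virtual - v_mean) / v_std

# Normalize one real CIFAR-10 batch (batch_size=128) with its per-channel stats
c_mean = np.array([0.5071, 0.4865, 0.4409], dtype=np.float32).reshape(1, 3, 1, 1)
c_std = np.array([0.2673, 0.2564, 0.2762], dtype=np.float32).reshape(1, 3, 1, 1)
real = rng.random((128, 3, 32, 32)).astype(np.float32)
real_norm = (real - c_mean) / c_std
```

In a PyTorch pipeline the same stats would go into a `transforms.Normalize(mean, std)` applied per image.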

@wizard1203
Owner

Did you run the original project, or re-implement the algorithm with your own code? If you re-implemented it, can you provide the accuracy of FedAvg and other baselines, to check whether the low accuracy comes from the implementation?


XueBaolu commented Jun 5, 2023

Thank you for responding.
I did re-implement the algorithm with my own code. I set lr=0.01 without a scheduler, so accuracy increases slowly. But FedAvg still reached 0.3 accuracy at round 100, higher than my re-implementation of VHL.
So I wonder whether I missed some step of VHL in my code. Could you help me check?

@wizard1203
Owner

Got it. So FedAvg has similar accuracy to VHL in your case. What neural network do you use? A simple CNN may not have enough capacity to fit both the real data and the noise data.


XueBaolu commented Jun 5, 2023

Strangely, VHL is worse than FedAvg in my case. I use ResNet-18.

@wizard1203
Owner

Could you try lr=0.1 or lr=0.3? Also check the total number of clients, the number of clients sampled each round, the local epochs, and the non-IID degree of the datasets.


XueBaolu commented Jun 6, 2023

Thank you for responding. I will try it.
I set the number of clients to 10, all sampled in each round, with local epochs set to 1. The alpha of the LDA (Dirichlet) partition is 0.05.
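For reference, the non-IID split described above (alpha=0.05 over 10 clients) is typically produced with a per-class Dirichlet (LDA) partition. A minimal NumPy sketch, with function and variable names of my own choosing rather than this repo's:

```python
import numpy as np

def lda_partition(labels, n_clients=10, alpha=0.05, seed=0):
    """Partition sample indices across clients with a Dirichlet prior per class.

    Smaller alpha -> more non-IID: each client's label distribution is more skewed.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    client_idxs = [[] for _ in range(n_clients)]
    for c in np.unique(labels):
        idx_c = np.flatnonzero(labels == c)
        rng.shuffle(idx_c)
        # Draw each client's share of this class from Dirichlet(alpha, ..., alpha)
        p = rng.dirichlet([alpha] * n_clients)
        cuts = (np.cumsum(p) * len(idx_c)).astype(int)[:-1]
        for client, part in enumerate(np.split(idx_c, cuts)):
            client_idxs[client].extend(part.tolist())
    return client_idxs

# Toy check: 1000 samples, 10 classes, 100 samples per class
labels = np.repeat(np.arange(10), 100)
parts = lda_partition(labels, n_clients=10, alpha=0.05)
```

With alpha=0.05 most clients end up holding only one or two classes, which is an extremely non-IID setting.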


XueBaolu commented Jun 7, 2023

Excuse me.
I ran my code with lr=0.1, but VHL is still worse. The VHL result is the upper screenshot and the FedAvg result is the lower one.
[Screenshot 2023-06-07 09 02 29: VHL result]
[Screenshot 2023-06-07 09 02 46: FedAvg result]

@wizard1203
Owner

I see. The accuracy of FedAvg can be improved further: you can try lr=0.3, or lr=0.1 with momentum. Also, the first conv layer of ResNet-18 should be 3x3 for CIFAR-10, instead of the 7x7 used for ImageNet.
For comparing VHL and FedAvg:

  1. Could you try loading more noise data, and ensuring that every class can be sampled in each iteration?
  2. Maybe you can try sampling 5 clients each round instead of all 10. I suspect sampling all clients every round makes the setting low-variance.
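Suggestion 1 can be sketched as a class-balanced batch sampler over the virtual (noise) data, so every class appears in every iteration. The names below are illustrative, not taken from this repo:

```python
import random
from collections import defaultdict

def balanced_virtual_batches(labels, per_class=4, seed=0):
    """Yield index batches containing `per_class` virtual samples of every class.

    Guarantees each class of the virtual dataset is sampled in each iteration.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    # Number of full balanced batches is limited by the smallest class
    n_iters = min(len(v) for v in by_class.values()) // per_class
    for v in by_class.values():
        rng.shuffle(v)
    for it in range(n_iters):
        batch = []
        for v in by_class.values():
            batch.extend(v[it * per_class:(it + 1) * per_class])
        rng.shuffle(batch)
        yield batch

# 200 virtual samples over 10 classes -> each batch holds 4 indices per class
labels = [i % 10 for i in range(200)]
first = next(balanced_virtual_batches(labels))
```

In PyTorch this logic would be wrapped in a `batch_sampler` passed to the virtual DataLoader.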


XueBaolu commented Jun 7, 2023

OK, I will try it later, thanks!
