Tencrop on test images & preprocessing imbalanced data #24

han-liu · 2018-04-27T01:26:21Z

Hi arnoweng,

I have two following questions in this project.
1):
I'm wondering why to apply tencrop technique on testing images. I thought data augmentation techniques should only be applied on training set in order to add diversity of training images while test images should be kept unchanged so the testing results can be compared with others. If you use tencrop on testing images, you are technically using a different testing set, right?
2):
Do you think if it is necessary to pre-process the imbalanced data? I noticed there is a HUGE imbalance between the sample number of hernia and other diseases (~200 images vs 10000 e.g. infiltration), and thus different testing set would result in really different aucroc on at least Hernia. e.g. If my test set accidentally include only just 10 Hernia images, I guess the aucroc score of Hernia in this test set would be really high like 93%. In contrast, if there are around 150 Hernia in test set, the aucroc score would be low.
Thanks a lot!

singhsegv · 2024-04-10T17:02:25Z

Did you ever get the answer for this?

han-liu · 2024-04-10T17:35:33Z

Not yet. @singhsegv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tencrop on test images & preprocessing imbalanced data #24

Tencrop on test images & preprocessing imbalanced data #24

han-liu commented Apr 27, 2018

singhsegv commented Apr 10, 2024

han-liu commented Apr 10, 2024

Tencrop on test images & preprocessing imbalanced data #24

Tencrop on test images & preprocessing imbalanced data #24

Comments

han-liu commented Apr 27, 2018

singhsegv commented Apr 10, 2024

han-liu commented Apr 10, 2024