Adding UNet Model #210

shivance · 2022-12-27T15:16:45Z

This PR adds the UNet implementation to Metalhead.jl in favor of #112

I've referred official torchhub implementation here and @DhairyaLGandhi 's UNet.jl package.

PR Checklist

Tests are added
Documentation, if applicable

shivance · 2022-12-27T15:19:10Z

The PR is ready for code review. I'm still new to flux so apologies for silly mistakes like not following the docstring style or specific design & code principles in Julia ecosystems.

ToucheSir · 2022-12-27T16:47:08Z

Metalhead has a JuliaFormatter config, so if your editor supports that I would recommend running it to help with code style adherence.

pri1311

I doubt there is supposed to be Batchnorm and ReLu layer before the concat + conv layer

shivance · 2022-12-28T04:38:38Z

@ToucheSir I ran JuliaFormatter.jl using format(".")
Looks like the other files in Metalhead were not formatted as well.
Should I keep it this way or just format unet.jl ?

@pri1311 yup ! corrected that.

src/convnets/unet.jl

src/utilities.jl

shivance · 2023-01-01T15:57:53Z

@ToucheSir @darsnack Finally made it.
Successfully modified the model to use only Parallel and not use custom forward pass.

It was both challenging & confusing simultaneously as I was continuously getting dimension mismatch error. I knew it very well that it's because of Parallel, and the tensors were propagating through the maxpool which it wasn't supposed to.

I tried debugger as well, but there seemed some problem with it.

Following helped :

I drew architecture on paper, and wrote down all layers, and matched with my code.
Then I localized the error.

I was successfully able to resolve the error by moving

layers = Chain(layers, decoder_layer)

to before the decoder block. I realized that I have been chaining the layers with decoder, (whilst decoder layers are Chain of concat and decoder conv layers). This was causing error as chaining at this stage would make all tensors flow through concat again, thus dimensionmismatch.

Moving it before avoided this case.

darsnack

Great job! I still need to review the architecture details, but here are some initial minor changes.

src/convnets/unet.jl

test/convnets.jl

src/Metalhead.jl

shivance · 2023-01-02T11:58:00Z

Gtg for next round of review @darsnack @ToucheSir @pri1311 !

src/convnets/alexnet.jl

This reverts commit ca73586.

pri1311 · 2023-01-03T09:25:19Z

Following is the output I get from loading the model:

UNet(
  Chain(
    Chain(
      Chain([
        Chain(
          conv1 = Conv((3, 3), 3 => 32, pad=1),  # 896 parameters
          norm1 = BatchNorm(32, relu),  # 64 parameters, plus 64
          conv2 = Conv((3, 3), 32 => 32, pad=1),  # 9_248 parameters
          norm2 = BatchNorm(32, relu),  # 64 parameters, plus 64
        ),
        Chain(
          conv1 = Conv((3, 3), 32 => 64, pad=1),  # 18_496 parameters
          norm1 = BatchNorm(64, relu),  # 128 parameters, plus 128
          conv2 = Conv((3, 3), 64 => 64, pad=1),  # 36_928 parameters
          norm2 = BatchNorm(64, relu),  # 128 parameters, plus 128
        ),
        Chain(
          conv1 = Conv((3, 3), 64 => 128, pad=1),  # 73_856 parameters
          norm1 = BatchNorm(128, relu),  # 256 parameters, plus 256
          conv2 = Conv((3, 3), 128 => 128, pad=1),  # 147_584 parameters
          norm2 = BatchNorm(128, relu),  # 256 parameters, plus 256
        ),
        Chain(
          conv1 = Conv((3, 3), 128 => 256, pad=1),  # 295_168 parameters
          norm1 = BatchNorm(256, relu),  # 512 parameters, plus 512
          conv2 = Conv((3, 3), 256 => 256, pad=1),  # 590_080 parameters
          norm2 = BatchNorm(256, relu),  # 512 parameters, plus 512
        ),
      ]),
      Chain(
        conv1 = Conv((3, 3), 256 => 512, pad=1),  # 1_180_160 parameters
        norm1 = BatchNorm(512, relu),   # 1_024 parameters, plus 1_024
        conv2 = Conv((3, 3), 512 => 512, pad=1),  # 2_359_808 parameters
        norm2 = BatchNorm(512, relu),   # 1_024 parameters, plus 1_024
      ),
    ),
    Chain(
      Chain(
        conv1 = Conv((3, 3), 512 => 256, pad=1),  # 1_179_904 parameters
        norm1 = BatchNorm(256, relu),   # 512 parameters, plus 512
        conv2 = Conv((3, 3), 256 => 256, pad=1),  # 590_080 parameters
        norm2 = BatchNorm(256, relu),   # 512 parameters, plus 512
      ),
      Chain(
        conv1 = Conv((3, 3), 256 => 128, pad=1),  # 295_040 parameters
        norm1 = BatchNorm(128, relu),   # 256 parameters, plus 256
        conv2 = Conv((3, 3), 128 => 128, pad=1),  # 147_584 parameters
        norm2 = BatchNorm(128, relu),   # 256 parameters, plus 256
      ),
      Chain(
        conv1 = Conv((3, 3), 128 => 64, pad=1),  # 73_792 parameters
        norm1 = BatchNorm(64, relu),    # 128 parameters, plus 128
        conv2 = Conv((3, 3), 64 => 64, pad=1),  # 36_928 parameters
        norm2 = BatchNorm(64, relu),    # 128 parameters, plus 128
      ),
      Chain(
        conv1 = Conv((3, 3), 64 => 32, pad=1),  # 18_464 parameters
        norm1 = BatchNorm(32, relu),    # 64 parameters, plus 64
        conv2 = Conv((3, 3), 32 => 32, pad=1),  # 9_248 parameters
        norm2 = BatchNorm(32, relu),    # 64 parameters, plus 64
      ),
    ),
  ),
)         # Total: 72 trainable arrays, 7_069_152 parameters,
          # plus 36 non-trainable, 5_888 parameters, summarysize 27.002 MiB.

I believe layers are missing. I am not completely well versed with Flux, but in my knowledge it should display all the layers, even the custom cat_channels function/layer

pri1311 · 2023-01-03T09:25:48Z

src/convnets/unet.jl

+	return Chain(conv1 = Conv(kernel, in_chs => out_chs; pad = (1, 1)),
+		norm1 = BatchNorm(out_chs, relu),
+		conv2 = Conv(kernel, out_chs => out_chs; pad = (1, 1)),
+		norm2 = BatchNorm(out_chs, relu))


Any specific reason for using named layers? I don't see it being used anywhere. I haven't seen Metalhead use such a code convention, so a seems a little inconsistent with the code base.

Agree with this, it looks weird – I think the only place we might need named layers is if we specifically need to index into a Chain later for use and the name of the layer isn't apparent. Here I don't see that happening.

ToucheSir · 2023-01-22T18:06:58Z

src/convnets/unet.jl

+end
+@functor UNet
+
+function UNet(imsize::Dims{2} = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,


Suggested change

function UNet(imsize::Dims{2} = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,

function UNet(imsize::Dims = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,

Is there anything in the UNet implementation that would prevent us from generalizing it to 1, 3 or more dimensions?

Due to my own ignorance, which dimensions are spatial in the 1 and N>2 cases? Meaning which ones should be downscaled?

Same as with 2D. Spatial dimensions x channels/features x batch size, so all but the last two assuming the usual memory layout.

@shivance I think the point is that you don't need any changes other than dropping the type restriction to generalize to more dimensions.

But we'd want to have that in the test, so we can save it for another PR if you'd like.

shivance · 2023-01-22T18:30:31Z

It's funny how many rounds of reviews, architecture changes, this PR has had. Over a month since it's being reviewed 😆
Contributing to open source requires a lots of perseverance I must say 🔢

ToucheSir · 2023-01-24T21:06:38Z

Thanks for your patience, I think we're very close!

What you're experiencing is a triple learning curve of sorts. Julia is a new language for most contributors and so it takes longer to learn idiomatic code patterns than e.g. already knowing idiomatic Python. Flux is a new library for most contributors and thus folks are less familiar with what's available to use + limitations than they would be with PyTorch/TF. Metalhead is even more domain-specific and more opinionated because it sits at a higher level in the stack. I think a good analogy would be opening a PR contributing a new model to timm after just a month or two of Python experience ;)

Some things we can do on our side to flatten the learning curve:

Add pre-commit hooks and CI checks for formatting so zero review time is consumed on it
Add difficulty markers to feature issues so that potential contributors know what they're getting into. GH labels could work here.
Write proper devdocs for Flux and Metalhead. This is a much bigger project and almost certainly will be an ongoing one.

shivance · 2023-01-25T03:34:33Z

Thanks @ToucheSir !

Are we still going with n dimensional unet?

shivance · 2023-01-25T19:21:43Z

Write proper devdocs for Flux and Metalhead. This is a much bigger project and almost certainly will be an ongoing one.

@ToucheSir Come to think of it, this could be a potential GSoD project!

darsnack · 2023-01-26T01:13:16Z

Unfortunately, I think GSoCs are not allowed to be solely for documentation (I'll have to double check this). But you can propose it for GSoD!

shivance · 2023-01-26T01:20:07Z

@ToucheSir @darsnack I'm willing to open a follow up PR to add N spatial dimensional support.
Let's get the PR for 2 dimension merged in first !
(It's kind of demotivating for me to drag this PR further after so many rounds, feels like no result of all this work) 😅

Open for review in current state...

darsnack

This looks like it is ready to merge modulo one small docstring issue.

PRs can take time (sometimes extenuated by our ability to review frequently). In our case, this is even more true since FluxML is very community-driven. This makes our development extremely distributed, and it is important that PRs are "release ready" before merging. Otherwise, simple changes can get bottlenecked from release due to larger changes that require refactoring/polishing.

Your patience and hard work is very much appreciated. The long review time is not a reflection of your work, just a consequence of the fact that we're all contributing on a volunteer basis. Please don't feel discouraged! Some of the most prolific Julia contributors have high impact PRs that take months to get right. So you're in good company!

src/convnets/unet.jl

darsnack · 2023-01-26T01:38:54Z

src/convnets/unet.jl

+end
+@functor UNet
+
+function UNet(imsize::Dims{2} = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,


@shivance I think the point is that you don't need any changes other than dropping the type restriction to generalize to more dimensions.

But we'd want to have that in the test, so we can save it for another PR if you'd like.

Co-authored-by: Kyle Daruwalla <[email protected]>

shivance · 2023-01-26T02:28:07Z

@darsnack So I'll leave the signature of imsize as

imsize::Dims{2} = (256, 256)

for now?

Or make it

UNet(imsize::Dims= (256, 256)

darsnack

Let's leave it as is for now. Good job!

shivance · 2023-01-27T03:16:57Z

Thanks @darsnack !

shivance · 2023-01-27T04:03:07Z

Thank you @ToucheSir @darsnack !

Outdated

shivance added 2 commits December 27, 2022 20:30

model implemented

ba54cf0

adding documentation

11c50d9

pri1311 reviewed Dec 27, 2022

View reviewed changes

ran juliaformatter

ca73586

darsnack reviewed Dec 28, 2022

View reviewed changes

src/convnets/unet.jl Outdated Show resolved Hide resolved

shivance requested review from darsnack and pri1311 and removed request for darsnack and pri1311 December 30, 2022 03:57

pri1311 reviewed Dec 31, 2022

View reviewed changes

src/utilities.jl Outdated Show resolved Hide resolved

removed custom forward pass using Parallel

552a8fd

shivance requested review from ToucheSir and darsnack and removed request for darsnack and ToucheSir January 1, 2023 15:58

removing _random_normal

c577aed

darsnack requested changes Jan 1, 2023

View reviewed changes

src/convnets/unet.jl Outdated Show resolved Hide resolved

src/convnets/unet.jl Outdated Show resolved Hide resolved

src/convnets/unet.jl Outdated Show resolved Hide resolved

test/convnets.jl Outdated Show resolved Hide resolved

pri1311 reviewed Jan 1, 2023

View reviewed changes

src/Metalhead.jl Outdated Show resolved Hide resolved

incorporating suggested changes

fb642c4

shivance requested a review from darsnack January 2, 2023 11:57

ToucheSir reviewed Jan 2, 2023

View reviewed changes

src/convnets/alexnet.jl Outdated Show resolved Hide resolved

Revert "ran juliaformatter"

7c7b1ee

This reverts commit ca73586.

pri1311 reviewed Jan 3, 2023

View reviewed changes

minor change

4735dff

ToucheSir reviewed Jan 22, 2023

View reviewed changes

minor edit

3bebe5a

shivance requested review from darsnack and theabhirath and removed request for lorenzoh, darsnack and theabhirath January 24, 2023 12:50

shivance requested review from ToucheSir and removed request for darsnack January 25, 2023 19:17

darsnack requested changes Jan 26, 2023

View reviewed changes

Update src/convnets/unet.jl

65aa5e8

Co-authored-by: Kyle Daruwalla <[email protected]>

darsnack approved these changes Jan 26, 2023

View reviewed changes

darsnack merged commit 80ab995 into FluxML:master Jan 27, 2023

shivance deleted the unet branch January 30, 2023 18:37

shivance mentioned this pull request Jan 30, 2023

Support for convolution on N-dimensions #213

Closed

shivance mentioned this pull request Feb 16, 2023

Add Tutorial Image Segmentation using Metalhead's UNet FluxML/Flux.jl#2192

Closed

CarloLucibello mentioned this pull request May 7, 2023

Add model implementations #112

Open

46 tasks

shivance changed the title ~~Adding UNet implementation~~ Adding UNet Model Aug 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding UNet Model #210

Adding UNet Model #210

shivance commented Dec 27, 2022 •

edited

Loading

shivance commented Dec 27, 2022 •

edited

Loading

ToucheSir commented Dec 27, 2022

pri1311 left a comment

shivance commented Dec 28, 2022

shivance commented Jan 1, 2023

darsnack left a comment

shivance commented Jan 2, 2023

pri1311 commented Jan 3, 2023 •

edited

Loading

pri1311 Jan 3, 2023

theabhirath Jan 6, 2023

ToucheSir Jan 22, 2023

darsnack Jan 25, 2023

ToucheSir Jan 25, 2023

darsnack Jan 26, 2023

shivance commented Jan 22, 2023 •

edited

Loading

ToucheSir commented Jan 24, 2023

shivance commented Jan 25, 2023

shivance commented Jan 25, 2023 •

edited

Loading

darsnack commented Jan 26, 2023

shivance commented Jan 26, 2023

darsnack left a comment

darsnack Jan 26, 2023

shivance commented Jan 26, 2023

darsnack left a comment

shivance commented Jan 27, 2023

shivance commented Jan 27, 2023

	function UNet(imsize::Dims{2} = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,
	function UNet(imsize::Dims = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,

Adding UNet Model #210

Adding UNet Model #210

Conversation

shivance commented Dec 27, 2022 • edited Loading

PR Checklist

shivance commented Dec 27, 2022 • edited Loading

ToucheSir commented Dec 27, 2022

pri1311 left a comment

Choose a reason for hiding this comment

shivance commented Dec 28, 2022

shivance commented Jan 1, 2023

darsnack left a comment

Choose a reason for hiding this comment

shivance commented Jan 2, 2023

pri1311 commented Jan 3, 2023 • edited Loading

pri1311 Jan 3, 2023

Choose a reason for hiding this comment

theabhirath Jan 6, 2023

Choose a reason for hiding this comment

ToucheSir Jan 22, 2023

Choose a reason for hiding this comment

darsnack Jan 25, 2023

Choose a reason for hiding this comment

ToucheSir Jan 25, 2023

Choose a reason for hiding this comment

darsnack Jan 26, 2023

Choose a reason for hiding this comment

shivance commented Jan 22, 2023 • edited Loading

ToucheSir commented Jan 24, 2023

shivance commented Jan 25, 2023

shivance commented Jan 25, 2023 • edited Loading

darsnack commented Jan 26, 2023

shivance commented Jan 26, 2023

darsnack left a comment

Choose a reason for hiding this comment

darsnack Jan 26, 2023

Choose a reason for hiding this comment

shivance commented Jan 26, 2023

darsnack left a comment

Choose a reason for hiding this comment

shivance commented Jan 27, 2023

shivance commented Jan 27, 2023

shivance commented Dec 27, 2022 •

edited

Loading

shivance commented Dec 27, 2022 •

edited

Loading

pri1311 commented Jan 3, 2023 •

edited

Loading

shivance commented Jan 22, 2023 •

edited

Loading

shivance commented Jan 25, 2023 •

edited

Loading