Add activation tests for GPU layers #1472

Merged · 22 commits into master · Mar 24, 2021
Conversation

DhairyaLGandhi
Member

This is in response to #1350: we should be routinely testing our layers with multiple activation functions to catch these errors early.
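For illustration, here is a minimal sketch of what such a test loop could look like; the layer, array sizes, and activation list are assumptions for the example, and the PR's actual helper is gpu_gradtest rather than this code.

using Flux, CUDA, Test

# Activation list is illustrative; a handful of common activations.
acts = [identity, relu, tanh, sigmoid, elu, selu]

@testset "ConvTranspose on GPU with $act" for act in acts
    m = ConvTranspose((3, 3), 3 => 4, act) |> gpu
    x = CUDA.rand(Float32, 28, 28, 3, 1)
    @test m(x) isa CuArray                             # forward pass runs on the GPU
    gs = gradient(() -> sum(m(x)), Flux.params(m))
    @test all(p -> gs[p] !== nothing, Flux.params(m))  # a gradient exists for every parameter
end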

@DhairyaLGandhi
Member Author

Why are so many RNN tests marked broken, @CarloLucibello? Was that introduced in #1367?

@CarloLucibello
Member

CarloLucibello commented Jan 22, 2021

Was that introduced in #1367?

Is this a question? If so, I don't think it was; #1367 fixed many issues and actually unbroke one test.

I don't remember when the tests were marked as broken; you could try git blame.

@DhairyaLGandhi
Member Author

Well, this PR does surface some disagreements between the CPU and GPU implementations, which shouldn't be there.

#1367 was also a major overhaul, and to the best of my knowledge it isn't accurate in its claim of beating cuDNN. With @denizyuret's cuDNN PR in CUDA, we would be best served by benchmarking the two and using the cuDNN API properly.
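As a rough sketch of what such a benchmark could look like (sizes are arbitrary, and only the current Flux path is shown since the cuDNN-backed API is not settled here):

using Flux, CUDA, BenchmarkTools

rnn = RNN(128, 128) |> gpu          # current Flux implementation
x   = CUDA.rand(Float32, 128, 16)   # features × batch

@btime CUDA.@sync $rnn($x)          # sync so the GPU kernel time is actually measured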

@DhairyaLGandhi
Member Author

Test:

@ModelZookeeper commands

@DhairyaLGandhi
Member Author

Here are the commands: build feed commands

@darsnack
Member

darsnack commented Feb 1, 2021

Does the bot only respond to the person who asked? I don't see any comments from the bot at all.

@DhairyaLGandhi
Member Author

DhairyaLGandhi commented Feb 1, 2021

The one I emoji'd on (ref #1472 (comment)) was made by the bot. I have to generate some new tokens so it's not using my account for any of this.

@darsnack
Member

darsnack commented Feb 1, 2021

Ah cool!!

@DhairyaLGandhi
Member Author

Test 2:

@ModelZookeeper commands

@ModelZookeeper

Here are the commands: build feed commands

@DhairyaLGandhi linked an issue Mar 23, 2021 that may be closed by this pull request
@DhairyaLGandhi merged commit 5c40716 into master Mar 24, 2021
@CarloLucibello
Member

@DhairyaLGandhi I wanted to review this; could you ask for reviews before merging PRs?


batch_norm = [BatchNorm]
gpu_gradtest("BatchNorm 1 with $act", batch_norm, rand(Float32, 28,28,3,4), 3, act, test_cpu = false) #TODO fix errors
gpu_gradtest("BatchNorm 2 with $act", batch_norm, rand(Float32, 5,4), 5, act, test_cpu = false)
Member

test_cpu used to be true; why are they all false now? I think that checking that the results are the same on CPU and GPU is very important.

Member Author

This PR doesn't change the src.

There's a comment explaining this.

Member

How does setting test_cpu=false, when it used to be true, not change things?
I don't know which comment you're referring to.
Also, why are there some #TODO fix errors comments?
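For context, a test_cpu = true run presumably amounts to a check along these lines; this is a hypothetical sketch with assumed names and tolerance, not the actual gpu_gradtest internals.

using Flux, CUDA, Test

m_cpu = BatchNorm(3)
x_cpu = rand(Float32, 28, 28, 3, 4)
m_gpu, x_gpu = m_cpu |> gpu, x_cpu |> gpu

@test m_cpu(x_cpu) ≈ cpu(m_gpu(x_gpu)) atol=1f-3   # forward pass agreement

g_cpu = gradient(() -> sum(m_cpu(x_cpu)), Flux.params(m_cpu))
g_gpu = gradient(() -> sum(m_gpu(x_gpu)), Flux.params(m_gpu))
for (p_c, p_g) in zip(Flux.params(m_cpu), Flux.params(m_gpu))
    @test g_cpu[p_c] ≈ cpu(g_gpu[p_g]) atol=1f-3   # gradient agreement
end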

# m_cpu(x_cpu)
# gradient(() -> sum(m_cpu(x_cpu)), Flux.params(m_cpu))
# m_gpu(x_gpu)
# gradient(() -> sum(m_gpu(x_gpu)), Flux.params(m_gpu))
@CarloLucibello
Member

CarloLucibello commented Mar 24, 2021

Why have these been commented out?

@DhairyaLGandhi
Member Author

I have been waiting for any objections on this for a while, actually. I'm pretty sure I wanted this in either way.

gpu_gradtest("BatchNorm 2 with $act", batch_norm, rand(Float32, 5,4), 5, act, test_cpu = false)

instancenorm = [InstanceNorm]
gpu_gradtest("InstanceNorm with $act", instancenorm, r, 1, act, test_cpu = false)
Member

Some of these layers used to be checked in both testmode and trainmode; that's no longer true.

Member Author

But those are covered by the layer-specific tests in normalisation.jl. If you want to add them back here, I can do a follow-on; it would be easier to review that separately anyway.

Member

This should be tested here as well, since we currently have bugs like #1542 that are GPU-only.
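A hedged sketch of what exercising a layer in both modes on the GPU could look like (layer and sizes are illustrative, not this PR's code):

using Flux, CUDA, Test

m = InstanceNorm(3) |> gpu
x = CUDA.rand(Float32, 28, 28, 3, 4)

Flux.trainmode!(m)
y_train = m(x)   # training-mode behaviour

Flux.testmode!(m)
y_test = m(x)    # inference-mode behaviour

@test y_train isa CuArray && y_test isa CuArray
@test size(y_train) == size(y_test) == size(x)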

@CarloLucibello
Member

CarloLucibello commented Mar 24, 2021

I have been waiting for any objections on this for a while, actually. I'm pretty sure I wanted this in either way.

There is a button in the top right corner where you can request a review, or you can just ping someone, as everyone does all the time.

I'm pretty sure we don't want to remove a lot of test coverage as this PR does.

@CarloLucibello
Member

Since you have a lot of stuff lying around incomplete, it's hard to tell what is ready for review, so please do ask explicitly for feedback.

@DhairyaLGandhi deleted the dg/acttests branch May 13, 2021 11:08
Successfully merging this pull request may close these issues.

ConvTranspose on GPU fails with certain activation functions