Inconsistent between implementation and paper descriptions #30

hoangmit · 2023-07-01T01:09:29Z

In the paper Algorithm 3, for hyena order N, there are (N+1) projections, and N filters
with order=2, it returns
mlp2(x) * FFTConv(mlp1(x) * FFTConv(mlp0(x), filter0), filter1)

However, in the implementation e.g.

safari/standalone_hyena.py

Line 244 in 4f5972c

v = self.dropout(v * x_i)

For hyena order N, there are (N+1) projections and (N-1) filters
In the code, for example, with order=2,
it will do mlp2(x) * FFTConv(mlp0(x) * mlp1(x), filter0)

i.e., for order=N there is only (N-1) FFTConv applications.

is it intentional or am I missing something (the code is quite convoluted) ?

A lot of the experiment had done with order=2. Does that mean one application of FFTConv per layer is enough ?

The text was updated successfully, but these errors were encountered:

Zymrael · 2023-07-01T06:38:45Z

See #9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inconsistent between implementation and paper descriptions #30

Inconsistent between implementation and paper descriptions #30

hoangmit commented Jul 1, 2023 •

edited

Loading

Zymrael commented Jul 1, 2023

Inconsistent between implementation and paper descriptions #30

Inconsistent between implementation and paper descriptions #30

Comments

hoangmit commented Jul 1, 2023 • edited Loading

Zymrael commented Jul 1, 2023

hoangmit commented Jul 1, 2023 •

edited

Loading