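When computing local attention, why does the feature map need to be reshaped to (H/P × W/P, N × P², C), i.e., with P² placed in the second-to-last dimension?
When computing global attention, the feature map is instead reshaped to (N × G², H/G × W/G, C), and then the second-to-last and third-to-last dimensions are swapped, giving (H/G × W/G, N × G², C). Since this final shape has the same form as the local one, why not apply the same transformation directly instead of performing an extra dimension swap?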
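One way to examine the question is to track which pixels end up in the same attention group under each transformation. The sketch below is only an illustration, not this repository's code: it assumes the common block/grid partition scheme, in which the local split is H → (H/P, P) and the global split is H → (G, H/G) followed by the swap. The batch dimension N and the channel dimension C are left out, and the function names `block_groups` and `grid_groups` are hypothetical.

```python
def block_groups(H, W, P):
    """Local case (assumed): split H -> (H/P, P) and W -> (W/P, P).
    Each group is one contiguous P x P window."""
    groups = []
    for bi in range(H // P):          # window row index
        for bj in range(W // P):      # window column index
            groups.append([(bi * P + i, bj * P + j)
                           for i in range(P) for j in range(P)])
    return groups


def grid_groups(H, W, G):
    """Global case (assumed): split H -> (G, H/G) and W -> (G, W/G),
    then swap axes so the G x G axis becomes the token axis.
    Each group holds G x G pixels spaced H/G (or W/G) apart."""
    sh, sw = H // G, W // G           # stride between pixels in a group
    groups = []
    for oi in range(sh):              # offset inside one grid cell (rows)
        for oj in range(sw):          # offset inside one grid cell (columns)
            groups.append([(gi * sh + oi, gj * sw + oj)
                           for gi in range(G) for gj in range(G)])
    return groups


if __name__ == "__main__":
    local = block_groups(4, 4, 2)
    global_ = grid_groups(4, 4, 2)
    print("local group 0: ", local[0])
    print("global group 0:", global_[0])
```

Under these assumptions the two results have the same shape (four groups of four pixels each), but the groups contain different pixels: the local groups are adjacent neighbours, while the global groups are strided samples covering the whole map. If that is how the repository partitions the feature map, the extra swap is what selects the strided grouping, and applying the local transformation with G in place of P would produce ordinary G × G windows instead.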