
Apply the attention mask in all decoding steps (LM inference) #2532

Merged: 18 commits, merged on Dec 15, 2023

Commits on Nov 30, 2023

  1. wip
    l-k-11235 committed Nov 30, 2023
    d7b5f56

Commits on Dec 4, 2023

  1. fixed attention mask
    l-k-11235 committed Dec 4, 2023
    be6adab
  2. some code cleaning
    l-k-11235 committed Dec 4, 2023
    4644bc8

Commits on Dec 5, 2023

  1. c94d8c9

Commits on Dec 6, 2023

  1. wip
    l-k-11235 committed Dec 6, 2023
    3116271
  2. faae43d

Commits on Dec 8, 2023

  1. 40caae8

Commits on Dec 11, 2023

  1. wip
    l-k-11235 committed Dec 11, 2023
    a805ad9
  2. apply pad_mask for all decoding steps when batch_size is greater than 1 - works for 'classical attention'
    l-k-11235 committed Dec 11, 2023
    f3ed229
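The f3ed229 commit above carries the core of the fix for 'classical attention': once batch_size is greater than 1, the prompt padding mask has to be re-applied over the cached keys at every decoding step, not only while the prompt is being processed. Below is a minimal sketch of that idea in plain PyTorch; the function name, tensor names, and shapes are assumptions for illustration, not OpenNMT-py's actual code.

```python
import torch

def attend_with_pad_mask(query, key_cache, value_cache, pad_mask):
    # query:       (batch, heads, 1, d_k)          the current decoding step
    # key_cache:   (batch, heads, cached_len, d_k)
    # value_cache: (batch, heads, cached_len, d_k)
    # pad_mask:    (batch, 1, 1, cached_len) bool, True where the cached position is padding
    scores = query @ key_cache.transpose(-2, -1) / key_cache.size(-1) ** 0.5
    # Re-apply the padding mask over the whole cache at every step, so padded
    # prompt positions never receive attention mass from later generated tokens.
    scores = scores.masked_fill(pad_mask, float("-inf"))
    attn = torch.softmax(scores, dim=-1)
    return attn @ value_cache

# Toy batch of 2 prompts; the second prompt is padded on its first two positions.
batch, heads, cached_len, d_k = 2, 4, 5, 8
q = torch.randn(batch, heads, 1, d_k)
k = torch.randn(batch, heads, cached_len, d_k)
v = torch.randn(batch, heads, cached_len, d_k)
src_pad = torch.tensor([[False] * 5, [True, True, False, False, False]])
out = attend_with_pad_mask(q, k, v, src_pad.view(batch, 1, 1, cached_len))
print(out.shape)  # torch.Size([2, 4, 1, 8])
```

Without the masked_fill at each step, the padded positions of the second example would regain attention mass as soon as generation moves past the prompt.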

Commits on Dec 12, 2023

  1. wip
    l-k-11235 committed Dec 12, 2023
    829a861
  2. handle finished hypotheses
    l-k-11235 committed Dec 12, 2023
    8333d04
  3. wip
    l-k-11235 committed Dec 12, 2023
    50c0754

Commits on Dec 13, 2023

  1. use key_pad_mask in transformer LM attention layers to handle beam_size > 1 thanks to map_state
    l-k-11235 committed Dec 13, 2023
    b50a7e1
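The b50a7e1 commit extends the same idea to beam search: with beam_size > 1 the effective batch becomes batch * beam_size, so the key padding mask must be tiled and reordered together with the cached decoder state, which is what the commit message credits to map_state. Below is a hedged sketch of that bookkeeping in plain PyTorch; both helper names are hypothetical.

```python
import torch

def expand_pad_mask_for_beams(pad_mask, beam_size):
    # (batch, src_len) bool -> (batch * beam_size, src_len), beams of one example kept contiguous.
    return pad_mask.repeat_interleave(beam_size, dim=0)

def reorder_pad_mask(pad_mask, select_indices):
    # Mirror what the decoder state update does for cached keys/values: keep only the surviving beams.
    return pad_mask.index_select(0, select_indices)

# Toy usage: 2 source sequences (the second has two padded tail positions), beam of 3.
pad_mask = torch.tensor([[False, False, False], [False, True, True]])
beam_mask = expand_pad_mask_for_beams(pad_mask, beam_size=3)  # shape (6, 3)
survivors = torch.tensor([0, 2, 1, 3, 5, 4])                  # example beam reordering
beam_mask = reorder_pad_mask(beam_mask, survivors)
print(beam_mask.shape)  # torch.Size([6, 3])
```

The expanded mask then broadcasts against attention scores of shape (batch * beam_size, heads, 1, cached_len) exactly as in the single-beam case.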

Commits on Dec 14, 2023

  1. 7c49902

Commits on Dec 15, 2023

  1. some code cleaning
    l-k-11235 committed Dec 15, 2023
    1bc3212
  2. dc34150
  3. removed empty lines
    l-k-11235 committed Dec 15, 2023
    3dcda9e
  4. bbc28a8