Embedding Layer #205
Conversation
@jvdp1 @milancurcic Ready for review
LGTM. Below are some minor suggestions.
@milancurcic Your opinion?
Thanks for the nudge. Will try to finish review and merge either today or Monday. Thank you for all the hard work!
@OneAdder Can you add the Embedding layer entry to the table of layers in the README? I assume that based on it being provided in
@milancurcic Readme updated. Thank you for resolving conflicts! |
Thank you!
Input Embeddings Lookup Table (Trainable)
Core
In Natural Language Processing, input data is often encoded as indices of tokens in a vocabulary. Those indices are converted into vectors using trainable weights.
In theory, similar behaviour can be achieved by expanding the input data into vectors of the desired size and putting them through `input2d` and `linear2d` layers. However, that is very inefficient, as we would have to do a `matmul` each time instead of simply selecting an element by index. So, I created a layer that does this efficiently. It does not have a gradient, as it is intended as an input layer, but it is trainable (`get_params`, `get_gradients` and `set_params`).
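To illustrate the point, here is a small NumPy sketch (illustrative only, not the neural-fortran API; the variable names are hypothetical) showing that a one-hot matmul and a plain row lookup give the same result, while the lookup avoids the matrix product entirely:

```python
import numpy as np

vocab_size, model_dim = 10, 4
rng = np.random.default_rng(0)
weights = rng.normal(size=(vocab_size, model_dim))  # hypothetical trainable lookup table
indices = np.array([3, 1, 7])                       # token indices of a short sequence

# Inefficient route: one-hot encode the indices, then matmul
# (roughly what chaining input2d + linear2d would amount to).
one_hot = np.eye(vocab_size)[indices]
via_matmul = one_hot @ weights

# Efficient route: plain row lookup, which is what the embedding layer does.
via_lookup = weights[indices]

assert np.allclose(via_matmul, via_lookup)
```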
Positional
Apart from this core functionality, I also added positional encoding (the output vectors get sin/cos waves added to them according to their positions). This will be needed for transformers.
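For reference, a rough sketch of sinusoidal positional encoding, assuming the standard "Attention Is All You Need" formulation (the exact formulation used in this PR may differ; names and shapes are illustrative):

```python
import numpy as np

def positional_encoding(sequence_length: int, model_dim: int) -> np.ndarray:
    """Sinusoidal positional encoding; sin on even dimensions, cos on odd ones."""
    positions = np.arange(sequence_length)[:, None]   # (seq_len, 1)
    dims = np.arange(model_dim)[None, :]              # (1, model_dim)
    angle_rates = 1.0 / np.power(10000.0, (2 * (dims // 2)) / model_dim)
    angles = positions * angle_rates                  # (seq_len, model_dim)
    encoding = np.zeros((sequence_length, model_dim))
    encoding[:, 0::2] = np.sin(angles[:, 0::2])       # even dimensions: sine wave
    encoding[:, 1::2] = np.cos(angles[:, 1::2])       # odd dimensions: cosine wave
    return encoding

# The embedding output (seq_len x model_dim) simply has this added to it.
```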
Python Reference
Here: `torch.nn.Embedding` and a custom function for positional encoding.
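A hedged sketch of what that reference amounts to (illustrative only, not the actual reference script used for this PR):

```python
import torch

vocab_size, model_dim = 10, 8
embedding = torch.nn.Embedding(vocab_size, model_dim)   # trainable lookup table
tokens = torch.tensor([[3, 1, 7, 0, 2]])                # (batch, seq_len) of token indices
vectors = embedding(tokens)                             # (batch, seq_len, model_dim)
# Positional encoding is not part of torch.nn.Embedding; it is added separately,
# e.g. with a sinusoidal function like the sketch above.
```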