
feat: Support for gradient_checkpointing for Sentence Transformers training #5030

Closed
wants to merge 11 commits (update-st-training → main, base later changed to v1.x)

Conversation

sjrl
Contributor

@sjrl sjrl commented May 26, 2023

Related Issues

  • fixes N/A

Proposed Changes:

  • Added support for gradient_checkpointing for Sentence Transformers training. This can greatly reduce GPU memory usage, allowing much larger batch sizes at a modest cost in training time. This is particularly worthwhile for Multiple Negatives Ranking Loss (MNRL) training in Sentence Transformers, since larger batch sizes (upwards of 128) have been shown to significantly boost retrieval metrics when using MNRL.
  • More details on gradient checkpointing can be found here
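The mechanism behind this feature can be sketched independently of Haystack. Gradient checkpointing discards intermediate activations during the forward pass and recomputes them during backward, trading extra compute for lower peak memory. The module below is illustrative (the class name and flag are assumptions, not Haystack's actual API); it uses `torch.utils.checkpoint`, which is also what Hugging Face models delegate to when `gradient_checkpointing_enable()` is called.

```python
# Minimal sketch of gradient checkpointing with torch.utils.checkpoint.
# CheckpointedMLP and use_checkpointing are illustrative names, not part of
# Haystack or sentence-transformers.
import torch
from torch.utils.checkpoint import checkpoint


class CheckpointedMLP(torch.nn.Module):
    def __init__(self, dim=32, use_checkpointing=True):
        super().__init__()
        self.block = torch.nn.Sequential(
            torch.nn.Linear(dim, dim),
            torch.nn.ReLU(),
            torch.nn.Linear(dim, dim),
        )
        self.use_checkpointing = use_checkpointing

    def forward(self, x):
        if self.use_checkpointing and x.requires_grad:
            # Activations inside self.block are not stored; they are
            # recomputed during the backward pass to save memory.
            return checkpoint(self.block, x, use_reentrant=False)
        return self.block(x)


model = CheckpointedMLP()
x = torch.randn(8, 32, requires_grad=True)
loss = model(x).sum()
loss.backward()  # gradients flow through the recomputed block as usual
```

Gradients are numerically identical with and without checkpointing; only memory/compute trade-offs differ, which is why larger MNRL batches fit on the same GPU.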

How did you test it?

  • Added two new integration tests to test SentenceTransformer retriever training

Notes for the reviewer

Checklist

  • I have read the contributors guidelines and the code of conduct
  • I have updated the related issue with new insights and changes
  • I added tests that demonstrate the correct behavior of the change
  • I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
  • I documented my code
  • I ran pre-commit hooks and fixed any issue

@sjrl sjrl requested a review from a team as a code owner May 26, 2023 08:51
@sjrl sjrl requested review from ZanSara and removed request for a team May 26, 2023 08:51
@coveralls
Collaborator

coveralls commented May 26, 2023

Pull Request Test Coverage Report for Build 5256397223

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • 133 unchanged lines in 4 files lost coverage.
  • Overall coverage increased (+0.07%) to 42.0%

Files with coverage reduction                  New missed lines      %
nodes/prompt/invocation_layer/cohere.py               4           75.61%
nodes/prompt/invocation_layer/hugging_face.py         7           87.2%
nodes/retriever/dense.py                             55           25.98%
nodes/retriever/_embedding_encoder.py                67           36.47%

Totals: change from base Build 5253128824: 0.07%
Covered Lines: 9421
Relevant Lines: 22431

💛 - Coveralls

Contributor

@ZanSara ZanSara left a comment


A bit puzzled by one bit of code specifically, let's talk about it before moving forward.

In addition, please next time open an issue before the PR, so we can have a discussion on the issue about the feature's design, and on the PR about the implementation details. I'm curious where the demand for this specific feature comes from 🙂

Review comment threads (all outdated, resolved) on:

  • haystack/nodes/retriever/_embedding_encoder.py
  • haystack/nodes/retriever/dense.py (two threads)
  • test/nodes/test_retriever.py (two threads)
@sjrl sjrl force-pushed the update-st-training branch from c553e33 to 72156d1 Compare June 6, 2023 14:05
@sjrl sjrl requested a review from ZanSara June 6, 2023 14:46
Contributor

@ZanSara ZanSara left a comment


One more small thing to fix and this is ready to merge

Review comment thread (outdated, resolved) on: test/nodes/test_retriever.py
@sjrl sjrl requested a review from ZanSara June 7, 2023 10:54
Comment on lines +655 to +659
retriever = EmbeddingRetriever(
embedding_model="sentence-transformers/all-MiniLM-L6-v2",
model_format="sentence_transformers",
use_gpu=False,
)
Contributor


I'm really sorry I didn't notice it earlier 🙈 but is this initialization loading the model? If yes, can we mock the model as well so that we don't actually download and load the weights from hf?

If we can do that we'll be able to add the @pytest.mark.unit marker to this test. Otherwise it won't run in CI.
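The mocking the reviewer asks for can be sketched in a self-contained way. In the real test, the patch target would be wherever `SentenceTransformer` is imported inside Haystack (an assumption; the exact import path is not verified here). The stand-in module below only demonstrates the pattern: patch the loader at its point of use so the constructor never downloads weights, which is what makes a `@pytest.mark.unit` marker viable.

```python
# Self-contained demonstration of mocking a model loader with unittest.mock.
# fake_encoder, load_model, and FakeRetriever are stand-ins for the real
# Haystack module, loader, and EmbeddingRetriever.
import sys
import types
from unittest.mock import MagicMock, patch

# Stand-in for the module that would normally import SentenceTransformer.
encoder_mod = types.ModuleType("fake_encoder")


def load_model(name):
    raise RuntimeError("would download weights from the Hugging Face Hub")


encoder_mod.load_model = load_model
sys.modules["fake_encoder"] = encoder_mod


class FakeRetriever:
    def __init__(self, embedding_model):
        import fake_encoder

        # Without mocking, this call would hit the network.
        self.model = fake_encoder.load_model(embedding_model)


def build_without_download():
    # Patch at the point of use so __init__ never touches the network.
    with patch("fake_encoder.load_model", return_value=MagicMock()) as mocked:
        retriever = FakeRetriever("sentence-transformers/all-MiniLM-L6-v2")
    mocked.assert_called_once_with("sentence-transformers/all-MiniLM-L6-v2")
    return retriever
```

With the loader patched, the test exercises only the retriever's wiring, so it can run in CI as a unit test without GPU, network, or model weights.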

@masci masci changed the base branch from main to v1.x November 24, 2023 12:04
@masci masci added the 1.x label Nov 24, 2023
@ZanSara
Contributor

ZanSara commented Nov 29, 2023

As agreed with @sjrl we can close this for now and re-implement it in v2 later, if still relevant

@sjrl sjrl closed this Dec 11, 2023
@sjrl sjrl deleted the update-st-training branch June 3, 2024 08:38
4 participants