feat: MetaField Ranker #6189

domenicocinque · 2023-10-29T14:46:03Z

Related Issues

fixes Ranker based on custom meta field #6054

Proposed Changes:

proposal: meta field ranker #6141

Why:

To allow users to rank documents by a relevant metadata field after having used a retriever.

How can it be used:

from haystack.preview.components.rankers.meta_field import MetaFieldRanker
from haystack.preview.dataclasses import Document 

# Documents coming from a retriever 
documents = [
    Document(content="Product 1", meta={"rating": 1.3}, score=0.3),
    Document(content="Product 2", meta={"rating": 0.7}, score=0.4),
    Document(content="Product 3", meta={"rating": 2.1}, score=0.6),
]

ranker = MetaFieldRanker(
    metadata_field="rating",
    ranking_mode="reciprocal_rank_fusion", 
    weight=0.5
)

sorted_documents = ranker.run(query="", documents=documents)
print(sorted_documents)

The example shows how the component can be used to rank documents by combining a meta field of choice ("rating" in this case) and the score of the retriever.

How did you test it?

Tested locally and with unit tests

Notes for the reviewer

The implementation is based on the Recentness Ranker. However I made some changes such as renaming the "score" ranking method to "linear_score" to make it more specific. Moreover I separated the logic that reranks the results in another function, in order to make it possible to inherit from this class for the implementation of a Recentness Ranker in Haystack 2.0

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
I documented my code
I ran pre-commit hooks and fixed any issue

Co-authored-by: ZanSara <[email protected]>

vblagoje · 2023-10-30T16:37:21Z

Hey @domenicocinque , thanks for opening this PR. Would you please provide a bit more context about using this component? See, for example How it can be used section of #6199 It'll help greatly not only me but also @dfokina who will help us with the docs.

For the release note CI failure, you need to add a release note with reno tool. See https://github.com/deepset-ai/haystack/blob/main/CONTRIBUTING.md#release-notes for more details.

domenicocinque · 2023-11-02T15:48:11Z

Hi @vblagoje, thanks for the fast response. I added the How it can be used section in the PR description and the release notes in the code. Please let me know if it needs to be improved

vblagoje · 2023-11-03T10:18:48Z

@domenicocinque Your explanation helped me a lot to understand the use case of this component. However, what I'm not 100% sure about is whether this component should be integrated into our core packages or included as a valuable community integration. Let me consult internally and we'll get back to you soon.

vblagoje

@domenicocinque looks quite solid already. I left one comment that I think should improve code readability and performance as well

haystack/preview/components/rankers/meta_field.py

vblagoje · 2023-11-06T09:45:17Z

@domenicocinque great work, thank you. One last request - let's add a unit test for the non-happy path in linear_score mode when the score is invalid. Tests those logs in the caplog are captured. Also, in init, let's raise ValueError instead of ComponentError. ComponentErrors are mostly reserved for the run method to signal the invalid component state preventing execution.

vblagoje · 2023-11-06T09:54:51Z

@dfokina I don't expect any more changes for this PR after @domenicocinque's next commit. Please have a look at it after he commits his last change and make any pydoc corrections 🙏

vblagoje · 2023-11-09T08:21:52Z

Thanks for the update @domenicocinque and for this overall valuable contribution! @dfokina have a pass now, make any needed changes, and we are ready to 🚢

dfokina · 2023-11-09T10:30:37Z

All done from my side too! 🚀

domenicocinque and others added 8 commits October 20, 2023 19:50

proposal: meta field ranker

d20a764

Merge branch 'main' into feat/metafieldranker

6f20214

Apply suggestions from code review

5b1618b

Co-authored-by: ZanSara <[email protected]>

update proposal filename

92bf5cf

Merge branch 'main' into feat/metafieldranker

7efb91f

Merge branch 'main' into feat/metafieldranker

6df0658

feat: add metafield ranker

be760d8

fix docstrings

54dc030

domenicocinque requested review from a team as code owners October 29, 2023 14:46

domenicocinque requested review from dfokina and vblagoje and removed request for a team October 29, 2023 14:46

github-actions bot added topic:tests proposal 2.x Related to Haystack v2.0 type:documentation Improvements on the docs labels Oct 29, 2023

domenicocinque added 2 commits October 29, 2023 15:57

Merge branch 'deepset-ai:main' into code/metafieldranker

1c13fd0

remove proposal file from pr

fb64da3

github-actions bot removed the proposal label Oct 29, 2023

Merge branch 'deepset-ai:main' into code/metafieldranker

1faef5e

domenicocinque added 3 commits November 2, 2023 16:08

add release notes

ac3cc8d

Merge branch 'deepset-ai:main' into code/metafieldranker

f89aceb

update code according to new Document class

3b78b0b

vblagoje requested changes Nov 3, 2023

View reviewed changes

haystack/preview/components/rankers/meta_field.py Outdated Show resolved Hide resolved

Merge branch 'main' into code/metafieldranker

8492f76

separate loops for each ranking mode in __merge_scores

722abc8

domenicocinque added 2 commits November 8, 2023 19:59

Merge branch 'main' into code/metafieldranker

b12fb1e

change error type in init and new tests for linear score warning

a32d636

vblagoje self-requested a review November 9, 2023 09:00

vblagoje approved these changes Nov 9, 2023

View reviewed changes

docstring upd

7b6ec8b

dfokina approved these changes Nov 9, 2023

View reviewed changes

dfokina merged commit 676da68 into deepset-ai:main Nov 9, 2023
21 checks passed

domenicocinque deleted the code/metafieldranker branch November 9, 2023 11:31

julian-risch mentioned this pull request Nov 13, 2023

Remove unused query parameter from MetaFieldRanker 2.0 #6292

Closed

This was referenced Nov 13, 2023

feat: Removed the unused "query" parameter from MetaFieldRanker #6293

Closed

feat: Removed unused "query" parameter from MetaFieldRanked #6299

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: MetaField Ranker #6189

feat: MetaField Ranker #6189

domenicocinque commented Oct 29, 2023 •

edited

Loading

vblagoje commented Oct 30, 2023

domenicocinque commented Nov 2, 2023 •

edited

Loading

vblagoje commented Nov 3, 2023

vblagoje left a comment

vblagoje commented Nov 6, 2023

vblagoje commented Nov 6, 2023

vblagoje commented Nov 9, 2023

dfokina commented Nov 9, 2023

feat: MetaField Ranker #6189

feat: MetaField Ranker #6189

Conversation

domenicocinque commented Oct 29, 2023 • edited Loading

Related Issues

Proposed Changes:

Why:

How can it be used:

How did you test it?

Notes for the reviewer

Checklist

vblagoje commented Oct 30, 2023

domenicocinque commented Nov 2, 2023 • edited Loading

vblagoje commented Nov 3, 2023

vblagoje left a comment

Choose a reason for hiding this comment

vblagoje commented Nov 6, 2023

vblagoje commented Nov 6, 2023

vblagoje commented Nov 9, 2023

dfokina commented Nov 9, 2023

domenicocinque commented Oct 29, 2023 •

edited

Loading

domenicocinque commented Nov 2, 2023 •

edited

Loading