Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: Allow search in ChromaDocumentStore without metadata #863

Merged
merged 10 commits into from
Jul 8, 2024

Conversation

Amnah199
Copy link
Contributor

@Amnah199 Amnah199 commented Jul 1, 2024

Related Issues

Proposed Changes:

Update _query_result_to_documents to also run without metadata

How did you test it?

Added a unit test

Notes for the reviewer

NA

Checklist

@Amnah199 Amnah199 requested a review from a team as a code owner July 1, 2024 21:45
@Amnah199 Amnah199 requested review from silvanocerza and removed request for a team July 1, 2024 21:45
@github-actions github-actions bot added the type:documentation Improvements or additions to documentation label Jul 2, 2024
if metadatas := result.get("metadatas"):
document_dict["meta"] = dict(metadatas[i][j])
metadatas = result.get("metadatas")
if metadatas and metadatas[i][j] is not None:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can it ever happen that metadatas size is shorter than i? And metadatas[i] is shorter than j? 🤔

Cause if it can happen this could fail.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added an exception here as discussed.

@Amnah199
Copy link
Contributor Author

Amnah199 commented Jul 8, 2024

@silvanocerza Can you take a look at the test that's failing. I am wondering whats wrong there.

@Amnah199 Amnah199 merged commit abfe76e into main Jul 8, 2024
13 checks passed
@Amnah199 Amnah199 deleted the fix-metadata-chroma-document branch July 8, 2024 17:00
FloRul added a commit to FloRul/haystack-core-integrations that referenced this pull request Aug 13, 2024
commit 4c8c8813241947c8f41953a4407056eb4b761cf1
Author: Amna Mubashar <[email protected]>
Date:   Tue Aug 13 17:58:57 2024 +0200

    feat: make truncation optional for bedrock chat generator (#967)

    * Added truncate param to chat generator and adapters
    * Added tests to check truncation

    * Add doc_string

    * Fixed linting

commit 0451e6f43ad5731887ab0aa2aa8e4de9020913e6
Author: HaystackBot <[email protected]>
Date:   Mon Aug 12 15:56:05 2024 +0000

    Update the changelog

commit 0f1452ac3d7fcec1142473f5049cd0266b52784a
Author: David S. Batista <[email protected]>
Date:   Mon Aug 12 17:46:03 2024 +0200

    refactor: change meta data fields (#911)

    * initial import

    * formatting

    * fixing tests

    * removing warnings

    * linting issues

    * fixes due to conflicts

commit a8b2de9d86621aba2243e2d6e350e082c087700d
Author: Stefano Fiorucci <[email protected]>
Date:   Mon Aug 12 17:19:02 2024 +0200

    test: do not retry tests in `hatch run test` command (#954)

    * do not retry tests in hatch run test command

    * fix

    * hatch config improvements

commit 93d2c6824207f0e29928fc82516b2327ff0d54d2
Author: HaystackBot <[email protected]>
Date:   Mon Aug 12 13:41:07 2024 +0000

    Update the changelog

commit 7d90a58f1e77e776d5682ccbb50c87d71eee99c4
Author: tstadel <[email protected]>
Date:   Mon Aug 12 15:36:58 2024 +0200

    fix: support streaming_callback param in amazon bedrock generators (#927)

    * fix: support streaming_callback param in amazon bedrock generators

    * fix chat generator merge

    * reformat

    ---------

    Co-authored-by: Thomas Stadelmann <[email protected]>

commit f03073f995fa39a88f8ca4b14438ee6a6aa3c892
Author: Rashmi Pawar <[email protected]>
Date:   Mon Aug 12 14:30:16 2024 +0530

     Add default model for NVIDIA HayStack local NIM endpoints (#915)

    * initial embedder code

    * default model code

    * docs: update model docstring

    * tests: add userwarning

    * docs: literal lint fix

    * review changes

    * remove pydantic dependency

    * move backend, nim_backend under utils

    * move is_hosted to warm_up

    * test cases, docstring fix

    * error message updation

    Co-authored-by: Madeesh Kannan <[email protected]>

    * move is_hosted code to util

    * remove backend code

    * update import for is_hosted

    * remove util and move code to utils

    * fix api key issue for failing test cases

    * Update integrations/nvidia/tests/conftest.py

    ---------

    Co-authored-by: Madeesh Kannan <[email protected]>

commit 7f5b12e9260d97afbdf1ba550bc7ee450e9a0bfc
Author: HaystackBot <[email protected]>
Date:   Thu Aug 8 13:34:35 2024 +0000

    Update the changelog

commit 76a35a7240228d1abaeb0dd50f7ccfe250bb2149
Author: Stefano Fiorucci <[email protected]>
Date:   Thu Aug 8 15:33:01 2024 +0200

    chore: pin `llama-cpp-python>=0.2.87` (#955)

    * pin llama-cpp-python>=0.2.86

    * update version

commit 9a7a9f748a804fe08387bb0f67630611720ef319
Author: Agnieszka Marzec <[email protected]>
Date:   Thu Aug 8 12:46:35 2024 +0200

    Docs: Update AmazonBedrockGenerator docstrings (#956)

    * update docstrings

    * Update integrations/amazon_bedrock/src/haystack_integrations/components/generators/amazon_bedrock/generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    ---------

    Co-authored-by: Daria Fokina <[email protected]>

commit ee08a763c856411c75ca35a47cfa36ce47c9291d
Author: Agnieszka Marzec <[email protected]>
Date:   Thu Aug 8 12:46:17 2024 +0200

    Docs: Update CohereGenerator docstrings (#960)

    * Update docstrings

    * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    ---------

    Co-authored-by: Daria Fokina <[email protected]>

commit dccaf3f3e2f5c8eb270442f2cb5eb69f7e490bbc
Author: Agnieszka Marzec <[email protected]>
Date:   Thu Aug 8 12:45:50 2024 +0200

    Docs: Update CohereChatGenerator docstrings (#958)

    * update docstrings

    * Fix formatting

    * fix formatting

    * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/chat/chat_generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/chat/chat_generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/chat/chat_generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    ---------

    Co-authored-by: Daria Fokina <[email protected]>

commit b8b72ae2bcaa3e61078df6e377a0491763f632f2
Author: Agnieszka Marzec <[email protected]>
Date:   Thu Aug 8 12:45:23 2024 +0200

    Docs: Update GoogleChatGenerator docstrings (#962)

    * Update docstrings

    * Update integrations/google_ai/src/haystack_integrations/components/generators/google_ai/chat/gemini.py

    Co-authored-by: Daria Fokina <[email protected]>

    ---------

    Co-authored-by: Daria Fokina <[email protected]>

commit 855dc33a031fe523c2767bc10e99c8d1a76b639a
Author: Agnieszka Marzec <[email protected]>
Date:   Thu Aug 8 12:45:01 2024 +0200

    Docs: Update GeminiGenerator docstrings (#964)

    * Update docstrings

    * Update integrations/google_ai/src/haystack_integrations/components/generators/google_ai/gemini.py

    Co-authored-by: Daria Fokina <[email protected]>

    ---------

    Co-authored-by: Daria Fokina <[email protected]>

commit fe9292bfc07e6d9d005572ae17aed169fb0568e6
Author: Agnieszka Marzec <[email protected]>
Date:   Thu Aug 8 12:44:22 2024 +0200

    Docs: Update NvidiaGenerator docstrings (#966)

    * Update docstrings

    * Fix formatting

    * Update integrations/nvidia/src/haystack_integrations/components/generators/nvidia/generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/nvidia/src/haystack_integrations/components/generators/nvidia/generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    ---------

    Co-authored-by: Daria Fokina <[email protected]>

commit f6c4b242071dbdc4a08c70b71f3ed88e72e40434
Author: Mateusz Haligowski <[email protected]>
Date:   Tue Aug 6 13:31:02 2024 +0200

    feat: remove gradient integration (#926)

commit e664b0cad1f9db8cc2f1abedb6b192c66ec85e47
Author: Vladimir Blagojevic <[email protected]>
Date:   Tue Aug 6 10:41:22 2024 +0100

    Update Langfuse README to avoid common initialization issues (#952)

commit 4f15df0b257b44f8ba31409b6e39117009665be6
Author: Agnieszka Marzec <[email protected]>
Date:   Tue Aug 6 09:49:28 2024 +0200

    Docs: Update AmazonBedrockChatGenerator docstrings (#949)

    * UPdate docstrings

    * Fix formatting

    * Update integrations/amazon_bedrock/src/haystack_integrations/components/generators/amazon_bedrock/chat/chat_generator.py

    Co-authored-by: Daria Fokina <[email protected]>

    ---------

    Co-authored-by: Daria Fokina <[email protected]>

commit 62d643bcd42b7cabad2eb9908dd68c6ea70a7c19
Author: Agnieszka Marzec <[email protected]>
Date:   Tue Aug 6 09:49:07 2024 +0200

    Docs: Update BM25 docstrings (#945)

    * update docstrings

    * add description

    * fix linters

    * fix whitespaces

commit 5be0bf74d2836b81cc60de84d9f8a1541ec7e191
Author: Agnieszka Marzec <[email protected]>
Date:   Mon Aug 5 22:01:52 2024 +0200

    Update docstrings (#947)

commit 993d99e0a645d67c5a1813218224c483406ca518
Author: HaystackBot <[email protected]>
Date:   Mon Aug 5 18:16:57 2024 +0000

    Update the changelog

commit a665f1f65be1f169f21bc9d3484051f1ad1c3636
Author: Stefano Fiorucci <[email protected]>
Date:   Fri Aug 2 16:47:20 2024 +0200

    introduce  utility function (#939)

    Co-authored-by: Silvano Cerza <[email protected]>

commit 7cee6c80c3ec532112ca4b7fb976711d48d72b4e
Author: Amna Mubashar <[email protected]>
Date:   Fri Aug 2 15:37:58 2024 +0200

    fix: replace DynamicChatPromptBuilder with ChatPromptBuilder (#940)

    * Remove occurrences of DynamicChatPromptBuilder

commit 3d698fbe9bf4115db27acf095dadcd12ce40e35b
Author: Amna Mubashar <[email protected]>
Date:   Fri Aug 2 15:21:44 2024 +0200

    Pin llama-cpp version (#943)

    Co-authored-by: Amna Mubashar <[email protected]>

commit 7fbc7062f1205a4f65ca59956f4f2f1bb5a37423
Author: Stefano Fiorucci <[email protected]>
Date:   Fri Aug 2 11:44:10 2024 +0200

    update docker compose to v2 (#941)

commit fa4c6ccfa7e471baf01045d9e48b118594a92b50
Author: Amna Mubashar <[email protected]>
Date:   Thu Aug 1 18:31:21 2024 +0200

    Small improvement for resolving connection string (#937)

commit eab212ab5b79d782f1a840a2078ff11651ac0d8d
Author: Stefano Lottini <[email protected]>
Date:   Thu Aug 1 11:38:56 2024 +0200

    fix: Astra DB, improved warnings and guidance about indexing-related mismatches (#932)

    * better warning text and readme note on indexing settings

    * language

    * style

    * style

commit f3878d481558d31b52c4fdebdb8c63c912d58324
Author: HaystackBot <[email protected]>
Date:   Thu Aug 1 08:05:02 2024 +0000

    Update the changelog

commit 35d657a06eea311171da3265cbf1200d5f5bc971
Author: tstadel <[email protected]>
Date:   Thu Aug 1 10:03:39 2024 +0200

    feat: support aws authentication with OpenSearchDocumentStore (#920)

    * feat: support aws auth with OpenSearchDocumentStore

    * fix: to_dict() from_dict()

    * fix tests

    * add tests

    * fix lint

    * fix mypy

    * rename test class

    * fix feedback

    * lazy-import boto3

    * move _get_auth() to AWSAuth class

    * get rid of aws_auth param

    * better docstrings

    * apply review feedback

commit f0b619e890cb07993c14dc94ee4c4b12441b948f
Author: Vladimir Blagojevic <[email protected]>
Date:   Wed Jul 31 07:57:02 2024 +0100

    chore: Remove all `DynamicChatPromptBuilder` references in Langfuse integration (#931)

    * Remove all DynamicChatPromptBuilder references

    * Lint fixes

commit b8e2623014be9ab7410d98832e25eda46923b04c
Author: Vladimir Blagojevic <[email protected]>
Date:   Tue Jul 30 09:04:39 2024 +0100

    chore: `Langfuse` - replace DynamicChatPromptBuilder with ChatPromptBuilder (#925)

commit 2f6f134300c4e8743dc567f6a5e08ff48962af6c
Author: Rashmi Pawar <[email protected]>
Date:   Mon Jul 29 21:21:32 2024 +0530

    Raise warning for base_url ../embeddings .../completions .../rankings (#922)

    * add validation for base url routes

    * move url validation to utils

    * update docstring for url validation

    * add typing for arg type

    * return docstring update

    Co-authored-by: Madeesh Kannan <[email protected]>

    * fix typo error

    Co-authored-by: Madeesh Kannan <[email protected]>

    ---------

    Co-authored-by: Madeesh Kannan <[email protected]>

commit 0cdda5ce6c5e92ef77dcfea046bc7af0bfb47aff
Author: Tobias Wochinger <[email protected]>
Date:   Wed Jul 24 18:27:45 2024 +0200

    docs: add release instructions (#923)

    * docs: add release instructions

    * docs: add note about maintainers only

commit 84aa8356e067f2ed4a59949f8254262f5268d347
Author: HaystackBot <[email protected]>
Date:   Wed Jul 24 16:08:33 2024 +0000

    Update the changelog

commit 9d511de96ddf2f9f204be00053bb7be9855ecf3a
Author: HaystackBot <[email protected]>
Date:   Wed Jul 24 13:11:25 2024 +0000

    Update the changelog

commit cd521cf00e2b8fd8eed3d7048505e2ed3c3f3a92
Author: Stefano Fiorucci <[email protected]>
Date:   Wed Jul 24 12:32:05 2024 +0200

    chore: Ragas - remove context relevancy metric (#917)

    * ragas: remove context relevancy

    * try removing rerun-failures

    * add rerun-failures back, introduce pytest-asycio

    * add asyncio marker

    * lower-bound pin

commit 282ccc4a41f9dc9791c1f811439572d1563d0238
Author: Vladimir Blagojevic <[email protected]>
Date:   Wed Jul 24 10:45:44 2024 +0200

    Use collections.list_all instead of collections._get_all (#921)

commit eaf36aa25baee410341f9153784762962a27c164
Author: HaystackBot <[email protected]>
Date:   Tue Jul 23 15:30:06 2024 +0000

    Update the changelog

commit 2ed4be7cf9a84bfd6a926adadec3e05d5a8de979
Author: Anthony Tran <[email protected]>
Date:   Mon Jul 22 10:26:10 2024 -0400

    Fix nested logic operators

    * Normalize logical filter conditions

commit a30cb9620f5744e793968a6085696c57c06af7ec
Author: Stefano Fiorucci <[email protected]>
Date:   Mon Jul 22 10:02:36 2024 +0200

    better compatibility (#914)

commit 6fb8a66b6a874afaa90dbbef874216a56782456e
Author: HaystackBot <[email protected]>
Date:   Wed Jul 17 15:26:49 2024 +0000

    Update the changelog

commit 3d07b790f8d3618c56f09488c828e72463c7802b
Author: HaystackBot <[email protected]>
Date:   Wed Jul 17 15:22:09 2024 +0000

    Update the changelog

commit 93eae15351ecd88105933b83b1a217d39390ce36
Author: David S. Batista <[email protected]>
Date:   Wed Jul 17 17:13:29 2024 +0200

    Add meta deprecration warning (#910)

    * adding deprecation warnings
    * fixing imports
    * fixing 3.9 linting issues

commit f03f8bc2b9c4110ec29c06528601ae3e255783b2
Author: HaystackBot <[email protected]>
Date:   Wed Jul 17 13:09:06 2024 +0000

    Update the changelog

commit 4553a05336ba19909747ab053cd5316350c0027a
Author: HaystackBot <[email protected]>
Date:   Wed Jul 17 09:21:17 2024 +0000

    Update the changelog

commit 9893b56d553a158264894440edefa9661589c22c
Author: Stefano Fiorucci <[email protected]>
Date:   Wed Jul 17 11:17:36 2024 +0200

    fix: `ChromaDocumentStore` - discard `meta` items when the type of their value is not supported in Chroma (#907)

    * discard invalid meta values

    * reduce warnings

commit db2b5f72bd1c724eb346a3e4b55c3b416a1fd821
Author: Vladimir Blagojevic <[email protected]>
Date:   Wed Jul 17 11:04:57 2024 +0200

    Add defensive check for filter_policy deserialization (#903)

commit be04358aefd6f125c38634acb44cb12300514664
Author: HaystackBot <[email protected]>
Date:   Wed Jul 17 07:28:22 2024 +0000

    Update the changelog

commit f5c93d90ceea9bb3dacc2fe91507cd07e0e5cfb1
Author: HaystackBot <[email protected]>
Date:   Wed Jul 17 07:25:46 2024 +0000

    Update the changelog

commit ac92b3a832f2c2ea352887c03d02b07e7bc0ba3a
Author: HaystackBot <[email protected]>
Date:   Wed Jul 17 07:18:41 2024 +0000

    Update the changelog

commit 80622966951cc2897a689805de203f866c955819
Author: Vladimir Blagojevic <[email protected]>
Date:   Wed Jul 17 09:14:45 2024 +0200

    fix: `PgVector` - Fallback to default filter policy when deserializing retrievers without the init parameter (#900)

    * Add defensive check for filter_policy deserialization

    * Update integrations/pgvector/tests/test_retrievers.py

    Co-authored-by: David S. Batista <[email protected]>

    ---------

    Co-authored-by: David S. Batista <[email protected]>

commit d943f4e62c8950c11438bb99f6314a83d02d2b53
Author: Vladimir Blagojevic <[email protected]>
Date:   Wed Jul 17 09:14:24 2024 +0200

    fix: `Mongo` - Fallback to default filter policy when deserializing retrievers without the init parameter (#899)

    * Add defensive check for filter_policy deserialization

    * black test

    * Fix ruff

    * Black tests

commit b1201e01cd9c3d84d8d1abb646bec0be8441a4f7
Author: HaystackBot <[email protected]>
Date:   Tue Jul 16 09:48:13 2024 +0000

    Update the changelog

commit ecaeedd3ab192bd62ad4e21a53e7201e1efac0ac
Author: Guillaume Chérel <[email protected]>
Date:   Tue Jul 16 11:44:57 2024 +0200

    feat: Add metadata parameter to ChromaDocumentStore. (#906)

    * feat: Add metadata parameter to ChromaDocumentStore.

    * Update integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py

    Co-authored-by: Stefano Fiorucci <[email protected]>

    * Update integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py

    Co-authored-by: Stefano Fiorucci <[email protected]>

    * Update integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py

    Co-authored-by: Stefano Fiorucci <[email protected]>

    * test: update logging message in chroma document store tests

    * style: Fix formatting

    * test: Add test for logging messages when creating chroma collection with the same name.

    * test: Fix logging message.

    ---------

    Co-authored-by: Stefano Fiorucci <[email protected]>

commit 1b3b36e73661ca4fe6694419a9ba4a947c09f5b9
Author: Vladimir Blagojevic <[email protected]>
Date:   Tue Jul 16 10:47:14 2024 +0200

    fix: `pinecone` - Fallback to default filter policy when deserializing retrievers without the init parameter (#901)

    * Add defensive check for filter_policy deserialization

    * Add comment

commit b33505a1fc6f653c15d59bf8e94a986f831a07a9
Author: Vladimir Blagojevic <[email protected]>
Date:   Tue Jul 16 10:46:54 2024 +0200

    fix: `ElasticSearch` - Fallback to default filter policy when deserializing retrievers without the init parameter (#898)

    * Add defensive check for filter_policy deserialization

    * Add defensive check for filter_policy deserialization

    * Add unit test

    * Revert change in chroma

    * Linter fix

commit 6349d155cfa57b9d26c6b6ede106a86d72445b47
Author: HaystackBot <[email protected]>
Date:   Mon Jul 15 16:31:30 2024 +0000

    Update the changelog

commit 52345424db8232e353516577a96b5323b276d6c2
Author: Vladimir Blagojevic <[email protected]>
Date:   Mon Jul 15 18:29:35 2024 +0200

    fix: `qdrant` - Fallback to default filter policy when deserializing retrievers without the init parameter (#902)

    * Add defensive check for filter_policy deserialization

    * Add unit tests

commit 05a21f63279517a973360dfc792ae51438fe5e5f
Author: HaystackBot <[email protected]>
Date:   Mon Jul 15 15:59:14 2024 +0000

    Update the changelog

commit 16b38492d05eff489031d99856667154aa2b88b8
Author: Vladimir Blagojevic <[email protected]>
Date:   Mon Jul 15 17:57:36 2024 +0200

    fix: `Chroma` - Fallback to default filter policy when deserializing retrievers without the init parameter (#897)

    * Add defensive check for filter_policy deserialization

    * Add unit test

    * Fix test

commit 43ccd3cea5bd5430c0a2c925810a40b62601144f
Author: HaystackBot <[email protected]>
Date:   Mon Jul 15 15:56:18 2024 +0000

    Update the changelog

commit bcdf33d979528405cde73a2125dfac98e96630d7
Author: Vladimir Blagojevic <[email protected]>
Date:   Mon Jul 15 17:49:01 2024 +0200

    fix: `Astra` - Fallback to default filter policy when deserializing retrievers without the init parameter (#896)

    * Add defensive check for filter_policy deserialization

    * Add unit test

    * Add comment

commit a9da4ed55435608ce9875ff885e5ad27b9e01027
Author: HaystackBot <[email protected]>
Date:   Mon Jul 15 15:14:56 2024 +0000

    Update the changelog

commit b23ab153c86c8724edfb341fce680f5a38004162
Author: Madeesh Kannan <[email protected]>
Date:   Mon Jul 15 17:13:15 2024 +0200

    fix: `OpenSearch` - Fallback to default filter policy when deserializing retrievers without the init parameter (#895)

commit 20221ac6c53e3d9dedcec9a2da3289178bf7c495
Author: HaystackBot <[email protected]>
Date:   Mon Jul 15 14:20:05 2024 +0000

    Update the changelog

commit 90255b472e556aec861432ffc10a0a3d06913d98
Author: HaystackBot <[email protected]>
Date:   Mon Jul 15 13:57:20 2024 +0000

    Update the changelog

commit 140015b8c81918a7fe4726d89a43c1acbbef0907
Author: David S. Batista <[email protected]>
Date:   Mon Jul 15 15:27:11 2024 +0200

    Update README.md (#893)

    updating Amazon Bedrock link

commit dfebd7d6e60e2c161f1790db7f4391a1d880460d
Author: HaystackBot <[email protected]>
Date:   Mon Jul 15 09:57:17 2024 +0000

    Update the changelog

commit c8f19a2fb734e6284dca6bf042c964f67a42b051
Author: Amna Mubashar <[email protected]>
Date:   Wed Jul 10 10:52:35 2024 +0200

    fix: errors in convert_filters_to_qdrant (#870)

    * progress

    * Fixed logic error

    * Some tests are still failing

    * Passed all tests

    * Fixed errors in logic

    * Fixed linting issues

    * Minor adjustments

    * Further improvements in code structure

    * Final changes for review

    * Updated

    * Added more tests

    * Add a test to check nested filters

    * Minor changes

    * Fix bugs and add docstrings

    ---------

    Co-authored-by: Amna Mubashar <[email protected]>

commit 1ecfbfa6d08f24b1bd24ff83b6ae6941e40ab352
Author: Silvano Cerza <[email protected]>
Date:   Tue Jul 9 15:54:39 2024 +0200

    Fix Google AI tests failing (#885)

    * Fix Google AI tests failing

    * Fix GoogleAIGeminiChatGenerator to_dict and from_dict

commit abfe76e5f2c0193736beab0b42b69af19fb0934d
Author: Amna Mubashar <[email protected]>
Date:   Mon Jul 8 19:00:43 2024 +0200

    fix: Allow search in ChromaDocumentStore without metadata (#863)

    * Fix a bug for checking metadata values

commit 5ab3a10c68f4834b72af74d2d996cf0fe6359f43
Author: Vladimir Blagojevic <[email protected]>
Date:   Mon Jul 8 17:20:37 2024 +0200

    Minor retriever pydoc fix (#884)

commit 11a07449734c504de821a551a238c086f1a3d0e8
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 16:35:34 2024 +0200

    feat: Add filter_policy to chroma integration (#826)

    * Add filter_policy to chroma integration

commit 124b6e801f4cfff38b65c26eda033fee5fa23d08
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 16:35:14 2024 +0200

    feat: Add filter_policy to mongodb_atlas integration (#823)

    * Add filter_policy to mongodb_atlas integration

commit 7c60cbf204f1edd039285f460920ef09c5a8243f
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 16:34:55 2024 +0200

    feat: Add filter_policy to opensearch integration (#822)

    * Add filter_policy to opensearch integration

commit e6d378a6235d13170d66e93db3f4303d2bb26cc6
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 16:34:32 2024 +0200

    feat: Add filter_policy to qdrant integration (#819)

    * Add filter_policy to qdrant integration

commit 2182edb01a0a3a4379a72e0fdf94cd3640a0fa9c
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 15:40:11 2024 +0200

    feat: Add filter_policy to elasticsearch integration (#825)

    * Add filter_policy to elasticsearch integration

commit dff8518588374f67ab55c81a65ec23076710be40
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 13:00:06 2024 +0200

    feat: Add filter_policy to pinecone integration (#821)

    * Add filter_policy to pinecone integration

commit cf792d78151967378505ae7b0581ebfa280f7bdb
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 12:26:40 2024 +0200

    feat: Add filter_policy to pgvector integration (#820)

    * Add filter_policy to pgvector integration
    ---------
    Co-authored-by: Stefano Fiorucci <[email protected]>

commit 978b71d4dbd9d19c2e616d80c686e48cf876a3da
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 12:26:10 2024 +0200

    feat: Add filter_policy to weaviate integration (#824)

    * Add filter_policy to weaviate integration

commit 24c56e96b7b74b2780dab8620a6a232ccb99cde9
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jul 5 12:25:38 2024 +0200

    feat: Add filter_policy to astra integration (#827)

    * Add filter_policy to astra integration
    ---------
    Co-authored-by: Stefano Fiorucci <[email protected]>

commit 52b0a1a59187c1b0001c2c07351438554e0788b1
Author: David Basoco <[email protected]>
Date:   Thu Jul 4 15:18:51 2024 +0200

    Fix not equal astra filter operator (#868)

commit ab68d24a688228d8d37e5c8e3f9264d9c5fcdeee
Author: HaystackBot <[email protected]>
Date:   Thu Jul 4 08:54:14 2024 +0000

    Update the changelog

commit fcbc35b972d8a06dbe5c97edb11f308297736fa0
Author: Madeesh Kannan <[email protected]>
Date:   Thu Jul 4 10:51:25 2024 +0200

    fix: Fix typo in the `ORTModel.inputs_names` field to align with upstream (#866)

commit 0d89e832f84354be0c19464cb9915a8d4aa46a3c
Author: HaystackBot <[email protected]>
Date:   Wed Jul 3 13:52:38 2024 +0000

    Update the changelog

commit 87bb97dffc2aaebd9238866c11fac57c2332fa9f
Author: HaystackBot <[email protected]>
Date:   Wed Jul 3 11:10:43 2024 +0000

    Update the changelog

commit 0fd154b97a8621e29f5f70010cc9f025d7eed245
Author: Stefano Fiorucci <[email protected]>
Date:   Wed Jul 3 13:06:55 2024 +0200

    feat: Qdrant - add support for BM42 (#864)

    * Qdrant: add support for BM42

    * add test for sparse configuration

commit fd0059e8ce8dc0db338321d2ac5f92b0c1be985a
Author: Isaac Chung <[email protected]>
Date:   Wed Jul 3 13:49:24 2024 +0300

    feat: add `score_threshold` to Qdrant Retrievers (#860)

    * feat: add score_threshold to qdrant retrievers

    * test: add score_threshold to qdrant tests

    * ruff linting

    * hatch run lint:all

    * add test case using score_threshold

    * linting

    * test: new test case with fixed embeds per review

    * expand docstrings

    * small fixes

    ---------

    Co-authored-by: anakin87 <[email protected]>

commit f73c3514c13cdaaf10a878e88f5987529958e9d2
Author: HaystackBot <[email protected]>
Date:   Tue Jul 2 09:13:17 2024 +0000

    Update the changelog

commit 9c86675bb00d587410335b6b32c28c8b94a1c795
Author: Stefano Fiorucci <[email protected]>
Date:   Tue Jul 2 11:01:01 2024 +0200

    refactor!: Qdrant - set `scale_score` default value to `False` (#862)

    * rm unused params

    * qdrant - set scale_score to False

commit 06d77769199607c717cffa297d9b71e51bee4ed4
Author: Stefano Fiorucci <[email protected]>
Date:   Tue Jul 2 09:50:43 2024 +0200

    refactor!: Qdrant - remove unused init parameters: `content_field`, `name_field`, `embedding_field`, and `duplicate_documents` (#861)

    * rm unused params

    * docs: change duplicate_documents to policy in docstring

    ---------

    Co-authored-by: Julian Risch <[email protected]>

commit 268b487a2e8633acecc917e51746eafb2040a9a6
Author: Amna Mubashar <[email protected]>
Date:   Tue Jul 2 01:45:31 2024 +0200

    feat: made truncation optional for BedrockGenerator (#833)

    * Added truncate parameter to init method

    * fixed serialization bug for BedrockGenerator

    * Add a test to check truncation functionality

commit 2d93ea3abf2141bca2395ead36d28d9dff8bb413
Author: HaystackBot <[email protected]>
Date:   Mon Jul 1 18:44:29 2024 +0000

    Update the changelog

commit 23c3e108cad5aae08c1deea8c6cc1162d0e900df
Author: Amna Mubashar <[email protected]>
Date:   Mon Jul 1 15:33:46 2024 +0200

    Add system files in git ignore (#858)

    Co-authored-by: Amna Mubashar <[email protected]>

commit 7127be63c9aa152b5139c1e826f7d345f912b178
Author: Amna Mubashar <[email protected]>
Date:   Mon Jul 1 01:50:01 2024 +0200

    feat: added distance_function property to ChromadocumentStore (#817)

    * Added the distance metric property
    ---------

    Co-authored-by: Amna Mubashar <[email protected]>
    Co-authored-by: Stefano Fiorucci <[email protected]>

commit 6d8ce95005ffedcfa76347e0d06820f2c8490092
Author: HaystackBot <[email protected]>
Date:   Fri Jun 28 15:49:09 2024 +0000

    Update the changelog

commit 1f582d7dee069209696084236438a0f09e2b8bbf
Author: tstadel <[email protected]>
Date:   Fri Jun 28 12:40:45 2024 +0200

    feat: add raise_on_failure param to OpenSearch retrievers (#852)

    * feat: add ignore_errors param to OpenSearch retrievers

    * feedback

    * fix ruff command

    * fix except

commit 605ba29f0ec722c103085e83ac9bd32762296105
Author: HaystackBot <[email protected]>
Date:   Fri Jun 28 10:30:15 2024 +0000

    Update the changelog

commit d2fd97af5d5a28b9d56f233f45b5c6664243c8ee
Author: Stefano Fiorucci <[email protected]>
Date:   Fri Jun 28 12:26:04 2024 +0200

    build: add `psutil` dependency to Unstructured integration (#854)

    * add psutil dependency to unstructured integration

    * fxi

commit 9aafa795935a4a70614d43c7afad0be0e2b12c28
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jun 28 11:32:10 2024 +0200

    chore: Update ruff invocation to include check parameter (#853)

    * Update ruff invocation to include check parameter

    * fix linting Sagemaker

    * unused import

    ---------

    Co-authored-by: anakin87 <[email protected]>

commit 439945410b7968759427e02ff81deaccca313f10
Author: HaystackBot <[email protected]>
Date:   Thu Jun 27 11:32:56 2024 +0000

    Update the changelog

commit 0039a486f6eb01f2730238a62b76253b185efa5f
Author: tstadel <[email protected]>
Date:   Thu Jun 27 13:28:21 2024 +0200

    fix: support legacy filters with OpenSearchDocumentStore (#850)

    * feat: support legacy filters with OpenSearchDocumentStore

    * add tests

commit f170ab434711d24a66c9c1a63575c9968e54824d
Author: tstadel <[email protected]>
Date:   Wed Jun 26 19:13:32 2024 +0200

    fix: serialization for custom_query in OpenSearch retrievers (#851)

commit 49e323f037527d0552ddd387ac42d8b888884782
Author: paulmartrencharpro <[email protected]>
Date:   Wed Jun 26 14:19:53 2024 +0200

    Fix: typo on Sparse embedders. The parameter should be "progress_bar" … (#814)

    * Fix typo on Sparse embedders. The parameter should be "progress_bar" instead of "show_progress_bar"

    * Fix typo on Sparse embedders tests. The parameter should be "progress_bar" instead of "show_progress_bar"

commit bd21df73ed9d298c88e538cfc2f1ede191722863
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Date:   Wed Jun 26 10:51:19 2024 +0200

    chore(deps): bump actions/add-to-project from 1.0.1 to 1.0.2 (#849)

    Bumps [actions/add-to-project](https://github.com/actions/add-to-project) from 1.0.1 to 1.0.2.
    - [Release notes](https://github.com/actions/add-to-project/releases)
    - [Commits](https://github.com/actions/add-to-project/compare/v1.0.1...v1.0.2)

    ---
    updated-dependencies:
    - dependency-name: actions/add-to-project
      dependency-type: direct:production
      update-type: version-update:semver-patch
    ...

    Signed-off-by: dependabot[bot] <[email protected]>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

commit 6945503db2f72089dd6082a4095755f80dd3dcbc
Author: Julian Risch <[email protected]>
Date:   Wed Jun 26 08:50:16 2024 +0200

    feat: Use non-gated tokenizer as fallback for mistral in AmazonBedrockChatGenerator (#843)

    * feat: Use non-gated tokenizer as fallback for mistral

    * formatting

    * fix linter issues

commit 05ccdb2a9755fc00cb10381fcf1e6c82c7b12cd2
Author: HaystackBot <[email protected]>
Date:   Tue Jun 25 16:11:17 2024 +0000

    Update the changelog

commit 69c29a95d82a7f2bbd53949257a926351521879e
Author: tstadel <[email protected]>
Date:   Tue Jun 25 17:24:10 2024 +0200

    feat: add custom_query param to OpenSearch retrievers (#841)

    * feat: add custom_query param to OpenSearch retrievers

    * feat: add custom_query to OpenSearch retrievers

    * add as run param

    * fix lint

    * switch to jinja2 templates

    * Revert "switch to jinja2 templates"

    This reverts commit f36ed13fa25abc5d17df7e087841a9ecf839c75f.

    * support custom_query as dict

    * remove unneccessary comments

    * remove str

    * fix lint

commit be09adf256a107536d5d3b5434bdb70deefa58e3
Author: tstadel <[email protected]>
Date:   Tue Jun 25 17:19:02 2024 +0200

    feat: add create_index option to OpenSearchDocumentStore (#840)

    * [opensearch] feat: add create_index option

    * fix lint

    * fix lint

    * add create_index() method

    * fix lint

    * better match

    * fix docs

commit 1c557cb06a3e19338e1ff4bd53494bb69c83695d
Author: HaystackBot <[email protected]>
Date:   Tue Jun 25 13:17:50 2024 +0000

    Update the changelog

commit 53f26ec92e233fa0e8dfa7757ae9236268086cd3
Author: Stefano Fiorucci <[email protected]>
Date:   Tue Jun 25 10:43:21 2024 +0200

    update Pinecone test scripts (#848)

commit 60c666624cfb20a3600a28be074697fdf4cf2553
Author: Stefano Fiorucci <[email protected]>
Date:   Mon Jun 24 16:31:47 2024 +0200

    install pytest-rerunfailures; change test-cov script (#845)

commit 7723ddee7c17356f1393dc15b48711c73ba0663a
Author: Silvano Cerza <[email protected]>
Date:   Mon Jun 24 15:35:30 2024 +0200

    fix: fix connection to Weaviate Cloud Service (#624)

    * Fix connection to Weaviate Cloud Service

    * Handle connection to WCS and add tests

    * Add comment explaining why we use utility function

commit 14499cd48a0c766b10a79814c5d3c9bf8d1e6bb2
Author: Silvano Cerza <[email protected]>
Date:   Mon Jun 24 13:02:12 2024 +0200

    Revert "Handle connection to WCS and add tests"

    This reverts commit f48802b2ce612896fd06a13cf33dffd9f77a8859.

commit f48802b2ce612896fd06a13cf33dffd9f77a8859
Author: Silvano Cerza <[email protected]>
Date:   Mon Jun 24 13:01:27 2024 +0200

    Handle connection to WCS and add tests

commit 75bb792e956f038992be8b8d84c4b7b94e42b9db
Author: HaystackBot <[email protected]>
Date:   Fri Jun 21 15:01:10 2024 +0000

    Update the changelog

commit 7cd71f17a3f86a22f74e85db2825f993c597e5b8
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jun 21 16:59:14 2024 +0200

    feat: Update Anthropic default models, pydocs (#839)

    * Update default models, pydocs

    * Update unit test

commit 1f7f75a663292df2cc322c526a80be0fbba7cdee
Author: Vladimir Blagojevic <[email protected]>
Date:   Fri Jun 21 14:03:29 2024 +0200

    feat: Update Cohere default LLMs, add examples and update unit tests (#838)

    * Update default models, examples and unit tests

    * PR feedback

commit 8152b6a3c29eb3ebd0727646b50b0bd71d791ebe
Author: Massimiliano Pippi <[email protected]>
Date:   Fri Jun 21 13:20:39 2024 +0200

    ci: retry tests to reduce flakyness (#836)

    * test the strategy

    * cargo load the whole bunch

commit 4b3abda8eec9eaa43c87eb37427c69a78760ba20
Author: Massimiliano Pippi <[email protected]>
Date:   Fri Jun 21 09:57:23 2024 +0200

    clean up workflow files (#835)

commit b11486fb7a20728ed89b52243200cc8184272ab8
Author: HaystackBot <[email protected]>
Date:   Thu Jun 20 13:35:54 2024 +0000

    Update the changelog

commit e42f8f2935676a55ed0c596baac5f026e8195fbb
Author: Amna Mubashar <[email protected]>
Date:   Thu Jun 20 13:19:43 2024 +0200

    doc: added docstrings for QdrantDocumentStore (#808)

    * doc: add docstrings qdrant document store
    * Updated docstrings based on PR review

    ---------

    Co-authored-by: Amna Mubashar

commit a0eca9ae434f4774ed7401700879f9a96db8014a
Author: HaystackBot <[email protected]>
Date:   Thu Jun 20 08:06:17 2024 +0000

    Update the changelog

commit d161220395aabf45fdc049ee7b129448406e39d6
Author: Leonardo Teixeira Menezes <[email protected]>
Date:   Thu Jun 20 10:02:31 2024 +0200

    feat: add customizable index names for pgvector (#818)

    * feat: add customizable index names for pgvector

    * refactor: Remove unnecessary constants on PGVector document store

commit 867b2a3b66eb75e735726f2b9780778ec59cd33b
Author: Vedant Naik <[email protected]>
Date:   Wed Jun 19 15:55:41 2024 +0100

    fix: weaviate filter error (#811)

    * fix: weaviate filter error

    * test: add test for legacy filters for weaviate

commit 236fd28287c42d34b0f0cb48426625f3e747d797
Author: HaystackBot <[email protected]>
Date:   Wed Jun 19 04:15:36 2024 +0000

    Update the changelog

commit b9f783dfd7d3b18e749889a1031173d9643940c9
Author: HaystackBot <[email protected]>
Date:   Tue Jun 18 15:33:22 2024 +0000

    Update the changelog

commit 70e2a9cf9315e01c292105acdf3eecb150793336
Author: Massimiliano Pippi <[email protected]>
Date:   Tue Jun 18 17:32:04 2024 +0200

    add support for Azure generators (#815)

commit 6254d58019101b7ec9d239fe9ee7b3adc504c38a
Author: Vladimir Blagojevic <[email protected]>
Date:   Tue Jun 18 10:27:04 2024 +0200

    feat: Update Anthropic/Cohere for tools use (#790)

    * Update for tools use

    * Test updates

    * Add tools usage integration tests

    * Minor test detail update

    * PR review

    * Pydocs update

commit 14a2711079c565b4eb7baaccbf1f3f31c7a646b8
Author: HaystackBot <[email protected]>
Date:   Fri Jun 14 15:09:05 2024 +0000

    Update the changelog

commit 5e66f1d370cc33d6e4f29019bfdc368fc63182c8
Author: tstadel <[email protected]>
Date:   Fri Jun 14 17:06:51 2024 +0200

    feat: support Claude v3, Llama3 and Command R models on Amazon Bedrock (#809)

    * feat: support Claude v3 and Cohere Command R models on Amazon Bedrock

    * revert chat pattern change

    * rename llama adapter

    * fix tests after llama adapter rename

commit 590e2b016009d171eeb9b3aae95467f295936453
Author: HaystackBot <[email protected]>
Date:   Thu Jun 13 10:27:46 2024 +0000

    Update the changelog

commit bf5c64138cfcb0feea173d4e2289754922ae5a41
Author: agruhl <[email protected]>
Date:   Thu Jun 13 12:26:05 2024 +0200

    fix: Performance optimizations and value error when streaming in langfuse (#798)

    * Solves issue with usage stats when streaming is enabled on the OpenAIGenerator

    * Root span should be closed when the pipeline run is complete

    * Added documentation

    * Moved flushing execution to the last span in the context and improved the documentation to give examples of flushing properly manually

    * Fixed linting issues

    * make use of monkeypatch

    * improving code

commit 575e209f7020a327b3347368eb29e2cc9d01ac71
Author: Stefano Fiorucci <[email protected]>
Date:   Wed Jun 12 19:17:53 2024 +0200

    tests: Pinecone - fix `test_serverless_index_creation_from_scratch` (#806)

    * pinecone tests: wait for index creation

    * make index name dynamic

    * lint

    * run the test only once in our suite

    * add reason

    * increase sleep time

    * better skipif condition

    * better coverage options

    * fix

    * revert changes in coverage.run

    * fix

    * add unit to sleep tima

    * define index name in the matrix

    * fix

    * add default

    * lint

commit 015cc3efc8da4256f75c7ae568bc8e090137a480
Author: HaystackBot <[email protected]>
Date:   Wed Jun 12 09:50:16 2024 +0000

    Update the changelog

commit 9f5a4601018c1a5707da02c43ebbd75b43d78640
Author: Stefano Fiorucci <[email protected]>
Date:   Wed Jun 12 11:48:35 2024 +0200

    cohere - remove warning (#805)

commit eb0722b90220b868d69764901342badabadee196
Author: Stefano Fiorucci <[email protected]>
Date:   Tue Jun 11 12:54:04 2024 +0200

    test: Amazon Bedrock - skip integration tests from forks (#801)

    * try skipping integration tests from forks

    * make it work on windows

    * skip mistral tests when HF token is not set

    * add windows step

    * separate unit and int tests

    * refinement

    * format

commit 4821ff3dbc86a51c652651df6bf626ced1412f67
Author: HaystackBot <[email protected]>
Date:   Tue Jun 11 09:05:42 2024 +0000

    Update the changelog

commit 7524022524ca38889a39744b2661913ef2552206
Author: Massimiliano Pippi <[email protected]>
Date:   Tue Jun 11 11:03:27 2024 +0200

    feat: defer the database connection to when it's needed (#804)

    * feat: defer the database connection to when it's needed

    * linting

    * fix tests

commit 29b363c2cea1c677b3d0ba23de4974d9e033b710
Author: HaystackBot <[email protected]>
Date:   Mon Jun 10 16:33:29 2024 +0000

    Update the changelog

commit f70664dd5d276a801b3544ad8943b95bdd064deb
Author: Massimiliano Pippi <[email protected]>
Date:   Mon Jun 10 18:31:06 2024 +0200

    feat: defer the database connection to when it's needed (#802)

    * feat: defer the database connection to when it's needed

    * remove unneeded noqa

    * fix fixture

    * trigger the connection before asserting

    * trigger connection

    * make also serialization lazy

    * remove copypasta leftovers

commit 5bc08dfba2be655b149137c2e852d9a58bd8172c
Author: Madeesh Kannan <[email protected]>
Date:   Mon Jun 10 17:41:05 2024 +0200

    refactor: Remove deprecated Nvidia Cloud Functions backend and related code. (#803)

commit 76504ab95bf7bbfaebaa2f06bae3dab892df2ae9
Author: HaystackBot <[email protected]>
Date:   Mon Jun 10 15:03:23 2024 +0000

    Update the changelog

commit 4dda1b9ec4eea7f48cd2546da4c36be7f4134210
Author: Stefano Fiorucci <[email protected]>
Date:   Mon Jun 10 17:00:36 2024 +0200

    feat!: Pinecone - support for the new API (#793)

    * upgrate to new API

    * increase sleep time

    * update example

    * address feedback

commit 1f71c721049f91634a96d61c5b505e66649655c5
Author: HaystackBot <[email protected]>
Date:   Mon Jun 10 08:08:33 2024 +0000

    Update the changelog

commit 2b4a534e21ca9c0a148011ecd2daac9c02556206
Author: HaystackBot <[email protected]>
Date:   Fri Jun 7 07:13:42 2024 +0000

    Update the changelog

commit 78157a94e7e2d0cd3ae8ac42508e9a248b96852b
Author: lohit8846 <[email protected]>
Date:   Thu Jun 6 00:17:50 2024 -0700

    feat: Add force_disable_check_same_thread init param for Qdrant local client (#779)

    * fix: added missing init param which is use by qdrant local client

    * fix: Corrected linting

commit 3f3fa2bd2185733621b6a85becd4713e0074b7d1
Author: HaystackBot <[email protected]>
Date:   Wed Jun 5 14:46:41 2024 +0000

    Update the changelog

commit bc032a37a20a38ebc3a6b9bc7fe342d4e3e5411c
Author: Silvano Cerza <[email protected]>
Date:   Wed Jun 5 16:44:57 2024 +0200

    Fix tests skipping for Google AI integration (#788)

commit 48422b1e91e3d71df149ae04a36e2d4e506dcc5c
Author: Silvano Cerza <[email protected]>
Date:   Wed Jun 5 16:37:15 2024 +0200

    Drop support for Python 3.8 for Unstructured integration (#787)

commit ab58a2319fa3df2c51ec68864a5da09105bf1fef
Author: antoniomuzzolini <[email protected]>
Date:   Wed Jun 5 16:32:06 2024 +0200

    fix: Handle `TypeError: Could not create Blob` in `GoogleAIGeminiChatGenerator` (#772)

    * bugfix 654

    aligned _message_to_content to VertexAIGeminiChatGenerator

    * Update test_chat_gemini.py

    ---------

    Co-authored-by: Silvano Cerza <[email protected]>

commit 4290d6a3fad63701882fa4b70af8aafb94db7d89
Author: Silvano Cerza <[email protected]>
Date:   Wed Jun 5 16:22:31 2024 +0200

    Fix Google AI integration tests (#786)

commit 4042fddcfe5682885b711260812a37f6323bf040
Author: ArzelaAscoIi <[email protected]>
Date:   Wed Jun 5 14:32:07 2024 +0200

    feat: return_embeddings flag for opensearch (#784)

    * feat: return_embeddings flag for opensearch

    * fix

    * fix

    * fix

    * tests: bm25 retrieval

commit 42d85b3a2936dc1275558d99907b543a24aa595b
Author: Matthew Farrellee <[email protected]>
Date:   Wed Jun 5 07:42:44 2024 -0400

    align tests and docs on NVIDIA_API_KEY (instead of NVIDIA_CATALOG_API_KEY) (#731)

commit e118b926456d1c52ad35ef1a7fa1a1e274487d58
Author: Matthew Farrellee <[email protected]>
Date:   Wed Jun 5 07:42:37 2024 -0400

    switch default from NVCF backend to hosted NIM backend (#734)

commit 6aaf523636d923ce158c46ea39a057f5529b5a17
Author: HaystackBot <[email protected]>
Date:   Mon Jun 3 07:40:19 2024 +0000

    Update the changelog

commit f7996741d9b78fce418e85bf886f7c5664e4beda
Author: Etienne <[email protected]>
Date:   Fri May 31 10:54:40 2024 -0400

    feat: Add streaming support to OllamaChatGenerator (#757)

    * Add streaming support to OllamaChatGenerator

    * Clean imports, update docstring

    * Organize imports

    * Fix test

    * Reformat code

    * Optimize imports

    * Add test for streaming callback

    ---------

    Co-authored-by: Silvano Cerza <[email protected]>

commit 145751535df0759c5dc4a93801a9e9f474f5a103
Author: HaystackBot <[email protected]>
Date:   Fri May 31 09:57:08 2024 +0000

    Update the changelog

commit 43c2cf39caa59505cb50e0180dc13819e0ab9940
Author: Stefano Fiorucci <[email protected]>
Date:   Fri May 31 11:54:53 2024 +0200

    pin chromadb (#777)

commit 8a79bfa0fa4108309ed4c2d16ad7633d5305dd82
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 31 08:49:06 2024 +0200

    fix typing checks

commit 5a84d41d2bf2f919931399eddfe634f34e03cd63
Author: HaystackBot <[email protected]>
Date:   Thu May 30 09:21:04 2024 +0000

    Update the changelog

commit 5098fbdb5c396cada8f5844d0c3a0c98deb04b00
Author: HaystackBot <[email protected]>
Date:   Thu May 30 09:19:41 2024 +0000

    Update the changelog

commit f805abc9687e6f78aa3b6cbedb37093644a78925
Author: HaystackBot <[email protected]>
Date:   Thu May 30 09:18:13 2024 +0000

    Update the changelog

commit b07dc1e2372598ecfa86eae2d1cd80ad68f06531
Author: HaystackBot <[email protected]>
Date:   Thu May 30 09:16:57 2024 +0000

    Update the changelog

commit b966f182f0f55f028a19032c8bda1738f1ee2af9
Author: HaystackBot <[email protected]>
Date:   Thu May 30 09:12:42 2024 +0000

    Update the changelog

commit 42339cdbdcdc63ded4aaaa760771a610b9406dbb
Author: Massimiliano Pippi <[email protected]>
Date:   Thu May 30 11:09:09 2024 +0200

    fix: pass empty dict to filter instead of None (#775)

commit f68b0dc48d282f7abdf05e83fb3251cb09b1a26f
Author: Massimiliano Pippi <[email protected]>
Date:   Thu May 30 09:19:46 2024 +0200

    fix astra nightly

commit c58b319596c81a7f6af2adf03933cdd8865ce614
Author: Massimiliano Pippi <[email protected]>
Date:   Wed May 29 22:08:35 2024 +0200

    show the unreleased changes for an integration (#774)

commit e63b2b03fc9e4fdca40afc6cf0e7a7ca3ceb2866
Author: Massimiliano Pippi <[email protected]>
Date:   Wed May 29 14:22:16 2024 +0200

    feat: defer the database connection to when it's needed (#773)

    * fix tests

    * fix cursor creation

commit 588d6549e0772aa069355e91e809aa638b569c05
Author: Massimiliano Pippi <[email protected]>
Date:   Wed May 29 14:22:02 2024 +0200

    feat: defer the database connection to when it's needed (#770)

    * feat: defer the database connection to when it's needed

    * lazy collection too

    * add test

    * linting

commit 5eebd8444eb18e24cc448e33aa7761ff630025aa
Author: Massimiliano Pippi <[email protected]>
Date:   Wed May 29 10:36:44 2024 +0200

    feat: defer the database connection to when it's needed (#769)

    * feat: defer the database connection to when it's needed

    * fix typing

    * test init is lazy

commit 7d36d02f47bea45dd8c48235fb0d850949e13aba
Author: Massimiliano Pippi <[email protected]>
Date:   Wed May 29 10:36:14 2024 +0200

    feat: defer the database connection to when it's needed (#766)

    * fix tests

    * fix linting

commit 257f99276f35abe68c04e2f616e3669b43fd010f
Author: Edoardo Abati <[email protected]>
Date:   Mon May 27 09:16:20 2024 +0200

    feat: Improve `OpenSearchDocumentStore.__init__` arguments (#739)

    * add max_chunk_bytes parameter to OpenSearchDocumentStore init

    * more documentation in opensearch init

    * fix tests

    * use `max_chunk_bytes` in delete_documents

    * re added type ignore

    * add new kwargs to to_dict

    * test default mappings

    * Update document_store.py

    * restore C++

    ---------

    Co-authored-by: Massimiliano Pippi <[email protected]>

commit 51d0be00c20bef29090d1d37837b6de830d3b43e
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 22:22:50 2024 +0200

    allow unconventional

commit 8b5d5906b29899e9ca59c158abec6e5f3fd0f840
Author: HaystackBot <[email protected]>
Date:   Fri May 24 19:40:00 2024 +0000

    Update the changelog

commit f3d6b130ffd7eaf158f1df45baf82c162d7a21f6
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 21:39:15 2024 +0200

    use git-cliff-action directly

commit 6eac9df34b83728334fc1750631f6d65ba702b4b
Author: HaystackBot <[email protected]>
Date:   Fri May 24 19:27:04 2024 +0000

    Update the changelog

commit 8fbf0876b00b89362a3367974a1d1dfe05a45fdd
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 21:26:09 2024 +0200

    fetch the full history

commit b591fb5b2f92d14e116bef0f9b6723c4035aa73a
Author: HaystackBot <[email protected]>
Date:   Fri May 24 19:14:17 2024 +0000

    Update the changelog

commit d4c87ecbd9c9fbcd0b54aa937bff59fac21c81b6
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 21:13:24 2024 +0200

    push to main

commit 39be0e750964d0acbaa047b2cdcb6f3cde07a98c
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 21:06:42 2024 +0200

    introduce EndBug/add-and-commit

commit e89f6d888d7afe99f78fe1b4231c8ff832a62a4f
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 18:38:15 2024 +0200

    restore GITHUB_TOKEN

commit 4238980b1e33a3cfa4d520a2bcae21b738df2279
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 17:35:31 2024 +0200

    stup git config

commit 9c4a17fcfca85d8f76cc2596590470186db9dc71
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 17:33:37 2024 +0200

    authenticate checkout step

commit 216b73421fcfe29856499aa4653272dd2e3e5ec7
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 17:15:52 2024 +0200

    try setting git user explicitly

commit 36828c59228e520f944824d053445157cc084e3a
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 17:01:29 2024 +0200

    fetch before checkout

commit d6c7c573492eefdf380a0dcb4cb594bd8f977183
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 16:59:34 2024 +0200

    fix branch checkout

commit 6ff501154ce3c025e6755446088ed5e3dd988733
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 16:56:04 2024 +0200

    setup git config

commit 051ca4924c0e6a8f776a9e8ed26df51b2a7e84d3
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 16:53:31 2024 +0200

    feat: defer the database connection to when it's needed (#753)

    * defer the database connection to when it's needed

    * avoid accessing _client, use the property instead

    * ignore mypy errors on private field

commit d914669fa71f9bce299a17abe177f7c89c793c18
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 16:52:54 2024 +0200

    generate integrations changelog

commit 38dc95f2b60d011476c1fb3b46895b05c416b472
Author: Ruben <[email protected]>
Date:   Fri May 24 12:05:43 2024 +0200

    feat: make get_distance and recreate_collection public, replace deprecated recreate_collection function (#754)

    * make get_distance and recreate_collection public

    * replace deprecated recreate_collection function

    * make on_disk and use_sparse_embeddings optional for recreate_collection

    * use client.collection_exists instead of try-catch

    ---------

    Co-authored-by: Massimiliano Pippi <[email protected]>

commit 3a67349dc572269049d65db13d4eed3b5acec637
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 12:03:13 2024 +0200

    fix: remove support for generate API (#755)

    * remove support for generate API

    * add note about super()

commit 29469fa16c4f02b4d0b9d304bcf7e3810bea23e4
Author: Jan Beitner <[email protected]>
Date:   Fri May 24 11:00:12 2024 +0100

    Allow vanilla qdrant filters (#692)

    * allow vanilla qdrant filters

    * updated signatures

    * fix formatting

    * Fix type check in filter_documents and add small test

    ---------

    Co-authored-by: Silvano Cerza <[email protected]>

commit 2667d6bc6077a8db176b257bf178a1df45153214
Author: Anushree Bannadabhavi <[email protected]>
Date:   Fri May 24 05:40:41 2024 -0400

    fix: add support for custom mapping in ElasticsearchDocumentStore (#721)

    * Add custom mapping in ElasticsearchDocumentStore init

    * Update docstrings and add test

    * Fix linting

    * Fix retrievers tests

    ---------

    Co-authored-by: Silvano Cerza <[email protected]>

commit 95daee37d79799f5134c7bed7889865084aae114
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 24 08:15:03 2024 +0200

    feat: defer database connection to the first usage (#748)

    * defer the database connection to the first usage of the client

    * add test to avoid regressions

commit ee2d54baf5dacf3817a0447b8d34390f108b6d2a
Author: mohammedsohel <[email protected]>
Date:   Thu May 23 19:10:42 2024 +0530

    adding support of "amazon.titan-embed-text-v2:0" (#735)

    * adding support of "amazon.titan-embed-text-v2:0"

    * rectifying the format

    ---------

    Co-authored-by: Massimiliano Pippi <[email protected]>

commit 428c2a8a9c22dc407a203d3ebc502a153a70505b
Author: Massimiliano Pippi <[email protected]>
Date:   Thu May 23 11:49:12 2024 +0200

    amend PR template and CoC

commit 660d73ddb48801bc0ea34f056d11494699d77bd1
Author: Massimiliano Pippi <[email protected]>
Date:   Thu May 23 11:44:17 2024 +0200

    add PR template and CoC

commit 63cf323abf97c9d7221a610a9dcf2a45c17d3fb3
Author: Vishal <[email protected]>
Date:   Thu May 23 14:33:32 2024 +0530

    fix: max_tokens typo in Mistral Chat (#740)

commit 31e61b72571dbc699577af2a573e7031aeb379e8
Author: Tuana Çelik <[email protected]>
Date:   Wed May 22 12:30:26 2024 +0200

    Update _nim_backend.py (#744)

    * Update _nim_backend.py

    * Update _nim_backend.py

commit 7141c68c25d64192e8fa7e341e9d6fb5eb3f5612
Author: jlonge4 <[email protected]>
Date:   Sat May 18 05:12:19 2024 -0400

    [deepset-ai/haystack-core-integrations#727] (#738)

    * hybrid retrieval ex

    * Update integrations/pgvector/examples/hybrid_retrieval.py

    Co-authored-by: Stefano Fiorucci <[email protected]>

    * suggested updates

    * suggested updates

    * suggested updates

    ---------

    Co-authored-by: Stefano Fiorucci <[email protected]>

commit c4f1cc48c8e83baf12335ec1c35e424d1a0975e9
Author: Stefano Lottini <[email protected]>
Date:   Wed May 15 21:49:21 2024 +0200

    explicit projection when reading from Astra DB (#733)

commit 6f298cec4c3c374f714e9eb753162026666cb525
Author: paulmartrencharpro <[email protected]>
Date:   Wed May 15 18:11:42 2024 +0200

    Use the local_files_only option available as of fastembed==0.2.7. It … (#736)

    * Use the local_files_only option available as of fastembed==0.2.7. It allows to not look for the models online, but only use the local, cached, files.
    This way, we can download the model once then use this without internet access

    * Fix lint issues

    * add same param to doc embedder

    ---------

    Co-authored-by: anakin87 <[email protected]>

commit 0e02fd65f432f10c77fcd066a7064bf5ba7223a3
Author: Ulises M <[email protected]>
Date:   Mon May 13 05:44:45 2024 -0700

    basic implementation of llama.cpp chat generation (#723)

    * basic implementation of llama.cpp chat generation

    allows for constraining to json

    allows for function calling (not tested)

    streaming needs to be implemented when stream is set to true in generation_kwargs

    * add testing

    * remove unnecessary function

    * slight documentation fix, comment out broken test

    * support for function calling through functionary

    also add a basic rag test

    * add function calling and execute test, it works!

    * add json test, add chatml test

    * make function call and execute more deterministic

    * try removing additional deps

    * revert

    * make transformers a tests-only dependency

    ---------

    Co-authored-by: Stefano Fiorucci <[email protected]>

commit d4a598b6f5d3287ead9e8c234b001ea0f15e376b
Author: Jon <[email protected]>
Date:   Fri May 10 00:30:48 2024 -0700

    Implement filters for chromaQueryTextRetriever via existing haystack filters logic (#705)

    * Implement filters for chromaQueryTextRetriever via existing haystack filters logic

    Run linter

    * un-skip tests

    ---------

    Co-authored-by: Massimiliano Pippi <[email protected]>

commit c29db9c913c8326c28d2f3de091429d95ee73b38
Author: Daria Fokina <[email protected]>
Date:   Thu May 9 19:10:51 2024 +0200

    missing api references (#728)

commit 7b4428d934e4304f186cd01e02bcf239a43f5729
Author: Vladimir Blagojevic <[email protected]>
Date:   Wed May 8 17:30:52 2024 +0200

    chore: Use ChatMessage to_openai_format, update unit tests, pydocs (#725)

    * Use ChatMessage to_openai_format, update unit tests, pydocs

    * Minor pydocs fixes, turn off integration tests for nightly runs

    * Run only unit tests against haystack-ai main nightly

commit 758e5f372bea65b6ffa149c06a3ab2431d7e7237
Author: jlonge4 <[email protected]>
Date:   Wed May 8 04:10:55 2024 -0400

    feat: Implement keyword retrieval for pgvector integration (#644)

    * keyword retriever

    * add lang to init for tests

    * make suggested edits/test

    * fixes to test / lint

    * index check query change

    * SQLLiteral fix

    * table name quotes

    * table name quotes

    * table name quotes

    * test query edit

    * remove meta

    * move keyword index to init

    * move keyword index to init

    * move keyword index to init

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * keyword with filters test

    * more tests

    * rename example

    ---------

    Co-authored-by: anakin87 <[email protected]>

commit 3c14c52fc38903ec907d28da384b0a36119a3892
Author: Silvano Cerza <[email protected]>
Date:   Tue May 7 17:03:50 2024 +0200

    Update Nvidia integration to support new endpoints (#701)

    * Add support for Nvidia catalog API for generator

    * Add support for Nvidia catalog API for embedders

    * Add NVIDIA_CATALOG_API_KEY in Nvidia integration workflow

    * Enable ruff auto formatting for tests

    * Fix linting

    * Simplify Secret import and enhance docstring

    Co-authored-by: Madeesh Kannan <[email protected]>

    * Add deprecation warnings for NvcfBackend

    * Add truncate parameter for embedders

    * Fix linting

    * Use enum for truncate mode in embedders

    * Change how truncate argument is handled

    * Fix truncate conversion

    * Update truncate docstring

    ---------

    Co-authored-by: Madeesh Kannan <[email protected]>

commit 975e0e528cb4f01a92bc95905fe948674c47aaed
Author: Massimiliano Pippi <[email protected]>
Date:   Tue May 7 10:57:05 2024 +0200

    fix: make unit tests pass (#720)

    * make unit tests pass

    * linting

commit 8a1242f64e7825238101b646f687ced76177f397
Author: Vladimir Blagojevic <[email protected]>
Date:   Mon May 6 21:04:54 2024 +0200

    Fix langfuse nightly tests (#716)

commit f61db6d3a24cbd9bb5327a8765bf0c8272639754
Author: Massimiliano Pippi <[email protected]>
Date:   Mon May 6 17:56:21 2024 +0200

    change the pydoc renderer class (#718)

commit e5667e78b222135dbaddb77f51d6c096650cdd38
Author: Massimiliano Pippi <[email protected]>
Date:   Mon May 6 17:54:08 2024 +0200

    pass the haystack docs version when generating docs (#719)

commit d30c0eafef3dd399e37a8e4168c90a418e6e5716
Author: Stefano Fiorucci <[email protected]>
Date:   Mon May 6 14:58:23 2024 +0200

    FastembedTextEmbedder - remove batch_size (#688)

commit da46c9c5069ad78c75456dfa2f8684e7ba37d55e
Author: Massimiliano Pippi <[email protected]>
Date:   Mon May 6 09:58:12 2024 +0200

    Update README.md

commit 9659b1305f8750ff1c8a675d054f3a7c360fd0af
Author: Dmitry <[email protected]>
Date:   Mon May 6 10:43:57 2024 +0300

    Type hints in pgvector document store updated for 3.8 compability (#704)

commit 04fb950f182fb9d69af3bf42fee26d32206a9a27
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 3 19:01:36 2024 +0200

    fix: add multi-line variable to step output in the right way (#714)

    * try

    * try

    * remove testing code

commit 8b916a33ffc25e549ae52728515daf30fa1da5bb
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 3 17:53:30 2024 +0200

    Follow up: update Cohere integration to use Cohere SDK v5 (#711)

    * add support for python client v5

    * linting

commit 1d0a5568178340bf4abbdaeecd7b690ebfc7ddec
Author: Massimiliano Pippi <[email protected]>
Date:   Fri May 3 17:46:25 2024 +0200

    chore: sync integrations docs with all the available Haystack versions (#713)

    * sync docs with all the versions

    * automatically fetch versions

    * Update CI_readme_sync.yml

    Co-authored-by: Silvano Cerza <[email protected]>

    ---------

    Co-authored-by: Silvano Cerza <[email protected]>

commit 5c7e1b74e58ddb7943999a8f0ca24cfb0d834884
Author: Chris Knight <[email protected]>
Date:   Fri May 3 10:53:45 2024 +0100

    fix: Weaviate schema class name conversion which preserves PascalCase (#707)

    * fix: Weaviate schema class name conversion which preserves PascalCase

    * adding a test case for schema name conversion

    * linting

    * formatting

    ---------

    Co-authored-by: David S. Batista <[email protected]>

commit ca87b0edcf607daf80158c13ec182a4f578b6667
Author: Jon <[email protected]>
Date:   Thu May 2 09:46:22 2024 -0700

    Feature/bump chromadb dep to 0.5.0 (#700)

    * [DEP]: Bump chromadb version

    Remove specific version pin

    * Fix: chromadb tests failing due to sort order variance

    Fix: linting error

    * Add OllamaEmbeddingFunction to function registry

    Linting

commit 48521156d9fa1af1afb332395ee430d58fcb9428
Author: Vladimir Blagojevic <[email protected]>
Date:   Thu May 2 18:08:19 2024 +0200

    feat: Langfuse integration (#686)

    * Add langfuse integration

    * Trace pipeline run

    * Integration admin additions

    * Pydoc config

    * Capture trace url in tracer component

    * Add integration test, update example

    * Linting

    * Add haystack-pydoc-tools dep

    * Add comprehensive README

    * Handle both ChatMessage and str payloads

    * Renaming

    * Versioning scheme

    * Pydocs, add public trace flag

    * Add hatch-vcs dep

    * Use OPENAI_API_KEY secret

    * update docstrings

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * lint fixups

    * Improve test, previous version always returned 200

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * Update integrations/langfuse/README.md

    Co-authored-by: Daria Fokina <[email protected]>

    * Add details about Langfuse keys

    * Pylint

    ---------

    Co-authored-by: Massimiliano Pippi <[email protected]>
    Co-authored-by: Daria Fokina <[email protected]>

commit 26a16b69398e3dc9894f2a853ba817c728805062
Author: Julian Risch <[email protected]>
Date:   Thu May 2 13:06:36 2024 +0200

    docs: Add Ranker type to Cohere and Jina in inventory (#708)

commit 667162fe6a08dc052063919f52c40479cb574226
Author: Alex W <[email protected]>
Date:   Thu May 2 04:50:17 2024 -0400

    [Cohere] …
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
integration:chroma type:documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

ChromaDocumentStore fails to search if no metadata is given
2 participants