-
Notifications
You must be signed in to change notification settings - Fork 126
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: Allow search in ChromaDocumentStore without metadata #863
Conversation
if metadatas := result.get("metadatas"): | ||
document_dict["meta"] = dict(metadatas[i][j]) | ||
metadatas = result.get("metadatas") | ||
if metadatas and metadatas[i][j] is not None: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can it ever happen that metadatas
size is shorter than i
? And metadatas[i]
is shorter than j
? 🤔
Cause if it can happen this could fail.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added an exception here as discussed.
integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py
Outdated
Show resolved
Hide resolved
@silvanocerza Can you take a look at the test that's failing. I am wondering whats wrong there. |
Co-authored-by: Silvano Cerza <[email protected]>
commit 4c8c8813241947c8f41953a4407056eb4b761cf1 Author: Amna Mubashar <[email protected]> Date: Tue Aug 13 17:58:57 2024 +0200 feat: make truncation optional for bedrock chat generator (#967) * Added truncate param to chat generator and adapters * Added tests to check truncation * Add doc_string * Fixed linting commit 0451e6f43ad5731887ab0aa2aa8e4de9020913e6 Author: HaystackBot <[email protected]> Date: Mon Aug 12 15:56:05 2024 +0000 Update the changelog commit 0f1452ac3d7fcec1142473f5049cd0266b52784a Author: David S. Batista <[email protected]> Date: Mon Aug 12 17:46:03 2024 +0200 refactor: change meta data fields (#911) * initial import * formatting * fixing tests * removing warnings * linting issues * fixes due to conflicts commit a8b2de9d86621aba2243e2d6e350e082c087700d Author: Stefano Fiorucci <[email protected]> Date: Mon Aug 12 17:19:02 2024 +0200 test: do not retry tests in `hatch run test` command (#954) * do not retry tests in hatch run test command * fix * hatch config improvements commit 93d2c6824207f0e29928fc82516b2327ff0d54d2 Author: HaystackBot <[email protected]> Date: Mon Aug 12 13:41:07 2024 +0000 Update the changelog commit 7d90a58f1e77e776d5682ccbb50c87d71eee99c4 Author: tstadel <[email protected]> Date: Mon Aug 12 15:36:58 2024 +0200 fix: support streaming_callback param in amazon bedrock generators (#927) * fix: support streaming_callback param in amazon bedrock generators * fix chat generator merge * reformat --------- Co-authored-by: Thomas Stadelmann <[email protected]> commit f03073f995fa39a88f8ca4b14438ee6a6aa3c892 Author: Rashmi Pawar <[email protected]> Date: Mon Aug 12 14:30:16 2024 +0530 Add default model for NVIDIA HayStack local NIM endpoints (#915) * initial embedder code * default model code * docs: update model docstring * tests: add userwarning * docs: literal lint fix * review changes * remove pydantic dependency * move backend, nim_backend under utils * move is_hosted to warm_up * test cases, docstring fix * error message updation Co-authored-by: Madeesh Kannan <[email protected]> * move is_hosted code to util * remove backend code * update import for is_hosted * remove util and move code to utils * fix api key issue for failing test cases * Update integrations/nvidia/tests/conftest.py --------- Co-authored-by: Madeesh Kannan <[email protected]> commit 7f5b12e9260d97afbdf1ba550bc7ee450e9a0bfc Author: HaystackBot <[email protected]> Date: Thu Aug 8 13:34:35 2024 +0000 Update the changelog commit 76a35a7240228d1abaeb0dd50f7ccfe250bb2149 Author: Stefano Fiorucci <[email protected]> Date: Thu Aug 8 15:33:01 2024 +0200 chore: pin `llama-cpp-python>=0.2.87` (#955) * pin llama-cpp-python>=0.2.86 * update version commit 9a7a9f748a804fe08387bb0f67630611720ef319 Author: Agnieszka Marzec <[email protected]> Date: Thu Aug 8 12:46:35 2024 +0200 Docs: Update AmazonBedrockGenerator docstrings (#956) * update docstrings * Update integrations/amazon_bedrock/src/haystack_integrations/components/generators/amazon_bedrock/generator.py Co-authored-by: Daria Fokina <[email protected]> --------- Co-authored-by: Daria Fokina <[email protected]> commit ee08a763c856411c75ca35a47cfa36ce47c9291d Author: Agnieszka Marzec <[email protected]> Date: Thu Aug 8 12:46:17 2024 +0200 Docs: Update CohereGenerator docstrings (#960) * Update docstrings * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/generator.py Co-authored-by: Daria Fokina <[email protected]> * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/generator.py Co-authored-by: Daria Fokina <[email protected]> --------- Co-authored-by: Daria Fokina <[email protected]> commit dccaf3f3e2f5c8eb270442f2cb5eb69f7e490bbc Author: Agnieszka Marzec <[email protected]> Date: Thu Aug 8 12:45:50 2024 +0200 Docs: Update CohereChatGenerator docstrings (#958) * update docstrings * Fix formatting * fix formatting * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/chat/chat_generator.py Co-authored-by: Daria Fokina <[email protected]> * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/chat/chat_generator.py Co-authored-by: Daria Fokina <[email protected]> * Update integrations/cohere/src/haystack_integrations/components/generators/cohere/chat/chat_generator.py Co-authored-by: Daria Fokina <[email protected]> --------- Co-authored-by: Daria Fokina <[email protected]> commit b8b72ae2bcaa3e61078df6e377a0491763f632f2 Author: Agnieszka Marzec <[email protected]> Date: Thu Aug 8 12:45:23 2024 +0200 Docs: Update GoogleChatGenerator docstrings (#962) * Update docstrings * Update integrations/google_ai/src/haystack_integrations/components/generators/google_ai/chat/gemini.py Co-authored-by: Daria Fokina <[email protected]> --------- Co-authored-by: Daria Fokina <[email protected]> commit 855dc33a031fe523c2767bc10e99c8d1a76b639a Author: Agnieszka Marzec <[email protected]> Date: Thu Aug 8 12:45:01 2024 +0200 Docs: Update GeminiGenerator docstrings (#964) * Update docstrings * Update integrations/google_ai/src/haystack_integrations/components/generators/google_ai/gemini.py Co-authored-by: Daria Fokina <[email protected]> --------- Co-authored-by: Daria Fokina <[email protected]> commit fe9292bfc07e6d9d005572ae17aed169fb0568e6 Author: Agnieszka Marzec <[email protected]> Date: Thu Aug 8 12:44:22 2024 +0200 Docs: Update NvidiaGenerator docstrings (#966) * Update docstrings * Fix formatting * Update integrations/nvidia/src/haystack_integrations/components/generators/nvidia/generator.py Co-authored-by: Daria Fokina <[email protected]> * Update integrations/nvidia/src/haystack_integrations/components/generators/nvidia/generator.py Co-authored-by: Daria Fokina <[email protected]> --------- Co-authored-by: Daria Fokina <[email protected]> commit f6c4b242071dbdc4a08c70b71f3ed88e72e40434 Author: Mateusz Haligowski <[email protected]> Date: Tue Aug 6 13:31:02 2024 +0200 feat: remove gradient integration (#926) commit e664b0cad1f9db8cc2f1abedb6b192c66ec85e47 Author: Vladimir Blagojevic <[email protected]> Date: Tue Aug 6 10:41:22 2024 +0100 Update Langfuse README to avoid common initialization issues (#952) commit 4f15df0b257b44f8ba31409b6e39117009665be6 Author: Agnieszka Marzec <[email protected]> Date: Tue Aug 6 09:49:28 2024 +0200 Docs: Update AmazonBedrockChatGenerator docstrings (#949) * UPdate docstrings * Fix formatting * Update integrations/amazon_bedrock/src/haystack_integrations/components/generators/amazon_bedrock/chat/chat_generator.py Co-authored-by: Daria Fokina <[email protected]> --------- Co-authored-by: Daria Fokina <[email protected]> commit 62d643bcd42b7cabad2eb9908dd68c6ea70a7c19 Author: Agnieszka Marzec <[email protected]> Date: Tue Aug 6 09:49:07 2024 +0200 Docs: Update BM25 docstrings (#945) * update docstrings * add description * fix linters * fix whitespaces commit 5be0bf74d2836b81cc60de84d9f8a1541ec7e191 Author: Agnieszka Marzec <[email protected]> Date: Mon Aug 5 22:01:52 2024 +0200 Update docstrings (#947) commit 993d99e0a645d67c5a1813218224c483406ca518 Author: HaystackBot <[email protected]> Date: Mon Aug 5 18:16:57 2024 +0000 Update the changelog commit a665f1f65be1f169f21bc9d3484051f1ad1c3636 Author: Stefano Fiorucci <[email protected]> Date: Fri Aug 2 16:47:20 2024 +0200 introduce utility function (#939) Co-authored-by: Silvano Cerza <[email protected]> commit 7cee6c80c3ec532112ca4b7fb976711d48d72b4e Author: Amna Mubashar <[email protected]> Date: Fri Aug 2 15:37:58 2024 +0200 fix: replace DynamicChatPromptBuilder with ChatPromptBuilder (#940) * Remove occurrences of DynamicChatPromptBuilder commit 3d698fbe9bf4115db27acf095dadcd12ce40e35b Author: Amna Mubashar <[email protected]> Date: Fri Aug 2 15:21:44 2024 +0200 Pin llama-cpp version (#943) Co-authored-by: Amna Mubashar <[email protected]> commit 7fbc7062f1205a4f65ca59956f4f2f1bb5a37423 Author: Stefano Fiorucci <[email protected]> Date: Fri Aug 2 11:44:10 2024 +0200 update docker compose to v2 (#941) commit fa4c6ccfa7e471baf01045d9e48b118594a92b50 Author: Amna Mubashar <[email protected]> Date: Thu Aug 1 18:31:21 2024 +0200 Small improvement for resolving connection string (#937) commit eab212ab5b79d782f1a840a2078ff11651ac0d8d Author: Stefano Lottini <[email protected]> Date: Thu Aug 1 11:38:56 2024 +0200 fix: Astra DB, improved warnings and guidance about indexing-related mismatches (#932) * better warning text and readme note on indexing settings * language * style * style commit f3878d481558d31b52c4fdebdb8c63c912d58324 Author: HaystackBot <[email protected]> Date: Thu Aug 1 08:05:02 2024 +0000 Update the changelog commit 35d657a06eea311171da3265cbf1200d5f5bc971 Author: tstadel <[email protected]> Date: Thu Aug 1 10:03:39 2024 +0200 feat: support aws authentication with OpenSearchDocumentStore (#920) * feat: support aws auth with OpenSearchDocumentStore * fix: to_dict() from_dict() * fix tests * add tests * fix lint * fix mypy * rename test class * fix feedback * lazy-import boto3 * move _get_auth() to AWSAuth class * get rid of aws_auth param * better docstrings * apply review feedback commit f0b619e890cb07993c14dc94ee4c4b12441b948f Author: Vladimir Blagojevic <[email protected]> Date: Wed Jul 31 07:57:02 2024 +0100 chore: Remove all `DynamicChatPromptBuilder` references in Langfuse integration (#931) * Remove all DynamicChatPromptBuilder references * Lint fixes commit b8e2623014be9ab7410d98832e25eda46923b04c Author: Vladimir Blagojevic <[email protected]> Date: Tue Jul 30 09:04:39 2024 +0100 chore: `Langfuse` - replace DynamicChatPromptBuilder with ChatPromptBuilder (#925) commit 2f6f134300c4e8743dc567f6a5e08ff48962af6c Author: Rashmi Pawar <[email protected]> Date: Mon Jul 29 21:21:32 2024 +0530 Raise warning for base_url ../embeddings .../completions .../rankings (#922) * add validation for base url routes * move url validation to utils * update docstring for url validation * add typing for arg type * return docstring update Co-authored-by: Madeesh Kannan <[email protected]> * fix typo error Co-authored-by: Madeesh Kannan <[email protected]> --------- Co-authored-by: Madeesh Kannan <[email protected]> commit 0cdda5ce6c5e92ef77dcfea046bc7af0bfb47aff Author: Tobias Wochinger <[email protected]> Date: Wed Jul 24 18:27:45 2024 +0200 docs: add release instructions (#923) * docs: add release instructions * docs: add note about maintainers only commit 84aa8356e067f2ed4a59949f8254262f5268d347 Author: HaystackBot <[email protected]> Date: Wed Jul 24 16:08:33 2024 +0000 Update the changelog commit 9d511de96ddf2f9f204be00053bb7be9855ecf3a Author: HaystackBot <[email protected]> Date: Wed Jul 24 13:11:25 2024 +0000 Update the changelog commit cd521cf00e2b8fd8eed3d7048505e2ed3c3f3a92 Author: Stefano Fiorucci <[email protected]> Date: Wed Jul 24 12:32:05 2024 +0200 chore: Ragas - remove context relevancy metric (#917) * ragas: remove context relevancy * try removing rerun-failures * add rerun-failures back, introduce pytest-asycio * add asyncio marker * lower-bound pin commit 282ccc4a41f9dc9791c1f811439572d1563d0238 Author: Vladimir Blagojevic <[email protected]> Date: Wed Jul 24 10:45:44 2024 +0200 Use collections.list_all instead of collections._get_all (#921) commit eaf36aa25baee410341f9153784762962a27c164 Author: HaystackBot <[email protected]> Date: Tue Jul 23 15:30:06 2024 +0000 Update the changelog commit 2ed4be7cf9a84bfd6a926adadec3e05d5a8de979 Author: Anthony Tran <[email protected]> Date: Mon Jul 22 10:26:10 2024 -0400 Fix nested logic operators * Normalize logical filter conditions commit a30cb9620f5744e793968a6085696c57c06af7ec Author: Stefano Fiorucci <[email protected]> Date: Mon Jul 22 10:02:36 2024 +0200 better compatibility (#914) commit 6fb8a66b6a874afaa90dbbef874216a56782456e Author: HaystackBot <[email protected]> Date: Wed Jul 17 15:26:49 2024 +0000 Update the changelog commit 3d07b790f8d3618c56f09488c828e72463c7802b Author: HaystackBot <[email protected]> Date: Wed Jul 17 15:22:09 2024 +0000 Update the changelog commit 93eae15351ecd88105933b83b1a217d39390ce36 Author: David S. Batista <[email protected]> Date: Wed Jul 17 17:13:29 2024 +0200 Add meta deprecration warning (#910) * adding deprecation warnings * fixing imports * fixing 3.9 linting issues commit f03f8bc2b9c4110ec29c06528601ae3e255783b2 Author: HaystackBot <[email protected]> Date: Wed Jul 17 13:09:06 2024 +0000 Update the changelog commit 4553a05336ba19909747ab053cd5316350c0027a Author: HaystackBot <[email protected]> Date: Wed Jul 17 09:21:17 2024 +0000 Update the changelog commit 9893b56d553a158264894440edefa9661589c22c Author: Stefano Fiorucci <[email protected]> Date: Wed Jul 17 11:17:36 2024 +0200 fix: `ChromaDocumentStore` - discard `meta` items when the type of their value is not supported in Chroma (#907) * discard invalid meta values * reduce warnings commit db2b5f72bd1c724eb346a3e4b55c3b416a1fd821 Author: Vladimir Blagojevic <[email protected]> Date: Wed Jul 17 11:04:57 2024 +0200 Add defensive check for filter_policy deserialization (#903) commit be04358aefd6f125c38634acb44cb12300514664 Author: HaystackBot <[email protected]> Date: Wed Jul 17 07:28:22 2024 +0000 Update the changelog commit f5c93d90ceea9bb3dacc2fe91507cd07e0e5cfb1 Author: HaystackBot <[email protected]> Date: Wed Jul 17 07:25:46 2024 +0000 Update the changelog commit ac92b3a832f2c2ea352887c03d02b07e7bc0ba3a Author: HaystackBot <[email protected]> Date: Wed Jul 17 07:18:41 2024 +0000 Update the changelog commit 80622966951cc2897a689805de203f866c955819 Author: Vladimir Blagojevic <[email protected]> Date: Wed Jul 17 09:14:45 2024 +0200 fix: `PgVector` - Fallback to default filter policy when deserializing retrievers without the init parameter (#900) * Add defensive check for filter_policy deserialization * Update integrations/pgvector/tests/test_retrievers.py Co-authored-by: David S. Batista <[email protected]> --------- Co-authored-by: David S. Batista <[email protected]> commit d943f4e62c8950c11438bb99f6314a83d02d2b53 Author: Vladimir Blagojevic <[email protected]> Date: Wed Jul 17 09:14:24 2024 +0200 fix: `Mongo` - Fallback to default filter policy when deserializing retrievers without the init parameter (#899) * Add defensive check for filter_policy deserialization * black test * Fix ruff * Black tests commit b1201e01cd9c3d84d8d1abb646bec0be8441a4f7 Author: HaystackBot <[email protected]> Date: Tue Jul 16 09:48:13 2024 +0000 Update the changelog commit ecaeedd3ab192bd62ad4e21a53e7201e1efac0ac Author: Guillaume Chérel <[email protected]> Date: Tue Jul 16 11:44:57 2024 +0200 feat: Add metadata parameter to ChromaDocumentStore. (#906) * feat: Add metadata parameter to ChromaDocumentStore. * Update integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py Co-authored-by: Stefano Fiorucci <[email protected]> * Update integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py Co-authored-by: Stefano Fiorucci <[email protected]> * Update integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py Co-authored-by: Stefano Fiorucci <[email protected]> * test: update logging message in chroma document store tests * style: Fix formatting * test: Add test for logging messages when creating chroma collection with the same name. * test: Fix logging message. --------- Co-authored-by: Stefano Fiorucci <[email protected]> commit 1b3b36e73661ca4fe6694419a9ba4a947c09f5b9 Author: Vladimir Blagojevic <[email protected]> Date: Tue Jul 16 10:47:14 2024 +0200 fix: `pinecone` - Fallback to default filter policy when deserializing retrievers without the init parameter (#901) * Add defensive check for filter_policy deserialization * Add comment commit b33505a1fc6f653c15d59bf8e94a986f831a07a9 Author: Vladimir Blagojevic <[email protected]> Date: Tue Jul 16 10:46:54 2024 +0200 fix: `ElasticSearch` - Fallback to default filter policy when deserializing retrievers without the init parameter (#898) * Add defensive check for filter_policy deserialization * Add defensive check for filter_policy deserialization * Add unit test * Revert change in chroma * Linter fix commit 6349d155cfa57b9d26c6b6ede106a86d72445b47 Author: HaystackBot <[email protected]> Date: Mon Jul 15 16:31:30 2024 +0000 Update the changelog commit 52345424db8232e353516577a96b5323b276d6c2 Author: Vladimir Blagojevic <[email protected]> Date: Mon Jul 15 18:29:35 2024 +0200 fix: `qdrant` - Fallback to default filter policy when deserializing retrievers without the init parameter (#902) * Add defensive check for filter_policy deserialization * Add unit tests commit 05a21f63279517a973360dfc792ae51438fe5e5f Author: HaystackBot <[email protected]> Date: Mon Jul 15 15:59:14 2024 +0000 Update the changelog commit 16b38492d05eff489031d99856667154aa2b88b8 Author: Vladimir Blagojevic <[email protected]> Date: Mon Jul 15 17:57:36 2024 +0200 fix: `Chroma` - Fallback to default filter policy when deserializing retrievers without the init parameter (#897) * Add defensive check for filter_policy deserialization * Add unit test * Fix test commit 43ccd3cea5bd5430c0a2c925810a40b62601144f Author: HaystackBot <[email protected]> Date: Mon Jul 15 15:56:18 2024 +0000 Update the changelog commit bcdf33d979528405cde73a2125dfac98e96630d7 Author: Vladimir Blagojevic <[email protected]> Date: Mon Jul 15 17:49:01 2024 +0200 fix: `Astra` - Fallback to default filter policy when deserializing retrievers without the init parameter (#896) * Add defensive check for filter_policy deserialization * Add unit test * Add comment commit a9da4ed55435608ce9875ff885e5ad27b9e01027 Author: HaystackBot <[email protected]> Date: Mon Jul 15 15:14:56 2024 +0000 Update the changelog commit b23ab153c86c8724edfb341fce680f5a38004162 Author: Madeesh Kannan <[email protected]> Date: Mon Jul 15 17:13:15 2024 +0200 fix: `OpenSearch` - Fallback to default filter policy when deserializing retrievers without the init parameter (#895) commit 20221ac6c53e3d9dedcec9a2da3289178bf7c495 Author: HaystackBot <[email protected]> Date: Mon Jul 15 14:20:05 2024 +0000 Update the changelog commit 90255b472e556aec861432ffc10a0a3d06913d98 Author: HaystackBot <[email protected]> Date: Mon Jul 15 13:57:20 2024 +0000 Update the changelog commit 140015b8c81918a7fe4726d89a43c1acbbef0907 Author: David S. Batista <[email protected]> Date: Mon Jul 15 15:27:11 2024 +0200 Update README.md (#893) updating Amazon Bedrock link commit dfebd7d6e60e2c161f1790db7f4391a1d880460d Author: HaystackBot <[email protected]> Date: Mon Jul 15 09:57:17 2024 +0000 Update the changelog commit c8f19a2fb734e6284dca6bf042c964f67a42b051 Author: Amna Mubashar <[email protected]> Date: Wed Jul 10 10:52:35 2024 +0200 fix: errors in convert_filters_to_qdrant (#870) * progress * Fixed logic error * Some tests are still failing * Passed all tests * Fixed errors in logic * Fixed linting issues * Minor adjustments * Further improvements in code structure * Final changes for review * Updated * Added more tests * Add a test to check nested filters * Minor changes * Fix bugs and add docstrings --------- Co-authored-by: Amna Mubashar <[email protected]> commit 1ecfbfa6d08f24b1bd24ff83b6ae6941e40ab352 Author: Silvano Cerza <[email protected]> Date: Tue Jul 9 15:54:39 2024 +0200 Fix Google AI tests failing (#885) * Fix Google AI tests failing * Fix GoogleAIGeminiChatGenerator to_dict and from_dict commit abfe76e5f2c0193736beab0b42b69af19fb0934d Author: Amna Mubashar <[email protected]> Date: Mon Jul 8 19:00:43 2024 +0200 fix: Allow search in ChromaDocumentStore without metadata (#863) * Fix a bug for checking metadata values commit 5ab3a10c68f4834b72af74d2d996cf0fe6359f43 Author: Vladimir Blagojevic <[email protected]> Date: Mon Jul 8 17:20:37 2024 +0200 Minor retriever pydoc fix (#884) commit 11a07449734c504de821a551a238c086f1a3d0e8 Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 16:35:34 2024 +0200 feat: Add filter_policy to chroma integration (#826) * Add filter_policy to chroma integration commit 124b6e801f4cfff38b65c26eda033fee5fa23d08 Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 16:35:14 2024 +0200 feat: Add filter_policy to mongodb_atlas integration (#823) * Add filter_policy to mongodb_atlas integration commit 7c60cbf204f1edd039285f460920ef09c5a8243f Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 16:34:55 2024 +0200 feat: Add filter_policy to opensearch integration (#822) * Add filter_policy to opensearch integration commit e6d378a6235d13170d66e93db3f4303d2bb26cc6 Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 16:34:32 2024 +0200 feat: Add filter_policy to qdrant integration (#819) * Add filter_policy to qdrant integration commit 2182edb01a0a3a4379a72e0fdf94cd3640a0fa9c Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 15:40:11 2024 +0200 feat: Add filter_policy to elasticsearch integration (#825) * Add filter_policy to elasticsearch integration commit dff8518588374f67ab55c81a65ec23076710be40 Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 13:00:06 2024 +0200 feat: Add filter_policy to pinecone integration (#821) * Add filter_policy to pinecone integration commit cf792d78151967378505ae7b0581ebfa280f7bdb Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 12:26:40 2024 +0200 feat: Add filter_policy to pgvector integration (#820) * Add filter_policy to pgvector integration --------- Co-authored-by: Stefano Fiorucci <[email protected]> commit 978b71d4dbd9d19c2e616d80c686e48cf876a3da Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 12:26:10 2024 +0200 feat: Add filter_policy to weaviate integration (#824) * Add filter_policy to weaviate integration commit 24c56e96b7b74b2780dab8620a6a232ccb99cde9 Author: Vladimir Blagojevic <[email protected]> Date: Fri Jul 5 12:25:38 2024 +0200 feat: Add filter_policy to astra integration (#827) * Add filter_policy to astra integration --------- Co-authored-by: Stefano Fiorucci <[email protected]> commit 52b0a1a59187c1b0001c2c07351438554e0788b1 Author: David Basoco <[email protected]> Date: Thu Jul 4 15:18:51 2024 +0200 Fix not equal astra filter operator (#868) commit ab68d24a688228d8d37e5c8e3f9264d9c5fcdeee Author: HaystackBot <[email protected]> Date: Thu Jul 4 08:54:14 2024 +0000 Update the changelog commit fcbc35b972d8a06dbe5c97edb11f308297736fa0 Author: Madeesh Kannan <[email protected]> Date: Thu Jul 4 10:51:25 2024 +0200 fix: Fix typo in the `ORTModel.inputs_names` field to align with upstream (#866) commit 0d89e832f84354be0c19464cb9915a8d4aa46a3c Author: HaystackBot <[email protected]> Date: Wed Jul 3 13:52:38 2024 +0000 Update the changelog commit 87bb97dffc2aaebd9238866c11fac57c2332fa9f Author: HaystackBot <[email protected]> Date: Wed Jul 3 11:10:43 2024 +0000 Update the changelog commit 0fd154b97a8621e29f5f70010cc9f025d7eed245 Author: Stefano Fiorucci <[email protected]> Date: Wed Jul 3 13:06:55 2024 +0200 feat: Qdrant - add support for BM42 (#864) * Qdrant: add support for BM42 * add test for sparse configuration commit fd0059e8ce8dc0db338321d2ac5f92b0c1be985a Author: Isaac Chung <[email protected]> Date: Wed Jul 3 13:49:24 2024 +0300 feat: add `score_threshold` to Qdrant Retrievers (#860) * feat: add score_threshold to qdrant retrievers * test: add score_threshold to qdrant tests * ruff linting * hatch run lint:all * add test case using score_threshold * linting * test: new test case with fixed embeds per review * expand docstrings * small fixes --------- Co-authored-by: anakin87 <[email protected]> commit f73c3514c13cdaaf10a878e88f5987529958e9d2 Author: HaystackBot <[email protected]> Date: Tue Jul 2 09:13:17 2024 +0000 Update the changelog commit 9c86675bb00d587410335b6b32c28c8b94a1c795 Author: Stefano Fiorucci <[email protected]> Date: Tue Jul 2 11:01:01 2024 +0200 refactor!: Qdrant - set `scale_score` default value to `False` (#862) * rm unused params * qdrant - set scale_score to False commit 06d77769199607c717cffa297d9b71e51bee4ed4 Author: Stefano Fiorucci <[email protected]> Date: Tue Jul 2 09:50:43 2024 +0200 refactor!: Qdrant - remove unused init parameters: `content_field`, `name_field`, `embedding_field`, and `duplicate_documents` (#861) * rm unused params * docs: change duplicate_documents to policy in docstring --------- Co-authored-by: Julian Risch <[email protected]> commit 268b487a2e8633acecc917e51746eafb2040a9a6 Author: Amna Mubashar <[email protected]> Date: Tue Jul 2 01:45:31 2024 +0200 feat: made truncation optional for BedrockGenerator (#833) * Added truncate parameter to init method * fixed serialization bug for BedrockGenerator * Add a test to check truncation functionality commit 2d93ea3abf2141bca2395ead36d28d9dff8bb413 Author: HaystackBot <[email protected]> Date: Mon Jul 1 18:44:29 2024 +0000 Update the changelog commit 23c3e108cad5aae08c1deea8c6cc1162d0e900df Author: Amna Mubashar <[email protected]> Date: Mon Jul 1 15:33:46 2024 +0200 Add system files in git ignore (#858) Co-authored-by: Amna Mubashar <[email protected]> commit 7127be63c9aa152b5139c1e826f7d345f912b178 Author: Amna Mubashar <[email protected]> Date: Mon Jul 1 01:50:01 2024 +0200 feat: added distance_function property to ChromadocumentStore (#817) * Added the distance metric property --------- Co-authored-by: Amna Mubashar <[email protected]> Co-authored-by: Stefano Fiorucci <[email protected]> commit 6d8ce95005ffedcfa76347e0d06820f2c8490092 Author: HaystackBot <[email protected]> Date: Fri Jun 28 15:49:09 2024 +0000 Update the changelog commit 1f582d7dee069209696084236438a0f09e2b8bbf Author: tstadel <[email protected]> Date: Fri Jun 28 12:40:45 2024 +0200 feat: add raise_on_failure param to OpenSearch retrievers (#852) * feat: add ignore_errors param to OpenSearch retrievers * feedback * fix ruff command * fix except commit 605ba29f0ec722c103085e83ac9bd32762296105 Author: HaystackBot <[email protected]> Date: Fri Jun 28 10:30:15 2024 +0000 Update the changelog commit d2fd97af5d5a28b9d56f233f45b5c6664243c8ee Author: Stefano Fiorucci <[email protected]> Date: Fri Jun 28 12:26:04 2024 +0200 build: add `psutil` dependency to Unstructured integration (#854) * add psutil dependency to unstructured integration * fxi commit 9aafa795935a4a70614d43c7afad0be0e2b12c28 Author: Vladimir Blagojevic <[email protected]> Date: Fri Jun 28 11:32:10 2024 +0200 chore: Update ruff invocation to include check parameter (#853) * Update ruff invocation to include check parameter * fix linting Sagemaker * unused import --------- Co-authored-by: anakin87 <[email protected]> commit 439945410b7968759427e02ff81deaccca313f10 Author: HaystackBot <[email protected]> Date: Thu Jun 27 11:32:56 2024 +0000 Update the changelog commit 0039a486f6eb01f2730238a62b76253b185efa5f Author: tstadel <[email protected]> Date: Thu Jun 27 13:28:21 2024 +0200 fix: support legacy filters with OpenSearchDocumentStore (#850) * feat: support legacy filters with OpenSearchDocumentStore * add tests commit f170ab434711d24a66c9c1a63575c9968e54824d Author: tstadel <[email protected]> Date: Wed Jun 26 19:13:32 2024 +0200 fix: serialization for custom_query in OpenSearch retrievers (#851) commit 49e323f037527d0552ddd387ac42d8b888884782 Author: paulmartrencharpro <[email protected]> Date: Wed Jun 26 14:19:53 2024 +0200 Fix: typo on Sparse embedders. The parameter should be "progress_bar" … (#814) * Fix typo on Sparse embedders. The parameter should be "progress_bar" instead of "show_progress_bar" * Fix typo on Sparse embedders tests. The parameter should be "progress_bar" instead of "show_progress_bar" commit bd21df73ed9d298c88e538cfc2f1ede191722863 Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Date: Wed Jun 26 10:51:19 2024 +0200 chore(deps): bump actions/add-to-project from 1.0.1 to 1.0.2 (#849) Bumps [actions/add-to-project](https://github.com/actions/add-to-project) from 1.0.1 to 1.0.2. - [Release notes](https://github.com/actions/add-to-project/releases) - [Commits](https://github.com/actions/add-to-project/compare/v1.0.1...v1.0.2) --- updated-dependencies: - dependency-name: actions/add-to-project dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> commit 6945503db2f72089dd6082a4095755f80dd3dcbc Author: Julian Risch <[email protected]> Date: Wed Jun 26 08:50:16 2024 +0200 feat: Use non-gated tokenizer as fallback for mistral in AmazonBedrockChatGenerator (#843) * feat: Use non-gated tokenizer as fallback for mistral * formatting * fix linter issues commit 05ccdb2a9755fc00cb10381fcf1e6c82c7b12cd2 Author: HaystackBot <[email protected]> Date: Tue Jun 25 16:11:17 2024 +0000 Update the changelog commit 69c29a95d82a7f2bbd53949257a926351521879e Author: tstadel <[email protected]> Date: Tue Jun 25 17:24:10 2024 +0200 feat: add custom_query param to OpenSearch retrievers (#841) * feat: add custom_query param to OpenSearch retrievers * feat: add custom_query to OpenSearch retrievers * add as run param * fix lint * switch to jinja2 templates * Revert "switch to jinja2 templates" This reverts commit f36ed13fa25abc5d17df7e087841a9ecf839c75f. * support custom_query as dict * remove unneccessary comments * remove str * fix lint commit be09adf256a107536d5d3b5434bdb70deefa58e3 Author: tstadel <[email protected]> Date: Tue Jun 25 17:19:02 2024 +0200 feat: add create_index option to OpenSearchDocumentStore (#840) * [opensearch] feat: add create_index option * fix lint * fix lint * add create_index() method * fix lint * better match * fix docs commit 1c557cb06a3e19338e1ff4bd53494bb69c83695d Author: HaystackBot <[email protected]> Date: Tue Jun 25 13:17:50 2024 +0000 Update the changelog commit 53f26ec92e233fa0e8dfa7757ae9236268086cd3 Author: Stefano Fiorucci <[email protected]> Date: Tue Jun 25 10:43:21 2024 +0200 update Pinecone test scripts (#848) commit 60c666624cfb20a3600a28be074697fdf4cf2553 Author: Stefano Fiorucci <[email protected]> Date: Mon Jun 24 16:31:47 2024 +0200 install pytest-rerunfailures; change test-cov script (#845) commit 7723ddee7c17356f1393dc15b48711c73ba0663a Author: Silvano Cerza <[email protected]> Date: Mon Jun 24 15:35:30 2024 +0200 fix: fix connection to Weaviate Cloud Service (#624) * Fix connection to Weaviate Cloud Service * Handle connection to WCS and add tests * Add comment explaining why we use utility function commit 14499cd48a0c766b10a79814c5d3c9bf8d1e6bb2 Author: Silvano Cerza <[email protected]> Date: Mon Jun 24 13:02:12 2024 +0200 Revert "Handle connection to WCS and add tests" This reverts commit f48802b2ce612896fd06a13cf33dffd9f77a8859. commit f48802b2ce612896fd06a13cf33dffd9f77a8859 Author: Silvano Cerza <[email protected]> Date: Mon Jun 24 13:01:27 2024 +0200 Handle connection to WCS and add tests commit 75bb792e956f038992be8b8d84c4b7b94e42b9db Author: HaystackBot <[email protected]> Date: Fri Jun 21 15:01:10 2024 +0000 Update the changelog commit 7cd71f17a3f86a22f74e85db2825f993c597e5b8 Author: Vladimir Blagojevic <[email protected]> Date: Fri Jun 21 16:59:14 2024 +0200 feat: Update Anthropic default models, pydocs (#839) * Update default models, pydocs * Update unit test commit 1f7f75a663292df2cc322c526a80be0fbba7cdee Author: Vladimir Blagojevic <[email protected]> Date: Fri Jun 21 14:03:29 2024 +0200 feat: Update Cohere default LLMs, add examples and update unit tests (#838) * Update default models, examples and unit tests * PR feedback commit 8152b6a3c29eb3ebd0727646b50b0bd71d791ebe Author: Massimiliano Pippi <[email protected]> Date: Fri Jun 21 13:20:39 2024 +0200 ci: retry tests to reduce flakyness (#836) * test the strategy * cargo load the whole bunch commit 4b3abda8eec9eaa43c87eb37427c69a78760ba20 Author: Massimiliano Pippi <[email protected]> Date: Fri Jun 21 09:57:23 2024 +0200 clean up workflow files (#835) commit b11486fb7a20728ed89b52243200cc8184272ab8 Author: HaystackBot <[email protected]> Date: Thu Jun 20 13:35:54 2024 +0000 Update the changelog commit e42f8f2935676a55ed0c596baac5f026e8195fbb Author: Amna Mubashar <[email protected]> Date: Thu Jun 20 13:19:43 2024 +0200 doc: added docstrings for QdrantDocumentStore (#808) * doc: add docstrings qdrant document store * Updated docstrings based on PR review --------- Co-authored-by: Amna Mubashar commit a0eca9ae434f4774ed7401700879f9a96db8014a Author: HaystackBot <[email protected]> Date: Thu Jun 20 08:06:17 2024 +0000 Update the changelog commit d161220395aabf45fdc049ee7b129448406e39d6 Author: Leonardo Teixeira Menezes <[email protected]> Date: Thu Jun 20 10:02:31 2024 +0200 feat: add customizable index names for pgvector (#818) * feat: add customizable index names for pgvector * refactor: Remove unnecessary constants on PGVector document store commit 867b2a3b66eb75e735726f2b9780778ec59cd33b Author: Vedant Naik <[email protected]> Date: Wed Jun 19 15:55:41 2024 +0100 fix: weaviate filter error (#811) * fix: weaviate filter error * test: add test for legacy filters for weaviate commit 236fd28287c42d34b0f0cb48426625f3e747d797 Author: HaystackBot <[email protected]> Date: Wed Jun 19 04:15:36 2024 +0000 Update the changelog commit b9f783dfd7d3b18e749889a1031173d9643940c9 Author: HaystackBot <[email protected]> Date: Tue Jun 18 15:33:22 2024 +0000 Update the changelog commit 70e2a9cf9315e01c292105acdf3eecb150793336 Author: Massimiliano Pippi <[email protected]> Date: Tue Jun 18 17:32:04 2024 +0200 add support for Azure generators (#815) commit 6254d58019101b7ec9d239fe9ee7b3adc504c38a Author: Vladimir Blagojevic <[email protected]> Date: Tue Jun 18 10:27:04 2024 +0200 feat: Update Anthropic/Cohere for tools use (#790) * Update for tools use * Test updates * Add tools usage integration tests * Minor test detail update * PR review * Pydocs update commit 14a2711079c565b4eb7baaccbf1f3f31c7a646b8 Author: HaystackBot <[email protected]> Date: Fri Jun 14 15:09:05 2024 +0000 Update the changelog commit 5e66f1d370cc33d6e4f29019bfdc368fc63182c8 Author: tstadel <[email protected]> Date: Fri Jun 14 17:06:51 2024 +0200 feat: support Claude v3, Llama3 and Command R models on Amazon Bedrock (#809) * feat: support Claude v3 and Cohere Command R models on Amazon Bedrock * revert chat pattern change * rename llama adapter * fix tests after llama adapter rename commit 590e2b016009d171eeb9b3aae95467f295936453 Author: HaystackBot <[email protected]> Date: Thu Jun 13 10:27:46 2024 +0000 Update the changelog commit bf5c64138cfcb0feea173d4e2289754922ae5a41 Author: agruhl <[email protected]> Date: Thu Jun 13 12:26:05 2024 +0200 fix: Performance optimizations and value error when streaming in langfuse (#798) * Solves issue with usage stats when streaming is enabled on the OpenAIGenerator * Root span should be closed when the pipeline run is complete * Added documentation * Moved flushing execution to the last span in the context and improved the documentation to give examples of flushing properly manually * Fixed linting issues * make use of monkeypatch * improving code commit 575e209f7020a327b3347368eb29e2cc9d01ac71 Author: Stefano Fiorucci <[email protected]> Date: Wed Jun 12 19:17:53 2024 +0200 tests: Pinecone - fix `test_serverless_index_creation_from_scratch` (#806) * pinecone tests: wait for index creation * make index name dynamic * lint * run the test only once in our suite * add reason * increase sleep time * better skipif condition * better coverage options * fix * revert changes in coverage.run * fix * add unit to sleep tima * define index name in the matrix * fix * add default * lint commit 015cc3efc8da4256f75c7ae568bc8e090137a480 Author: HaystackBot <[email protected]> Date: Wed Jun 12 09:50:16 2024 +0000 Update the changelog commit 9f5a4601018c1a5707da02c43ebbd75b43d78640 Author: Stefano Fiorucci <[email protected]> Date: Wed Jun 12 11:48:35 2024 +0200 cohere - remove warning (#805) commit eb0722b90220b868d69764901342badabadee196 Author: Stefano Fiorucci <[email protected]> Date: Tue Jun 11 12:54:04 2024 +0200 test: Amazon Bedrock - skip integration tests from forks (#801) * try skipping integration tests from forks * make it work on windows * skip mistral tests when HF token is not set * add windows step * separate unit and int tests * refinement * format commit 4821ff3dbc86a51c652651df6bf626ced1412f67 Author: HaystackBot <[email protected]> Date: Tue Jun 11 09:05:42 2024 +0000 Update the changelog commit 7524022524ca38889a39744b2661913ef2552206 Author: Massimiliano Pippi <[email protected]> Date: Tue Jun 11 11:03:27 2024 +0200 feat: defer the database connection to when it's needed (#804) * feat: defer the database connection to when it's needed * linting * fix tests commit 29b363c2cea1c677b3d0ba23de4974d9e033b710 Author: HaystackBot <[email protected]> Date: Mon Jun 10 16:33:29 2024 +0000 Update the changelog commit f70664dd5d276a801b3544ad8943b95bdd064deb Author: Massimiliano Pippi <[email protected]> Date: Mon Jun 10 18:31:06 2024 +0200 feat: defer the database connection to when it's needed (#802) * feat: defer the database connection to when it's needed * remove unneeded noqa * fix fixture * trigger the connection before asserting * trigger connection * make also serialization lazy * remove copypasta leftovers commit 5bc08dfba2be655b149137c2e852d9a58bd8172c Author: Madeesh Kannan <[email protected]> Date: Mon Jun 10 17:41:05 2024 +0200 refactor: Remove deprecated Nvidia Cloud Functions backend and related code. (#803) commit 76504ab95bf7bbfaebaa2f06bae3dab892df2ae9 Author: HaystackBot <[email protected]> Date: Mon Jun 10 15:03:23 2024 +0000 Update the changelog commit 4dda1b9ec4eea7f48cd2546da4c36be7f4134210 Author: Stefano Fiorucci <[email protected]> Date: Mon Jun 10 17:00:36 2024 +0200 feat!: Pinecone - support for the new API (#793) * upgrate to new API * increase sleep time * update example * address feedback commit 1f71c721049f91634a96d61c5b505e66649655c5 Author: HaystackBot <[email protected]> Date: Mon Jun 10 08:08:33 2024 +0000 Update the changelog commit 2b4a534e21ca9c0a148011ecd2daac9c02556206 Author: HaystackBot <[email protected]> Date: Fri Jun 7 07:13:42 2024 +0000 Update the changelog commit 78157a94e7e2d0cd3ae8ac42508e9a248b96852b Author: lohit8846 <[email protected]> Date: Thu Jun 6 00:17:50 2024 -0700 feat: Add force_disable_check_same_thread init param for Qdrant local client (#779) * fix: added missing init param which is use by qdrant local client * fix: Corrected linting commit 3f3fa2bd2185733621b6a85becd4713e0074b7d1 Author: HaystackBot <[email protected]> Date: Wed Jun 5 14:46:41 2024 +0000 Update the changelog commit bc032a37a20a38ebc3a6b9bc7fe342d4e3e5411c Author: Silvano Cerza <[email protected]> Date: Wed Jun 5 16:44:57 2024 +0200 Fix tests skipping for Google AI integration (#788) commit 48422b1e91e3d71df149ae04a36e2d4e506dcc5c Author: Silvano Cerza <[email protected]> Date: Wed Jun 5 16:37:15 2024 +0200 Drop support for Python 3.8 for Unstructured integration (#787) commit ab58a2319fa3df2c51ec68864a5da09105bf1fef Author: antoniomuzzolini <[email protected]> Date: Wed Jun 5 16:32:06 2024 +0200 fix: Handle `TypeError: Could not create Blob` in `GoogleAIGeminiChatGenerator` (#772) * bugfix 654 aligned _message_to_content to VertexAIGeminiChatGenerator * Update test_chat_gemini.py --------- Co-authored-by: Silvano Cerza <[email protected]> commit 4290d6a3fad63701882fa4b70af8aafb94db7d89 Author: Silvano Cerza <[email protected]> Date: Wed Jun 5 16:22:31 2024 +0200 Fix Google AI integration tests (#786) commit 4042fddcfe5682885b711260812a37f6323bf040 Author: ArzelaAscoIi <[email protected]> Date: Wed Jun 5 14:32:07 2024 +0200 feat: return_embeddings flag for opensearch (#784) * feat: return_embeddings flag for opensearch * fix * fix * fix * tests: bm25 retrieval commit 42d85b3a2936dc1275558d99907b543a24aa595b Author: Matthew Farrellee <[email protected]> Date: Wed Jun 5 07:42:44 2024 -0400 align tests and docs on NVIDIA_API_KEY (instead of NVIDIA_CATALOG_API_KEY) (#731) commit e118b926456d1c52ad35ef1a7fa1a1e274487d58 Author: Matthew Farrellee <[email protected]> Date: Wed Jun 5 07:42:37 2024 -0400 switch default from NVCF backend to hosted NIM backend (#734) commit 6aaf523636d923ce158c46ea39a057f5529b5a17 Author: HaystackBot <[email protected]> Date: Mon Jun 3 07:40:19 2024 +0000 Update the changelog commit f7996741d9b78fce418e85bf886f7c5664e4beda Author: Etienne <[email protected]> Date: Fri May 31 10:54:40 2024 -0400 feat: Add streaming support to OllamaChatGenerator (#757) * Add streaming support to OllamaChatGenerator * Clean imports, update docstring * Organize imports * Fix test * Reformat code * Optimize imports * Add test for streaming callback --------- Co-authored-by: Silvano Cerza <[email protected]> commit 145751535df0759c5dc4a93801a9e9f474f5a103 Author: HaystackBot <[email protected]> Date: Fri May 31 09:57:08 2024 +0000 Update the changelog commit 43c2cf39caa59505cb50e0180dc13819e0ab9940 Author: Stefano Fiorucci <[email protected]> Date: Fri May 31 11:54:53 2024 +0200 pin chromadb (#777) commit 8a79bfa0fa4108309ed4c2d16ad7633d5305dd82 Author: Massimiliano Pippi <[email protected]> Date: Fri May 31 08:49:06 2024 +0200 fix typing checks commit 5a84d41d2bf2f919931399eddfe634f34e03cd63 Author: HaystackBot <[email protected]> Date: Thu May 30 09:21:04 2024 +0000 Update the changelog commit 5098fbdb5c396cada8f5844d0c3a0c98deb04b00 Author: HaystackBot <[email protected]> Date: Thu May 30 09:19:41 2024 +0000 Update the changelog commit f805abc9687e6f78aa3b6cbedb37093644a78925 Author: HaystackBot <[email protected]> Date: Thu May 30 09:18:13 2024 +0000 Update the changelog commit b07dc1e2372598ecfa86eae2d1cd80ad68f06531 Author: HaystackBot <[email protected]> Date: Thu May 30 09:16:57 2024 +0000 Update the changelog commit b966f182f0f55f028a19032c8bda1738f1ee2af9 Author: HaystackBot <[email protected]> Date: Thu May 30 09:12:42 2024 +0000 Update the changelog commit 42339cdbdcdc63ded4aaaa760771a610b9406dbb Author: Massimiliano Pippi <[email protected]> Date: Thu May 30 11:09:09 2024 +0200 fix: pass empty dict to filter instead of None (#775) commit f68b0dc48d282f7abdf05e83fb3251cb09b1a26f Author: Massimiliano Pippi <[email protected]> Date: Thu May 30 09:19:46 2024 +0200 fix astra nightly commit c58b319596c81a7f6af2adf03933cdd8865ce614 Author: Massimiliano Pippi <[email protected]> Date: Wed May 29 22:08:35 2024 +0200 show the unreleased changes for an integration (#774) commit e63b2b03fc9e4fdca40afc6cf0e7a7ca3ceb2866 Author: Massimiliano Pippi <[email protected]> Date: Wed May 29 14:22:16 2024 +0200 feat: defer the database connection to when it's needed (#773) * fix tests * fix cursor creation commit 588d6549e0772aa069355e91e809aa638b569c05 Author: Massimiliano Pippi <[email protected]> Date: Wed May 29 14:22:02 2024 +0200 feat: defer the database connection to when it's needed (#770) * feat: defer the database connection to when it's needed * lazy collection too * add test * linting commit 5eebd8444eb18e24cc448e33aa7761ff630025aa Author: Massimiliano Pippi <[email protected]> Date: Wed May 29 10:36:44 2024 +0200 feat: defer the database connection to when it's needed (#769) * feat: defer the database connection to when it's needed * fix typing * test init is lazy commit 7d36d02f47bea45dd8c48235fb0d850949e13aba Author: Massimiliano Pippi <[email protected]> Date: Wed May 29 10:36:14 2024 +0200 feat: defer the database connection to when it's needed (#766) * fix tests * fix linting commit 257f99276f35abe68c04e2f616e3669b43fd010f Author: Edoardo Abati <[email protected]> Date: Mon May 27 09:16:20 2024 +0200 feat: Improve `OpenSearchDocumentStore.__init__` arguments (#739) * add max_chunk_bytes parameter to OpenSearchDocumentStore init * more documentation in opensearch init * fix tests * use `max_chunk_bytes` in delete_documents * re added type ignore * add new kwargs to to_dict * test default mappings * Update document_store.py * restore C++ --------- Co-authored-by: Massimiliano Pippi <[email protected]> commit 51d0be00c20bef29090d1d37837b6de830d3b43e Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 22:22:50 2024 +0200 allow unconventional commit 8b5d5906b29899e9ca59c158abec6e5f3fd0f840 Author: HaystackBot <[email protected]> Date: Fri May 24 19:40:00 2024 +0000 Update the changelog commit f3d6b130ffd7eaf158f1df45baf82c162d7a21f6 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 21:39:15 2024 +0200 use git-cliff-action directly commit 6eac9df34b83728334fc1750631f6d65ba702b4b Author: HaystackBot <[email protected]> Date: Fri May 24 19:27:04 2024 +0000 Update the changelog commit 8fbf0876b00b89362a3367974a1d1dfe05a45fdd Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 21:26:09 2024 +0200 fetch the full history commit b591fb5b2f92d14e116bef0f9b6723c4035aa73a Author: HaystackBot <[email protected]> Date: Fri May 24 19:14:17 2024 +0000 Update the changelog commit d4c87ecbd9c9fbcd0b54aa937bff59fac21c81b6 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 21:13:24 2024 +0200 push to main commit 39be0e750964d0acbaa047b2cdcb6f3cde07a98c Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 21:06:42 2024 +0200 introduce EndBug/add-and-commit commit e89f6d888d7afe99f78fe1b4231c8ff832a62a4f Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 18:38:15 2024 +0200 restore GITHUB_TOKEN commit 4238980b1e33a3cfa4d520a2bcae21b738df2279 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 17:35:31 2024 +0200 stup git config commit 9c4a17fcfca85d8f76cc2596590470186db9dc71 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 17:33:37 2024 +0200 authenticate checkout step commit 216b73421fcfe29856499aa4653272dd2e3e5ec7 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 17:15:52 2024 +0200 try setting git user explicitly commit 36828c59228e520f944824d053445157cc084e3a Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 17:01:29 2024 +0200 fetch before checkout commit d6c7c573492eefdf380a0dcb4cb594bd8f977183 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 16:59:34 2024 +0200 fix branch checkout commit 6ff501154ce3c025e6755446088ed5e3dd988733 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 16:56:04 2024 +0200 setup git config commit 051ca4924c0e6a8f776a9e8ed26df51b2a7e84d3 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 16:53:31 2024 +0200 feat: defer the database connection to when it's needed (#753) * defer the database connection to when it's needed * avoid accessing _client, use the property instead * ignore mypy errors on private field commit d914669fa71f9bce299a17abe177f7c89c793c18 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 16:52:54 2024 +0200 generate integrations changelog commit 38dc95f2b60d011476c1fb3b46895b05c416b472 Author: Ruben <[email protected]> Date: Fri May 24 12:05:43 2024 +0200 feat: make get_distance and recreate_collection public, replace deprecated recreate_collection function (#754) * make get_distance and recreate_collection public * replace deprecated recreate_collection function * make on_disk and use_sparse_embeddings optional for recreate_collection * use client.collection_exists instead of try-catch --------- Co-authored-by: Massimiliano Pippi <[email protected]> commit 3a67349dc572269049d65db13d4eed3b5acec637 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 12:03:13 2024 +0200 fix: remove support for generate API (#755) * remove support for generate API * add note about super() commit 29469fa16c4f02b4d0b9d304bcf7e3810bea23e4 Author: Jan Beitner <[email protected]> Date: Fri May 24 11:00:12 2024 +0100 Allow vanilla qdrant filters (#692) * allow vanilla qdrant filters * updated signatures * fix formatting * Fix type check in filter_documents and add small test --------- Co-authored-by: Silvano Cerza <[email protected]> commit 2667d6bc6077a8db176b257bf178a1df45153214 Author: Anushree Bannadabhavi <[email protected]> Date: Fri May 24 05:40:41 2024 -0400 fix: add support for custom mapping in ElasticsearchDocumentStore (#721) * Add custom mapping in ElasticsearchDocumentStore init * Update docstrings and add test * Fix linting * Fix retrievers tests --------- Co-authored-by: Silvano Cerza <[email protected]> commit 95daee37d79799f5134c7bed7889865084aae114 Author: Massimiliano Pippi <[email protected]> Date: Fri May 24 08:15:03 2024 +0200 feat: defer database connection to the first usage (#748) * defer the database connection to the first usage of the client * add test to avoid regressions commit ee2d54baf5dacf3817a0447b8d34390f108b6d2a Author: mohammedsohel <[email protected]> Date: Thu May 23 19:10:42 2024 +0530 adding support of "amazon.titan-embed-text-v2:0" (#735) * adding support of "amazon.titan-embed-text-v2:0" * rectifying the format --------- Co-authored-by: Massimiliano Pippi <[email protected]> commit 428c2a8a9c22dc407a203d3ebc502a153a70505b Author: Massimiliano Pippi <[email protected]> Date: Thu May 23 11:49:12 2024 +0200 amend PR template and CoC commit 660d73ddb48801bc0ea34f056d11494699d77bd1 Author: Massimiliano Pippi <[email protected]> Date: Thu May 23 11:44:17 2024 +0200 add PR template and CoC commit 63cf323abf97c9d7221a610a9dcf2a45c17d3fb3 Author: Vishal <[email protected]> Date: Thu May 23 14:33:32 2024 +0530 fix: max_tokens typo in Mistral Chat (#740) commit 31e61b72571dbc699577af2a573e7031aeb379e8 Author: Tuana Çelik <[email protected]> Date: Wed May 22 12:30:26 2024 +0200 Update _nim_backend.py (#744) * Update _nim_backend.py * Update _nim_backend.py commit 7141c68c25d64192e8fa7e341e9d6fb5eb3f5612 Author: jlonge4 <[email protected]> Date: Sat May 18 05:12:19 2024 -0400 [deepset-ai/haystack-core-integrations#727] (#738) * hybrid retrieval ex * Update integrations/pgvector/examples/hybrid_retrieval.py Co-authored-by: Stefano Fiorucci <[email protected]> * suggested updates * suggested updates * suggested updates --------- Co-authored-by: Stefano Fiorucci <[email protected]> commit c4f1cc48c8e83baf12335ec1c35e424d1a0975e9 Author: Stefano Lottini <[email protected]> Date: Wed May 15 21:49:21 2024 +0200 explicit projection when reading from Astra DB (#733) commit 6f298cec4c3c374f714e9eb753162026666cb525 Author: paulmartrencharpro <[email protected]> Date: Wed May 15 18:11:42 2024 +0200 Use the local_files_only option available as of fastembed==0.2.7. It … (#736) * Use the local_files_only option available as of fastembed==0.2.7. It allows to not look for the models online, but only use the local, cached, files. This way, we can download the model once then use this without internet access * Fix lint issues * add same param to doc embedder --------- Co-authored-by: anakin87 <[email protected]> commit 0e02fd65f432f10c77fcd066a7064bf5ba7223a3 Author: Ulises M <[email protected]> Date: Mon May 13 05:44:45 2024 -0700 basic implementation of llama.cpp chat generation (#723) * basic implementation of llama.cpp chat generation allows for constraining to json allows for function calling (not tested) streaming needs to be implemented when stream is set to true in generation_kwargs * add testing * remove unnecessary function * slight documentation fix, comment out broken test * support for function calling through functionary also add a basic rag test * add function calling and execute test, it works! * add json test, add chatml test * make function call and execute more deterministic * try removing additional deps * revert * make transformers a tests-only dependency --------- Co-authored-by: Stefano Fiorucci <[email protected]> commit d4a598b6f5d3287ead9e8c234b001ea0f15e376b Author: Jon <[email protected]> Date: Fri May 10 00:30:48 2024 -0700 Implement filters for chromaQueryTextRetriever via existing haystack filters logic (#705) * Implement filters for chromaQueryTextRetriever via existing haystack filters logic Run linter * un-skip tests --------- Co-authored-by: Massimiliano Pippi <[email protected]> commit c29db9c913c8326c28d2f3de091429d95ee73b38 Author: Daria Fokina <[email protected]> Date: Thu May 9 19:10:51 2024 +0200 missing api references (#728) commit 7b4428d934e4304f186cd01e02bcf239a43f5729 Author: Vladimir Blagojevic <[email protected]> Date: Wed May 8 17:30:52 2024 +0200 chore: Use ChatMessage to_openai_format, update unit tests, pydocs (#725) * Use ChatMessage to_openai_format, update unit tests, pydocs * Minor pydocs fixes, turn off integration tests for nightly runs * Run only unit tests against haystack-ai main nightly commit 758e5f372bea65b6ffa149c06a3ab2431d7e7237 Author: jlonge4 <[email protected]> Date: Wed May 8 04:10:55 2024 -0400 feat: Implement keyword retrieval for pgvector integration (#644) * keyword retriever * add lang to init for tests * make suggested edits/test * fixes to test / lint * index check query change * SQLLiteral fix * table name quotes * table name quotes * table name quotes * test query edit * remove meta * move keyword index to init * move keyword index to init * move keyword index to init * keyword with filters test * keyword with filters test * keyword with filters test * keyword with filters test * keyword with filters test * keyword with filters test * keyword with filters test * keyword with filters test * keyword with filters test * keyword with filters test * more tests * rename example --------- Co-authored-by: anakin87 <[email protected]> commit 3c14c52fc38903ec907d28da384b0a36119a3892 Author: Silvano Cerza <[email protected]> Date: Tue May 7 17:03:50 2024 +0200 Update Nvidia integration to support new endpoints (#701) * Add support for Nvidia catalog API for generator * Add support for Nvidia catalog API for embedders * Add NVIDIA_CATALOG_API_KEY in Nvidia integration workflow * Enable ruff auto formatting for tests * Fix linting * Simplify Secret import and enhance docstring Co-authored-by: Madeesh Kannan <[email protected]> * Add deprecation warnings for NvcfBackend * Add truncate parameter for embedders * Fix linting * Use enum for truncate mode in embedders * Change how truncate argument is handled * Fix truncate conversion * Update truncate docstring --------- Co-authored-by: Madeesh Kannan <[email protected]> commit 975e0e528cb4f01a92bc95905fe948674c47aaed Author: Massimiliano Pippi <[email protected]> Date: Tue May 7 10:57:05 2024 +0200 fix: make unit tests pass (#720) * make unit tests pass * linting commit 8a1242f64e7825238101b646f687ced76177f397 Author: Vladimir Blagojevic <[email protected]> Date: Mon May 6 21:04:54 2024 +0200 Fix langfuse nightly tests (#716) commit f61db6d3a24cbd9bb5327a8765bf0c8272639754 Author: Massimiliano Pippi <[email protected]> Date: Mon May 6 17:56:21 2024 +0200 change the pydoc renderer class (#718) commit e5667e78b222135dbaddb77f51d6c096650cdd38 Author: Massimiliano Pippi <[email protected]> Date: Mon May 6 17:54:08 2024 +0200 pass the haystack docs version when generating docs (#719) commit d30c0eafef3dd399e37a8e4168c90a418e6e5716 Author: Stefano Fiorucci <[email protected]> Date: Mon May 6 14:58:23 2024 +0200 FastembedTextEmbedder - remove batch_size (#688) commit da46c9c5069ad78c75456dfa2f8684e7ba37d55e Author: Massimiliano Pippi <[email protected]> Date: Mon May 6 09:58:12 2024 +0200 Update README.md commit 9659b1305f8750ff1c8a675d054f3a7c360fd0af Author: Dmitry <[email protected]> Date: Mon May 6 10:43:57 2024 +0300 Type hints in pgvector document store updated for 3.8 compability (#704) commit 04fb950f182fb9d69af3bf42fee26d32206a9a27 Author: Massimiliano Pippi <[email protected]> Date: Fri May 3 19:01:36 2024 +0200 fix: add multi-line variable to step output in the right way (#714) * try * try * remove testing code commit 8b916a33ffc25e549ae52728515daf30fa1da5bb Author: Massimiliano Pippi <[email protected]> Date: Fri May 3 17:53:30 2024 +0200 Follow up: update Cohere integration to use Cohere SDK v5 (#711) * add support for python client v5 * linting commit 1d0a5568178340bf4abbdaeecd7b690ebfc7ddec Author: Massimiliano Pippi <[email protected]> Date: Fri May 3 17:46:25 2024 +0200 chore: sync integrations docs with all the available Haystack versions (#713) * sync docs with all the versions * automatically fetch versions * Update CI_readme_sync.yml Co-authored-by: Silvano Cerza <[email protected]> --------- Co-authored-by: Silvano Cerza <[email protected]> commit 5c7e1b74e58ddb7943999a8f0ca24cfb0d834884 Author: Chris Knight <[email protected]> Date: Fri May 3 10:53:45 2024 +0100 fix: Weaviate schema class name conversion which preserves PascalCase (#707) * fix: Weaviate schema class name conversion which preserves PascalCase * adding a test case for schema name conversion * linting * formatting --------- Co-authored-by: David S. Batista <[email protected]> commit ca87b0edcf607daf80158c13ec182a4f578b6667 Author: Jon <[email protected]> Date: Thu May 2 09:46:22 2024 -0700 Feature/bump chromadb dep to 0.5.0 (#700) * [DEP]: Bump chromadb version Remove specific version pin * Fix: chromadb tests failing due to sort order variance Fix: linting error * Add OllamaEmbeddingFunction to function registry Linting commit 48521156d9fa1af1afb332395ee430d58fcb9428 Author: Vladimir Blagojevic <[email protected]> Date: Thu May 2 18:08:19 2024 +0200 feat: Langfuse integration (#686) * Add langfuse integration * Trace pipeline run * Integration admin additions * Pydoc config * Capture trace url in tracer component * Add integration test, update example * Linting * Add haystack-pydoc-tools dep * Add comprehensive README * Handle both ChatMessage and str payloads * Renaming * Versioning scheme * Pydocs, add public trace flag * Add hatch-vcs dep * Use OPENAI_API_KEY secret * update docstrings * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * lint fixups * Improve test, previous version always returned 200 * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * Update integrations/langfuse/README.md Co-authored-by: Daria Fokina <[email protected]> * Add details about Langfuse keys * Pylint --------- Co-authored-by: Massimiliano Pippi <[email protected]> Co-authored-by: Daria Fokina <[email protected]> commit 26a16b69398e3dc9894f2a853ba817c728805062 Author: Julian Risch <[email protected]> Date: Thu May 2 13:06:36 2024 +0200 docs: Add Ranker type to Cohere and Jina in inventory (#708) commit 667162fe6a08dc052063919f52c40479cb574226 Author: Alex W <[email protected]> Date: Thu May 2 04:50:17 2024 -0400 [Cohere] …
Related Issues
Proposed Changes:
Update
_query_result_to_documents
to also run without metadataHow did you test it?
Added a unit test
Notes for the reviewer
NA
Checklist
fix:
,feat:
,build:
,chore:
,ci:
,docs:
,style:
,refactor:
,perf:
,test:
.