qdrant requirements

RasaHQ · Jun 27, 2024 · cc67006 · cc67006
1 parent 70f7593
commit cc67006
Show file tree

Hide file tree

Showing 4 changed files with 30 additions and 12 deletions.
diff --git a/.github/workflows/continous-integration.yml b/.github/workflows/continous-integration.yml
@@ -48,6 +48,11 @@ jobs:
       - name: Set up virtual environment
         run: poetry config virtualenvs.in-project true
 
+      - name: Install pip dependencies from requirements file
+        run: |
+          source .venv/bin/activate
+          pip install -r qdrant-requirements.txt
+
       # Authenticate with gcloud for release registry (where Rasa is published)
       - id: "auth-release"
         name: Authenticate with gcloud for release registry 🎫

diff --git a/README.md b/README.md
@@ -221,6 +221,11 @@ After you cloned the repository and are authenticated, follow the installation s
    ```
    pyenv local 3.10.12
    ```
+3. Create and activate virtual environment
+   ```
+   pyenv virtualenv 3.10.12 new_venv
+   pyenv activate new_venv
+   ```
 3. Install the dependencies with `poetry`
    ```
    poetry install
@@ -232,22 +237,24 @@ After you cloned the repository and are authenticated, follow the installation s
    OPENAI_API_KEY=<your openai api key>
    RASA_DUCKLING_HTTP_URL=<url to the duckling server>
    ```
-5. [Optional] Set up the extractive search:
+
+5. Set up the extractive search:
    - Setup a local docker instance of Qdrant
       ```
       docker pull qdrant/qdrant
       docker run -p 6333:6333 -p 6334:6334 \
          -v $(pwd)/qdrant_storage:/qdrant/storage:z \
          qdrant/qdrant
       ```
-   - Update the virtual environment
-      ```
-      poetry add datasets cohere qdrant-client sentence-transformers
-      ```
-   - Ingest documents from SQUAD dataset (modify the script if qdrant isn't running locally!)
-      ```
-      python scripts/load-data-to-qdrant.py
-      ```
+   - Upload data to Qdrant
+      - Create a virtual environment for the upload:
+         ```
+         pip install -r qdrant-requirements.txt
+         ```
+      - Ingest documents from SQUAD dataset (modify the script if qdrant isn't running locally!)
+         ```
+         python scripts/load-data-to-qdrant.py
+         ```
    You can toggle parameter `use_generative_llm` in config.yml to change the behavior. The answer is selected from the first search result -> metadata -> `answer` key
 
 #### Custom Information Retriever
@@ -366,3 +373,5 @@ or
 ```commandline
 rasa test e2e e2e/tests/path/to/a/target/test.yml
 ```
+
+## Running the project with enterprise search
diff --git a/addons/qdrant.py b/addons/qdrant.py
@@ -1,4 +1,4 @@
-from typing import Text, Any
+from typing import Text, Any, Dict
 
 import structlog
 from langchain.vectorstores.qdrant import Qdrant
@@ -39,7 +39,7 @@ def __init__(self, message: str) -> None:
     def __str__(self) -> str:
         return self.base_message + self.message + f"{self.__cause__}"
 
-def prepare_search_query(tracker_state: dict[str, Any]) -> str:
+def prepare_search_query(tracker_state: Dict[str, Any]) -> str:
     """Uses Cohere to generate a search query from the chat history.
     Args:
         tracker_state: The tracker state.
@@ -97,7 +97,7 @@ def connect(
         )
 
     async def search(
-        self, query: Text, tracker_state: dict[str, Any], threshold: float = 0.0
+        self, query: Text, tracker_state: Dict[str, Any], threshold: float = 0.0
     ) -> SearchResultList:
         """Search for a document in the Qdrant vector store.
 

diff --git a/qdrant-requirements.txt b/qdrant-requirements.txt
@@ -0,0 +1,4 @@
+datasets==2.20.0
+cohere==5.5.8
+qdrant-client==1.9.2
+sentence-transformers==3.0.1