This project demonstrates how to deploy a set of resources on AWS to implement a MongoDB Hybrid-Search powered Retrieval-Augmented Generation (RAG) architecture. The stack (`MdbBedrockActionsStack`) includes:
- An S3 Bucket to ingest PDF documents into a Knowledge Base.
- MongoDB Atlas as the Knowledge Base Vector Store.
- A Lambda function to synchronize (ingest, update, remove) PDFs added to the S3 Bucket.
- A Lambda function serving as an entry point for performing Hybrid Search (Vector + Full-Text) against MongoDB Atlas.
- `terraform init`: initializes the Terraform project and downloads provider plugins
- `terraform plan`: creates an execution plan, showing proposed changes
- `terraform apply`: deploys the stack by applying the changes

For more info and examples, check the Terraform MongoDB Atlas Provider.
- AWS CLI configured with appropriate permissions.
- Terraform installed.
- MongoDB Atlas account and API keys.
- Node.js and npm installed.
```
.
├── functions
│   ├── package.json            # Node.js dependencies
│   ├── common                  # Shared modules
│   ├── ingest                  # Sync Lambda function code
│   └── retrieval               # Hybrid Search retrieval function code
├── mdb-bedrock-actions.tf      # The bulk of the stack
├── cloudwatch-logs.tf          # Resources for logging
├── variables.tf                # Expected variables
└── README.md                   # Project documentation
```
An S3 Bucket is created to ingest PDF documents. The bucket is configured with event notifications to trigger the synchronization Lambda function whenever a PDF is added, updated, or removed.
MongoDB Atlas is used as the Knowledge Base Vector Store. Ensure you have your MongoDB Atlas API keys and connection string ready. The stack will create necessary collections and indexes for vector and full-text search.
This Lambda function is triggered by S3 events. It handles the ingestion, update, and removal of PDFs in the MongoDB Atlas Knowledge Base.
This Lambda function serves as an entry point for performing hybrid searches (Vector + Full-Text) in MongoDB Atlas. It can be invoked via API Gateway or other AWS services.
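A hybrid search of this kind is typically built as an aggregation pipeline that runs a `$vectorSearch` leg and a `$search` leg and fuses them with reciprocal rank fusion (RRF). The sketch below shows one way to build such a pipeline; the index names (`vector_index`, `text_index`), collection name (`chunks`), field names, and RRF constant are assumptions, not the project's actual configuration:

```javascript
// Sketch of a hybrid-search aggregation builder using reciprocal rank fusion.
// Each leg ranks its own results; a document's final score is the sum of
// 1 / (rrfK + rank) over the legs it appears in.
function buildHybridPipeline(queryVector, queryText, { k = 10, limit = 5, rrfK = 60 } = {}) {
  const rrfScore = (rankField) => ({ $divide: [1.0, { $add: [rankField, rrfK] }] });
  return [
    // 1) Vector leg: semantic nearest neighbours on the embedding field.
    { $vectorSearch: { index: "vector_index", path: "embedding",
        queryVector, numCandidates: k * 10, limit: k } },
    { $group: { _id: null, docs: { $push: "$$ROOT" } } },
    { $unwind: { path: "$docs", includeArrayIndex: "rank" } },
    { $addFields: { vs_score: rrfScore("$rank") } },
    { $project: { vs_score: 1, _id: "$docs._id", text: "$docs.text" } },
    // 2) Full-text leg, merged in with $unionWith.
    { $unionWith: { coll: "chunks", pipeline: [
        { $search: { index: "text_index", text: { query: queryText, path: "text" } } },
        { $limit: k },
        { $group: { _id: null, docs: { $push: "$$ROOT" } } },
        { $unwind: { path: "$docs", includeArrayIndex: "rank" } },
        { $addFields: { fts_score: rrfScore("$rank") } },
        { $project: { fts_score: 1, _id: "$docs._id", text: "$docs.text" } },
    ] } },
    // 3) Fuse: sum the two scores per document and keep the top results.
    { $group: { _id: "$_id", text: { $first: "$text" },
        score: { $sum: { $add: [{ $ifNull: ["$vs_score", 0] }, { $ifNull: ["$fts_score", 0] }] } } } },
    { $sort: { score: -1 } },
    { $limit: limit },
  ];
}
```

The pipeline would be passed to `collection.aggregate(...)` with the query text embedded beforehand (e.g. via a Bedrock embeddings model) to obtain `queryVector`.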
- Clone the repository:

  ```shell
  git clone https://github.com/your-repo/rag-architecture.git
  cd rag-architecture
  ```

- Install dependencies:

  ```shell
  cd functions/; npm install; cd ..
  ```

- Bootstrap the Terraform environment:

  ```shell
  terraform init
  ```

- Configure AWS credentials:

  Ensure your AWS and MongoDB Atlas credentials are set up. This can be done using environment variables:

  ```shell
  export AWS_SECRET_ACCESS_KEY='<aws secret key>'
  export AWS_ACCESS_KEY_ID='<aws key id>'
  ```

  ... or the `~/.aws/credentials` file:

  ```shell
  $ cat ~/.aws/credentials
  [default]
  aws_access_key_id = your key id
  aws_secret_access_key = your secret key
  ```

  ... or follow the `variables.tf` file and create a `terraform.tfvars` file with all the variable values, e.g.:

  ```
  access_key = "<AWS_ACCESS_KEY_ID>"
  secret_key = "<AWS_SECRET_ACCESS_KEY>"
  ```

- Deploy the Terraform stack:

  ```shell
  terraform plan
  terraform apply
  ```
To integrate the Lambda functions with your existing MongoDB Atlas cluster, follow these steps:
- Obtain the MongoDB connection string:
  - Log in to your MongoDB Atlas account.
  - Navigate to your cluster and click "Connect".
  - Choose "Connect your application" and copy the connection string.
- Set environment variables:

  Set the `mongodb_conn_string` or `mongodb_conn_secret` variables:

  - Option 1: Update the `mongodb_conn_string` environment variable for the `ingestLambda` and `retrievalLambda` configurations with your MongoDB connection string.
  - Option 2 (recommended): Update the `mongodb_conn_secret` environment variable for the `ingestLambda` and `retrievalLambda` configurations with a secret that contains your MongoDB connection string.

  This can be done using environment variables:

  ```shell
  export TF_VAR_mongodb_conn_string="<atlas connection string>"
  export TF_VAR_mongodb_conn_secret="<atlas secret in AWS Secrets Manager>"
  ```

  ... or follow the `variables.tf` file and create a `terraform.tfvars` file with all the variable values, e.g.:

  ```
  access_key = "<AWS_ACCESS_KEY_ID>"
  secret_key = "<AWS_SECRET_ACCESS_KEY>"
  mongodb_conn_string = "<MONGODB_CONN_STRING>"
  mongodb_conn_secret = "<MONGODB_CONN_SECRET>"
  ```
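Inside the Lambda functions, the two options can be resolved along these lines. This is a minimal sketch; the environment variable names (`MONGODB_CONN_SECRET`, `MONGODB_CONN_STRING`) and helper names are assumptions, not the actual code in `functions/common`:

```javascript
// Sketch of connection-string resolution inside a Lambda (illustrative).
// Option 2 (a Secrets Manager secret) is preferred over a plaintext string,
// so the secret reference wins when both are configured.
function selectConnSource(env) {
  if (env.MONGODB_CONN_SECRET) return { source: "secrets-manager", ref: env.MONGODB_CONN_SECRET };
  if (env.MONGODB_CONN_STRING) return { source: "plaintext", ref: env.MONGODB_CONN_STRING };
  throw new Error("No MongoDB connection configured");
}

// fetchSecret would wrap @aws-sdk/client-secrets-manager's
// GetSecretValueCommand; it is injected here to keep the sketch testable.
async function resolveConnString(env, fetchSecret) {
  const { source, ref } = selectConnSource(env);
  return source === "secrets-manager" ? fetchSecret(ref) : ref;
}
```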
- Create Indexes:

  Ensure that your MongoDB collections have the necessary indexes for vector and full-text search. You can create these indexes using the MongoDB Atlas UI or via the MongoDB shell.

  - Vector Search Index (learn more):

    ```json
    {
      "fields": [
        { "numDimensions": 1024, "path": "embedding", "similarity": "cosine", "type": "vector" },
        { "path": "metadata", "type": "filter" },
        { "path": "metadata.source", "type": "filter" }
      ]
    }
    ```

    The `filter` fields can be extended with additional paths as needed.

  - Full-Text Search Index (learn more):

    ```json
    {
      "mappings": {
        "dynamic": false,
        "fields": {
          "text": { "type": "string" }
        }
      }
    }
    ```
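The same two definitions can also be created programmatically with the MongoDB Node.js driver, which supports `createSearchIndexes` (driver v6+ against Atlas). A sketch, with the index and collection names as assumptions:

```javascript
// Sketch: creating both search indexes with the MongoDB Node.js driver.
// "vector_index" / "text_index" are assumed names; adjust to your setup.
const vectorIndex = {
  name: "vector_index",
  type: "vectorSearch",
  definition: {
    fields: [
      { numDimensions: 1024, path: "embedding", similarity: "cosine", type: "vector" },
      { path: "metadata", type: "filter" },
      { path: "metadata.source", type: "filter" },
    ],
  },
};

const textIndex = {
  name: "text_index",
  type: "search",
  definition: {
    mappings: { dynamic: false, fields: { text: { type: "string" } } },
  },
};

async function createIndexes(collection) {
  // Atlas builds search indexes asynchronously; poll $listSearchIndexes
  // if you need to wait until they become queryable.
  return collection.createSearchIndexes([vectorIndex, textIndex]);
}
```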
The Hybrid Search Lambda can be integrated as an Action Group of a Bedrock Agent to enable a full RAG architecture. Bedrock can be used for:
- Foundation Models: Leveraging pre-trained models for various NLP tasks.
- Prompt Building: Constructing prompts to query the Knowledge Base.
- Guardrails: Ensuring safe and reliable responses.
- Create a Bedrock Agent:
  - Define the agent's purpose and capabilities.
  - Configure the agent to use the Hybrid Search Lambda as an Action Group.
- Configure the Action Group:
  - Set up the Action Group to invoke the Hybrid Search Lambda.
  - Define the input and output formats for the Lambda function.
- Deploy and Test:
  - Deploy the Bedrock Agent.
  - Test the integration by querying the agent and verifying the responses.
By following these steps, you can leverage Bedrock for the Foundation Models, Prompt Building, and Guardrails, while using MongoDB Atlas as the Knowledge Base for a complete RAG architecture.
This project provides an example of how to leverage a RAG architecture using Terraform, S3, MongoDB Atlas, and AWS Lambda. By integrating with Bedrock, you can enhance the architecture with advanced NLP capabilities and ensure robust and reliable responses.
For more information, refer to the Terraform MongoDB Atlas Provider and MongoDB Atlas documentation on Hybrid Search.
Software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See LICENSE.md for the specific language governing permissions and limitations under the License.