DocuConvo is an innovative application that combines traditional documentation with conversational AI capabilities powered by GPT-3.5. This allows organizations to enhance their documentation search experience by enabling users to converse with the documentation.
DocuConvo operates in the following steps:
-
Crawling Documentation Website:
- Our application crawls the entire documentation website provided by the organization.
-
Creating Knowledge Base:
- The crawled information is processed and converted into vector embeddings.
- Vector embeddings are saved into the Pinecone vector database as an index.
-
Search Process:
- When a search request is received from the organization's search bar, it is compared against the knowledge base using vector embeddings.
- Similar vectors are passed to GPT3.5 as context, along with the search query.
To create a knowledge base for their documentation website, organizations need to provide the following details:
-
Documentation Website URL:
- Example:
https://nextjs.org/docs
- Example:
-
Documentation Website URL Match:
- Example:
https://nextjs.org/docs/**
- This is a URL pattern that describes the structure of the documentation URLs. Use a wildcard (
**
) to capture variations.
- Example:
-
CSS Selector for Main Text Content:
- This selector helps identify the main content area of the documentation, increasing the accuracy of the context passed to GPT.
To store vector embeddings, ensuring complete ownership of your data:
- Pinecone API Key
- Pinecone Index Name
- Pinecone Environment
The last step is to enter the OpenAI API key, which will be used to generate responses for search queries with documentation context.
import { Docuconvo } from 'docuconvo'
const docuconvo = new Docuconvo({
docuconvo_key: 'your-docuconvo-key'
})
const { answer, message, error } = await docuconvo.search(searchQuery)
DocuConvo draws inspiration from the BuilderIO/gpt-crawler project, GPT-Crawler focuses on crawling documentation websites to generate knowledge files for use with OpenAI assistants, DocuConvo takes it a step further by directly integrating the conversational search capability into the documentation website itself.
By combining the information retrieval capabilities of a web crawler with the natural language processing power of GPT-3.5, DocuConvo provides an immersive and interactive experience for users seeking information within documentation.
A big thank you to the following contributors who have played a significant role in the development of DocuConvo: