This web application allows users to upload yacht insurance inquiry emails, extract and display relevant information, and export the data for further analysis. The app is built with Streamlit, using OpenAI’s language model for text analysis and processing, and is integrated with additional tools for document analysis and semantic search.
- OpenAI (GPT-4): For natural language processing and data extraction.
- Tavily: For internet browsing.
- Pandas: For managing and displaying extracted data.
- Streamlit: For the web application interface.
Ensure you have Python installed on your machine (version 3.8 or later).
-
Clone the Repository:
git clone github.com/tanyaaton/inquiry-emails-extraction cd inquiry-emails-extraction
-
Install Dependencies: Install all required packages from the
requirements.txt
file:pip install -r requirements.txt
-
Environment Variables: Create a
.env
file in the root directory and set up the following variables:OPENAI_API_KEY=your_openai_key TAVILY_API_KEY=your_tavily_key
-
Run the App: Start the Streamlit app:
streamlit run app.py
- Upload and Process Emails: Upload emails for content extraction.
- Automated Information Extraction: Uses an LLM model to extract key information fields.
- Auto Fill Information: Leverages the Tavily agent to conduct internet searches and automatically fill the not provided data (e.g., yacht specifications, insurance requirements, and regional data).
- Visibility Toggle: Select which customer data to display in the main view.
- Data Export: Download the processed data as a CSV file.
- Persistent Storage: Data is stored within the app's session state for easy access during the session.
- 🔗 Notion link
- 🎥 Demo VDO