Real-Time Voice Chat Experiments

This repository contains two implementations of real-time voice chat applications using OpenAI's Realtime API with WebRTC.

Prerequisites

Python 3.8+
OpenAI API key with Realtime API access
Modern web browser with WebRTC support

Installation

Clone the repository
Install the required packages:

pip install fastapi uvicorn requests termcolor

Set your OpenAI API key as an environment variable:

# For Linux/Mac
export OPENAI_API_KEY=your_api_key_here

# For Windows
set OPENAI_API_KEY=your_api_key_here

Applications

1. Basic Voice & Text Chat (`1_basic_voice_text_chat.py`)

A straightforward implementation of two-way voice and text chat with the following features:

Real-time voice communication using WebRTC
Text chat capability
Beautiful dark mode UI with glass-morphism effects
Error handling and status updates
Animated UI elements

To run:

python 1_basic_voice_text_chat.py

2. Enhanced Chat with Real-time Classification (`2_out_of_band_responses.py`)

An enhanced version that adds real-time conversation classification:

All features from the basic version
Real-time conversation classification into categories:
- General
- Philosophical
- Math
- Technology
Classifications for both voice and text inputs
Side panel showing conversation classifications with timestamps
Color-coded classification display
Out-of-band processing to maintain smooth chat experience

To run:

python 2_out_of_band_responses.py

Technical Details

WebRTC Implementation

Uses OpenAI's Realtime API for WebRTC signaling
Handles audio streams for voice communication
Manages data channels for text and control messages
Implements proper connection lifecycle management

Out-of-Band Processing (Enhanced Version)

Uses separate processing for classifications without affecting main conversation
Analyzes entire conversation context for accurate classification
Implements metadata-based response handling
Maintains conversation state independently of classifications

UI Features

Built with Tailwind CSS and DaisyUI
Responsive design
Glass-morphism effects
Smooth animations using CSS transitions
Dark mode optimized

Usage

Start either application using the commands above
Click "Connect & Start Chat"
Allow microphone access when prompted
Start chatting using either:
- Voice (just speak)
- Text (type and press Enter or click Send)
For the enhanced version, watch the classifications appear in real-time

Error Handling

Both implementations include comprehensive error handling for:

API key issues
Connection problems
Microphone access
WebRTC negotiation
Data channel communication

Errors are:

Logged to the console
Displayed in the UI
Color-coded in the terminal (using termcolor)

Security Notes

Uses ephemeral tokens for client-side API access
Handles API keys securely through environment variables
Implements proper WebRTC security practices

Limitations

Maximum session duration: 30 minutes
Requires modern browser with WebRTC support
Needs stable internet connection for voice chat
API key must have Realtime API access enabled

Contributing

Feel free to submit issues and enhancement requests!

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
backend		backend
documentation		documentation
static		static
templates		templates
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-Time Voice Chat Experiments

Prerequisites

Installation

Applications

1. Basic Voice & Text Chat (`1_basic_voice_text_chat.py`)

2. Enhanced Chat with Real-time Classification (`2_out_of_band_responses.py`)

Technical Details

WebRTC Implementation

Out-of-Band Processing (Enhanced Version)

UI Features

Usage

Error Handling

Security Notes

Limitations

Contributing

About

Releases

Packages

Languages

unseen22/Out-of-band-responses-template-for-OpenAI-real-time-voice-chat

Folders and files

Latest commit

History

Repository files navigation

Real-Time Voice Chat Experiments

Prerequisites

Installation

Applications

1. Basic Voice & Text Chat (1_basic_voice_text_chat.py)

2. Enhanced Chat with Real-time Classification (2_out_of_band_responses.py)

Technical Details

WebRTC Implementation

Out-of-Band Processing (Enhanced Version)

UI Features

Usage

Error Handling

Security Notes

Limitations

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. Basic Voice & Text Chat (`1_basic_voice_text_chat.py`)

2. Enhanced Chat with Real-time Classification (`2_out_of_band_responses.py`)

Packages