Minimal implementation of OpenAI's Realtime Voice API. The aim of this project is to provide a voice assistant simple enough to test the capabilities and limitations of Voice Agents created with the API.
The code heavily borrows from the OpenAI Realtime Console example.
- This is a Next.js app written in TypeScript. You can run it locally with `npm run dev` after installing the necessary dependencies with `npm install`.
- The app requires a valid OpenAI API key to be set in the `OPENAI_API_KEY` environment variable.
- The Voice Agent is a push-to-talk agent, so you need to click the Record button and then the Send button to input your audio.
- Voice activity detection (VAD) is tricky to test reliably, as the exact algorithm used by OpenAI is not documented (might get back to this later).
- The app uses WebSockets to connect to the OpenAI API, so make sure your firewall allows WebSocket connections (see the connection sketch after this list).
- Tested on Chrome and Safari. Does not work on desktop Firefox; works fine on mobile browsers.
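
The snippet below is a minimal sketch of how a push-to-talk turn can be driven over the Realtime WebSocket API from Node. It is not the app's actual code: the model name, the event payloads, and the `sendAudioTurn` helper are assumptions for illustration, so check the OpenAI Realtime documentation for the authoritative event schema.

```typescript
import WebSocket from "ws";

// Assumed model identifier; check the Realtime API docs for current model names.
const MODEL = "gpt-4o-realtime-preview";

const ws = new WebSocket(`wss://api.openai.com/v1/realtime?model=${MODEL}`, {
  headers: {
    Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
    "OpenAI-Beta": "realtime=v1",
  },
});

ws.on("open", () => {
  // Disable server-side VAD so the session behaves as push-to-talk:
  // audio is only processed when a buffer is explicitly committed.
  ws.send(
    JSON.stringify({
      type: "session.update",
      session: { turn_detection: null },
    })
  );
});

// Hypothetical helper: send one recorded utterance (16-bit PCM, base64-encoded)
// and ask the model to respond — roughly what Record + Send do in the UI.
function sendAudioTurn(pcm16Audio: Buffer): void {
  ws.send(
    JSON.stringify({
      type: "input_audio_buffer.append",
      audio: pcm16Audio.toString("base64"),
    })
  );
  ws.send(JSON.stringify({ type: "input_audio_buffer.commit" }));
  ws.send(JSON.stringify({ type: "response.create" }));
}

ws.on("message", (data) => {
  // Transcripts, audio deltas, and errors all arrive as typed server events.
  const event = JSON.parse(data.toString());
  console.log("server event:", event.type);
});
```

In this sketch, setting `turn_detection` to `null` is what makes the session push-to-talk rather than VAD-driven; swapping it for a server VAD configuration would let the API detect turn boundaries itself.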
As with any LLM, the responses are not deterministic, and the model may hallucinate. No additional guardrails are in place to prevent the model from producing harmful or inappropriate content. Please be mindful of this when using or extending the code.