achannn/fiction

Platform for reading/writing stories with AI exploration


What

A platform for writing and sharing stories. At the end of every chapter, readers can talk with a chatbot to learn details about the story's world that never made it onto the page. Authors can supply extra information about the world and characters; this material is hidden from readers but is fed to the chatbot for them to discover through conversation.

Architecture

[Architecture diagram]
  • Writing/reading stories (plus chapters and blobs) is handled through regular Rails CRUD operations
  • Authentication is handled with Devise
  • The client's ChatWindow is a React component; it connects to the Rails server over WebSockets (using ActionCable)
  • When a chapter or blob is updated, a job is enqueued to compute its embedding via the EmbeddingCreator (see the sketch below)
  • When a chat message arrives, a job is enqueued to fetch a chat response from the OpenAI API via the ChatResponder
  • Embeddings and chat responses (anything fetched from the OpenAI API) are cached in Redis
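
As a rough illustration of the update-to-embedding flow, here is a minimal sketch. Only EmbeddingCreator is a name taken from this README; the callback wiring, job name, and `body` column are assumptions.

```ruby
# app/models/chapter.rb -- hypothetical wiring, not the repo's actual code
class Chapter < ApplicationRecord
  belongs_to :story

  # Re-embed whenever the chapter text changes (assumes a `body` column).
  after_save_commit :enqueue_embedding, if: :saved_change_to_body?

  private

  def enqueue_embedding
    CreateEmbeddingJob.perform_later(self.class.name, id)
  end
end

# app/jobs/create_embedding_job.rb
class CreateEmbeddingJob < ApplicationJob
  def perform(klass, record_id)
    record = klass.constantize.find(record_id)
    # EmbeddingCreator is assumed to consult the Redis cache before
    # hitting the OpenAI embeddings endpoint.
    EmbeddingCreator.new(record.body).call
  end
end
```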

What are blobs?

Blobs are text that authors can write to provide extra context and information to the chatbot. They are hidden from readers.
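
A minimal sketch of how that hiding might look server-side, assuming Devise's authenticate_user! and an author-scoped controller (the actual routes and controllers aren't shown in this README):

```ruby
# app/controllers/blobs_controller.rb -- assumed shape: blobs are only
# readable/writable by the story's author; there is no reader-facing route.
class BlobsController < ApplicationController
  before_action :authenticate_user! # Devise

  def index
    story = current_user.stories.find(params[:story_id]) # 404s for non-authors
    render json: story.blobs
  end
end
```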

Reasons for architectural choices

WebSockets over HTTP

The chat runs over a WebSocket connection rather than HTTP. Since the chat follows a one question -> one answer pattern, plain HTTP would seem a natural fit (one request -> one response). However (see the channel sketch after this list):

  • OpenAI API response times are unpredictable, with reports of responses sometimes taking over a minute; a request that runs that long risks HTTP timeouts
  • A POST to submit the question followed by polling for the answer would work, but WebSockets are simpler
  • There are also potential UX problems if a user opens the chat in multiple tabs, such as a confusingly merged chat history
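
Here is a minimal ActionCable sketch of that flow. The channel, stream, and job names are illustrative (only ChatResponder comes from this README), and current_user is assumed to be identified in the ActionCable connection:

```ruby
# app/channels/chat_channel.rb
class ChatChannel < ApplicationCable::Channel
  def subscribed
    stream_for current_user # every open tab receives the same pushed answers
  end

  # Default handler for messages sent by the ChatWindow component.
  def receive(data)
    # Return immediately; the answer is pushed whenever OpenAI responds,
    # so a slow API call never leaves an HTTP request hanging.
    ChatResponseJob.perform_later(current_user.id, data["chapter_id"], data["question"])
  end
end

# app/jobs/chat_response_job.rb
class ChatResponseJob < ApplicationJob
  def perform(user_id, chapter_id, question)
    answer = ChatResponder.new(chapter_id, question).call # OpenAI or Redis cache
    ChatChannel.broadcast_to(User.find(user_id), { answer: answer })
  end
end
```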

Background jobs for interacting with the OpenAI API

  • These requests are slow; it would be bad UX to make web requests wait on their completion
  • Jobs can be configured to retry on rate limiting or transient errors
  • Different jobs have different urgency: chat responses are more urgent than recalculating embeddings after a story update, and separate queues make prioritization easy (see the sketch after this list)
  • It also makes a later move to a microservices architecture easier if scaling requires it, e.g. moving the job consumer into an EmbeddingService
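
A sketch of what the retry and priority wiring could look like with ActiveJob. The queue names and error class are assumptions; the README does not name the queue backend.

```ruby
# Stand-in for whatever error the OpenAI HTTP client raises on a 429.
OpenAIRateLimited = Class.new(StandardError)

class ChatResponseJob < ApplicationJob
  queue_as :chat # drained ahead of :embeddings so answers stay snappy
  # Back off and retry instead of surfacing rate limits to the reader.
  retry_on OpenAIRateLimited, wait: :exponentially_longer, attempts: 5
end

class CreateEmbeddingJob < ApplicationJob
  queue_as :embeddings # lower urgency: a re-embed can lag behind a story edit
  retry_on OpenAIRateLimited, wait: :exponentially_longer, attempts: 10
end
```

With Sidekiq, for example, listing the chat queue before the embeddings queue (without weights) makes workers drain chat jobs first.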

Caching strategy

The caching strategy used is simple:

  • For chat questions, I cache on question/relevantChapters/relevantBlobs. The answer isn't cached on the question alone because the author can update chapters and blobs at any time, which would leave an answer keyed purely on the question stale.
  • For embeddings, I just cache on the text being embedded.

More intelligent strategies could raise the cache hit rate but are out of scope here: e.g. treating "close-enough" embedding distances as hits, or chunking chapters and blobs.
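
As a concrete illustration, here is a hedged sketch of that key scheme; the method and helper names are hypothetical, and Rails.cache is assumed to be backed by Redis:

```ruby
require "digest"

# Answers are keyed on the question *and* digests of the content that
# informed them, so an author edit changes the key and naturally
# invalidates any stale cached answer.
def cached_answer(question, chapters, blobs)
  context = (chapters + blobs).map(&:body).join("\x1f")
  key = ["chat",
         Digest::SHA256.hexdigest(question),
         Digest::SHA256.hexdigest(context)].join(":")

  Rails.cache.fetch(key) do
    ask_openai(question, chapters, blobs) # hypothetical OpenAI call
  end
end
```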
