VoxNovel is an innovative program that leverages the capabilities of booknlp to analyze literature, attribute quotations to specific characters, and generate a tailored audiobook where each character has a distinct voice through coqui tts. This not only provides an immersive audiobook experience but also brings each character to life with a unique voice, making the listening experience much more engaging.
DEMO
High Quality Tortoise Demos
272463996-c4f8dfdf-c5bd-4771-ab1a-6131c22a67b2.mp4
Super fast audio Balacoon Demos
271878548-53b694a9-5ddd-4174-82bc-07aff22d2330.mp4
271876316-530e8781-c77c-4424-89cd-a02df363b0bf.mp4
Run in this order:
sudo apt-get install calibre
conda create --name VoxNovel python=3.10
conda activate VoxNovel
git clone https://github.com/DrewThomasson/VoxNovel.git
cd VoxNovel
pip install tts
pip install booknlp
pip install -r Ubuntu_requirements.txt
python -m spacy download en_core_web_sm
sudo -v && wget -nv -O- https://download.calibre-ebook.com/linux-installer.sh | sudo sh /dev/stdin
also download it from the discovery store or flatpac I did both on my steam deck
mkdir -p ~/miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh
bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
rm -rf ~/miniconda3/miniconda.sh
~/miniconda3/bin/conda init bash
~/miniconda3/bin/conda init zsh
restart the terminal(close out and open a new window)
conda create --name VoxNovel python=3.10
conda activate VoxNovel
git clone https://github.com/DrewThomasson/VoxNovel.git
cd VoxNovel
sudo pacman -S espeak-ng
(make sure you have pacman fully working, there should be stuff online for the steam deck) guide on getting pacman wokring on steam deck(https://www.reddit.com/r/SteamDeck/comments/t8al0i/install_arch_packages_on_your_steam_deck/) you might have to reset the keys if something goes wrong with that: Resetting all the keys
Remove or reset all the keys installed in your system by removing the /etc/pacman.d/gnupg directory (as root) and by rerunning pacman-key --init followed by pacman-key --populate to re-add the default keys.
15. pip install tts
16. pip install booknlp
17. pip install -r SteamDeck_requirements.txt
18. pip3 install spacy
19. python3 -m spacy download en_core_web_sm
20. sudo pacman -S calibre
21. mv ~/miniconda3/envs/VoxNovel/lib/libstdc++.so.6 ~/miniconda3/envs/tts/lib/libstdc++.so.6.bak
pip install mechanize
pip install bs4
pip install css_parser
brew install calibre
conda create --name VoxNovel python=3.10
conda activate VoxNovel
git clone https://github.com/DrewThomasson/VoxNovel.git
cd VoxNovel
pip install tts
pip install booknlp
8.pip install -r MAC-requirements.txt
pip spacy
10.python -m spacy download en_core_web_sm
So becuase of booknlp windows issues all of this will be run in wsl(don't worry its still easy)
- In your powershell paste
wsl --install
to install wsl - After you set the username and password you'll now be able to open wsl as a application this will allow you to run ubuntu software
You'll now want to open wsl if you closed it and paste these commands
(This will install calibre for ebook conversion)
cd ~
sudo apt update
sudo apt upgrade
sudo apt install calibre
(This will install miniconda)mkdir -p ~/miniconda3 wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3 rm -rf ~/miniconda3/miniconda.sh
(This will make sure miniconda is initalized)~/miniconda3/bin/conda init bash ~/miniconda3/bin/conda init zsh
- Close out of your current wsl window and reopen it to be able to use miniconda
conda create --name VoxNovel python=3.10
conda activate VoxNovel
cd ~
git clone https://github.com/DrewThomasson/VoxNovel.git
cd VoxNovel
pip install tts
pip install booknlp
pip install -r Ubuntu-requirements.txt
pip install spacy
python -m spacy download en_core_web_sm
sudo apt install espeak-ng
You can acess the files on your wsl ubuntu in windows file exploror by putting this into the address\\wsl.localhost\Ubuntu\home\
python gui_run.py
This means all you do is select the book and all the voices will be auto assigned and generated for you.
python auto_noGui_run.py
folders
Final_combined_output_audio: -This is where all of your final audio files will be put in order of chapter num
-free and entirly locally run -supports all ebook file formats by using calibre -can run on cpu or cuda gpu -autoselects a starting estimated voice by pronouns per character -supports all models in Coqui TTS and all voices in the models -easily create a new voice actor in seconds through voice cloneing in gui -can play audio by clicking on the text in the book viewer in gui -Ability to regenerate specific lines if they came out weird
-Ability to change the character for a line if incorectly attributed by booknlp -Ability to regenerate specific lines -Make it so that all the included voices and models already have their premade own demo voices -Make it so that the demo audio for the cloned voices is the not their refrence audio but what their voices sound like generated -Using whisper transcriptions to cut hallucinations out of generated audio