
In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or a dose of sharp retorts to let off steam, FunGPT has got you covered.



Latest News 🎉

2024
  • [2024/11/30] 📢📺 We've released a project introduction video! You can find the video on Bilibili. If you like it, don't forget to give it a thumbs up 👍 and follow us!
  • [2024/11/19] 🔒🛡️ Incorporated a sensitive word processing module to prevent the model from outputting harmful content.
  • [2024/11/18] 🔧🌟 Completed Quick Start section documentation updates and resolved environment setup bugs.
  • [2024/11/14] 🎉✨ Successfully quantized four models using the LMDeploy tool. The FunGPT family welcomes new members! Models are now available on HuggingFace.
  • [2024/11/13] 🎉✨ Released two new 1.8B models, expanding the FunGPT family! The models are BanterBot_1_8b-chat and BoostBot_1_8b-chat.
  • [2024/11/10] 🎉✨ Launched two brand-new 7B models, adding new members to the FunGPT family! The models are BanterBot-7b-chat and BoostBot-7b-chat.
  • [2024/11/10] 🛠️🎯 Major project updates completed, including fixing known absolute path bugs.
  • [2024/10/28] 🎈🥳 Achieved fine-tuning large language models using xtuner and released the first version of BoostBot_v1!
  • [2024/10/19] 🎉💬 Developed a toolchain to generate fine-tuned dialogue data using the Chat-GLM4 series.
  • [2024/10/03] 🎨🐞 Beautified the system interface and fixed some known bugs.
  • [2024/10/02] 🚀💻 Added model loading and unloading mechanisms to optimize GPU memory usage.
  • [2024/10/01] 😄🐍 Integrated an exception handling module to enhance application stability.
  • [2024/09/28] 👋👋 Completed initial testing and evaluation of individual functionalities for LLM (InternLM2.5_1.8b), ASR (Sensevoice), and TTS (ChatTTS).

🌟 Project Introduction

$\quad$ In this fast-paced world, we all need a little spice to balance our lives. Whether you’re looking for some sweet compliments to lift your spirits or a dose of sharp retorts to blow off steam, FunGPT, developed based on the InternLM2.5 series models, is here to meet your needs.

🍬 Sweet Compliment Mode:

  • Mood Booster 🌟✨: When you’re feeling down, our Sweet Compliment Mode will instantly lift your spirits, just like tasting an incredibly sweet candy.
  • Confidence Fuel Station 💪🌈: Meanwhile, our Praise Master will compliment you in the most suitable and unique ways, making your confidence soar.

🔪 Sharp Retort Mode:

  • Stress Release Valve 💥😤: When you’re feeling overwhelmed, our Retort Mode provides an outlet to blow off steam while delivering sharp remarks.
  • Humorous Roasting Machine 😂👅: The words of the Roast Master are not only sharp but also humorous and imaginative, letting you experience brain-twisting comebacks while having fun.

🤔 Project Highlights

$\quad$FunGPT is built on the cutting-edge InternLM2.5 series models. Using Xtuner, we performed both instruction fine-tuning and full-parameter fine-tuning, enabling the models to meet personalized user needs. To enhance accessibility, we released the 1.8B series of lightweight models, which deliver strong performance despite their reduced size. Additionally, we applied AWQ quantization with LMDeploy to multiple models, saving GPU memory while boosting inference speed! ⚡

$\quad$ This project caters to all users by focusing on the two scenarios of praise and roasting, allowing users to chat with our two "Masters" to regulate their moods. For developers just entering the world of LLMs, we provide extensive documentation and video tutorials covering LLM, ASR, and TTS, making this project an ideal entry point for mastering large model development skills.

$\quad$ Our advantages include, but are not limited to:

  1. 🤗 Master of Compliments: Generate sweet words to brighten your daily life.
  2. 🗯️ Roasting Expert: Tailored responses with sharp wit that engage you in a battle of wits.
  3. 📊 Data Collection Guide: Fully open-source, helping you quickly grasp the creation of fine-tuning datasets.
  4. 📖 Complete LLM Workflow Guide: Comprehensive code and documentation, open-source, making it easy to get started.
  5. 🔊 Complete ASR Workflow Guide: Open everything to help you realize your dream of speech recognition.
  6. 🎙️ Complete TTS Workflow Guide: From basics to advanced, fully open-source with no reservations!
  7. 📂 Clear Structure: Detailed annotations and documentation ensure seamless onboarding.
  8. Model Quantization: Lower the usage barrier and experience the magic of AI anytime, anywhere.
  9. 🎥 Video Tutorials: Stay tuned for our complete project introduction videos!

🖼️ Screenshots

Original_7b_BoostBot BoostBot-7b
Original_7b_BanterBot BanterBot-7b

🏗️ Project Architecture Diagram

Project Architecture Diagram

🧳 Model Collection

| Model | Base | Type | Link |
| --- | --- | --- | --- |
| BanterBot-7b-chat | internlm2_5_chat_7b | Pre-trained + QLoRA fine-tuning | HuggingFace / OpenXLab |
| BoostBot-7b-chat | internlm2_5_chat_7b | Pre-trained + QLoRA fine-tuning | HuggingFace / OpenXLab |
| BanterBot_1_8b-chat | internlm2_5_chat_1_8b | Pre-trained + QLoRA fine-tuning | HuggingFace / OpenXLab |
| BoostBot_1_8b-chat | internlm2_5_chat_1_8b | Pre-trained + QLoRA fine-tuning | HuggingFace / OpenXLab |
| BanterBot-7b-chat-w4a16-4bit | internlm2_5_chat_7b | Pre-trained + QLoRA fine-tuning + w4a16 quantization | HuggingFace / OpenXLab |
| BoostBot-7b-chat-w4a16-4bit | internlm2_5_chat_7b | Pre-trained + QLoRA fine-tuning + w4a16 quantization | HuggingFace / OpenXLab |
| BanterBot_1_8b-chat-w4a16-4bit | internlm2_5_chat_1_8b | Pre-trained + QLoRA fine-tuning + w4a16 quantization | HuggingFace / OpenXLab |
| BoostBot_1_8b-chat-w4a16-4bit | internlm2_5_chat_1_8b | Pre-trained + QLoRA fine-tuning + w4a16 quantization | HuggingFace / OpenXLab |

📂 Project Structure

.
|-- ASR
|   |-- __init__.py
|   |-- models
|   |   `-- sensevoice.py
|   |-- readme.md
|   `-- weights
|       `-- readme.md
|-- Assets
|   |-- animation
|   |   `-- Animation_1.json
|   |-- avatar
|   |   |-- BanterBot.jpg
|   |   |-- BoostBot.jpg
|   |   |-- BoostBot_v2.jpg
|   |   |-- User_v1.jpg
|   |   |-- person1.png
|   |   `-- person2.png
|   |-- gif
|   |   |-- BanterBot-7b.gif
|   |   |-- BoostBot-7b.gif
|   |   |-- Original_7b_BanterBot.gif
|   |   `-- Original_7b_BoostBot.gif
|   |-- image
|   |-- svg
|   |   |-- FunGPT-logo.svg
|   |   `-- openxlab_logo.svg
|   `-- video
|       |-- BanterBot-7b.mp4
|       |-- BoostBot-7b.mp4
|       |-- Original_7b_BanterBot.mp4
|       `-- Original_7b_BoostBot.mp4
|-- Data
|   |-- BanterBot
|   |   |-- feasible_data
|   |   |   `-- readme.md
|   |   |-- raw
|   |   |   `-- readme.md
|   |   |-- readme.md
|   |   |-- sample
|   |   |   `-- readme.md
|   |   |-- scripts
|   |   |   |-- filter_bad_from_conv_data.py
|   |   |   |-- filter_sensitive_words_from_conv_data.py
|   |   |   |-- generate_mutil_conv_chatglm.py
|   |   |   |-- generate_mutil_topic_chatglm.py
|   |   |   |-- generate_self_congnitive_data.py
|   |   |   `-- merge_conv_data_finetune.py
|   |   `-- sensitive_words
|   |       `-- readme.md
|   `-- BoostBot
|       |-- feasible_data
|       |   `-- readme.md
|       |-- raw
|       |   `-- topic.txt
|       |-- sample
|       |   `-- multi_conversation.jsonl
|       `-- scripts
|           |-- generate_mutil_conv_chatglm.py
|           |-- generate_mutil_topic_chatglm.py
|           |-- generate_self_congnitive_data.py
|           `-- merge_conv_data_finetune.py
|-- Docs
|   |-- pictures
|   |   `-- FunGPT.png
|   |-- readme.md
|   `-- user_guides
|       `-- readme.md
|-- Finetune
|   |-- BanterBot
|   |   `-- internlm2_5_chat_7b_qlora_alpaca_e3_copy.py
|   |-- BaseModel
|   |   `-- readme.md
|   `-- BoostBot
|       `-- internlm2_5_chat_7b_qlora_alpaca_e3_copy.py
|-- LICENSE
|-- LLM
|   |-- __init__.py
|   |-- models
|   |   `-- internlm2_5_7b_chat.py
|   |-- readme.md
|   |-- templates
|   |   `-- template.py
|   `-- weights
|       `-- readme.md
|-- README.md
|-- README_en.md
|-- README_zh.md
|-- TTS
|   |-- __init__.py
|   |-- models
|   |   `-- chattts.py
|   |-- readme.md
|   `-- weights
|       `-- readme.md
|-- Test
|   |-- ASR
|   |   |-- example.py
|   |   `-- test_wav.wav
|   |-- TTS
|   |   `-- example.ipynb
|   `-- readme.md
|-- Utils
|   |-- common_utils.py
|   |-- configs.py
|   |-- convert_gif.sh
|   |-- data_utils.py
|   |-- model_settings.py
|   |-- model_utils.py
|   `-- readme.md
|-- Work_dirs
|   |-- ASR
|   |   `-- readme.md
|   `-- TTS
|       `-- readme.md
|-- __init__.py
|-- app.py
|-- env.yaml
|-- pages
|   |-- 1_🍬💖_甜言模式.py
|   |-- 2_💥😤_怼语模式.py
|   `-- 3_🚀💫_待开发ing.py
|-- project_structure.txt
`-- requirements.txt

44 directories, 79 files

⚡ Quick Start

1. Deployment Environment

  • Operating System: Ubuntu 20.04.6 LTS
  • CPU: Intel(R) Xeon(R) Platinum 8369B CPU @ 2.90GHz (Online GPU Server)
  • GPU: NVIDIA A100-SXM4-80GB, NVIDIA-SMI 535.54.03, Driver Version: 535.54.03, CUDA Version: 12.2
  • Python: 3.10.0

2. Key Dependency Information

Python==3.10.0
torch==2.4.1
torch-complex==0.4.4
torchaudio==2.4.1
torchvision==0.16.2
chattts==0.1.1
streamlit==1.38.0
audio-recorder-streamlit==0.0.10

3. Deployment Steps

3.1. Clone the Code or Manually Download the Code to Place on the Server:

git clone https://github.com/Alannikos/FunGPT

3.2. Configure the Python Environment (Recommended to Use Conda)

  • Enter the root directory of the project
cd FunGPT
  • Create a conda environment
conda create -n FunGPT python==3.10.0
  • Install third-party libraries
pip install -r requirements.txt

# This will take approximately 1 hour

3.3 Download Models

3.3.1 TTS Models (Required if using TTS)
  • Install git-lfs. Since model files need to be downloaded, please ensure git-lfs is already installed. Linux users can install it with the following command:
apt install git-lfs
  • Initialize LFS
git lfs install
  • Download the TTS model to the specified path
# 1. Navigate to the specific directory
cd FunGPT/TTS/weights

# 2. Download the model from huggingface
git clone https://huggingface.co/2Noise/ChatTTS
  • For users unable to access HuggingFace, download from the mirror source
# 2. Download the model from the mirror source
git clone https://hf-mirror.com/2Noise/ChatTTS
3.3.2 ASR Models (Required if using ASR)

As model files need to be downloaded, please ensure git-lfs is already installed. Linux users can install it using the following command:

# Users who have already downloaded can ignore this command

apt install git-lfs
  • Initialize LFS
git lfs install
  • Download the ASR model to the specified path
# 1. Navigate to the specified directory
cd FunGPT/ASR/weights

# 2. Download the model from huggingface
git clone https://huggingface.co/FunAudioLLM/SenseVoiceSmall

  • For users unable to access HuggingFace, download from the mirror source
# 2. Download the model from the mirror source
git clone https://hf-mirror.com/FunAudioLLM/SenseVoiceSmall
3.3.3 LLM Models (Required)

$\quad$ For the LLM, we provide several options. The best-performing models are BanterBot-7b-chat and BoostBot-7b-chat, and their quantized versions also perform very well. The 1_8B models likewise deliver solid performance. To save download time, we use the two 1_8B models, BanterBot_1_8b-chat and BoostBot_1_8b-chat, as examples below, but you can substitute other models as needed.

  • Initialize LFS
git lfs install
  • Download the LLM models to the specified path
# 1. Navigate to the specified directory
cd FunGPT/LLM/weights

# 2. Download the BanterBot-1_8b-chat model from huggingface
git clone https://huggingface.co/Alannikos768/BanterBot_1_8b-chat

# 3. Download the BoostBot-1_8b-chat model from huggingface
git clone https://huggingface.co/Alannikos768/BoostBot_1_8b-chat

  • For users unable to access HuggingFace, download from OpenXLab

# 2. Download the BanterBot-1_8b-chat model from OpenXLab (for users in China)
git clone https://code.openxlab.org.cn/Alannikos/BanterBot-1_8b-chat.git

# 3. Download the BoostBot-1_8b-chat model from OpenXLab (for users in China)
git clone https://code.openxlab.org.cn/Alannikos/BoostBot-1_8b-chat.git

3.4 Run the Web Script

conda activate FunGPT

streamlit run app.py --server.address=127.0.0.1 --server.port=7860

4. Model Experience

  • If running on a remote server, port forwarding is needed
ssh -p 46411 user@ip -CNg -L 7860:127.0.0.1:7860 -o StrictHostKeyChecking=no
  • Then, to experience the application

Open your browser, go to http://127.0.0.1:7860, and click into the corresponding page to experience FunGPT.

📚 Detailed Guide

Data Generation Guide

$\quad$ In the fine-tuning process of large models, we can apply various algorithms for SFT, whether through native techniques like LoRA fine-tuning or through packaged tools such as Xtuner and LLaMA-Factory; in every case we need high-quality fine-tuning data. However, manual data annotation is costly, which makes it impractical for most individual developers. This project therefore uses ChatGLM4-Flash, available for free from Zhipu, to generate the multi-turn dialogue datasets we need. This approach is simple and easy to control, and the generated datasets can generally achieve satisfactory fine-tuning results. For the specific data generation guide, refer to the data_generation document.

$\quad$ In our documentation, we will mainly introduce the process of creating a multi-turn dialogue dataset and how to construct a self-awareness dataset. With these two parts of the dataset, you can basically fine-tune a large language model for downstream tasks.
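
$\quad$ As a concrete illustration, here is a minimal Python sketch that assembles one multi-turn sample and appends it to a JSONL file. The `{"conversation": [...]}` shape with `system`/`input`/`output` keys is an assumed Xtuner-style schema; check your fine-tuning config for the authoritative format.

```python
import json

def make_record(system, turns):
    """Assemble one multi-turn sample; the conversation schema with
    system/input/output keys is an assumed Xtuner-style format."""
    conversation = []
    for i, (user, assistant) in enumerate(turns):
        turn = {"input": user, "output": assistant}
        if i == 0:
            turn["system"] = system  # system prompt goes on the first turn only
        conversation.append(turn)
    return {"conversation": conversation}

record = make_record(
    "You are BoostBot, a master of sincere compliments.",
    [
        ("I failed my exam today...", "One setback only proves how brave you are to keep trying!"),
        ("Do you really think so?", "Absolutely -- your persistence is already a victory."),
    ],
)

# Append as one line of JSONL, the format the generation scripts produce
with open("multi_conversation.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(record, ensure_ascii=False) + "\n")
```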

LLM Usage Guide

$\quad$ The large language model is the core component of this project. We selected the open-source InternLM2.5 series as the base model. InternLM2.5 has strong understanding and generation capabilities, supports an 8K context window, and has good Chinese language understanding capabilities. In actual deployment, we use a 4bit quantized version to reduce resource consumption while maintaining model performance. For detailed deployment and usage methods, please refer to the LLM_Usage document.

$\quad$ In this project, the LLM is primarily responsible for understanding user input and generating responses. It also needs to handle multi-modal input and coordinate with ASR and TTS modules.
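
$\quad$ For reference, chat models in the InternLM2 family use a ChatML-style prompt layout; the sketch below shows how such a prompt can be assembled by hand. The special-token names here are assumptions — in practice, prefer `tokenizer.apply_chat_template`, which reads the authoritative template from the model's own files.

```python
def build_prompt(system, history, user_msg):
    """Assemble a ChatML-style chat prompt; the <|im_start|>/<|im_end|>
    token names are assumptions -- check the model's chat template."""
    parts = [f"<|im_start|>system\n{system}<|im_end|>\n"]
    for question, answer in history:
        parts.append(f"<|im_start|>user\n{question}<|im_end|>\n")
        parts.append(f"<|im_start|>assistant\n{answer}<|im_end|>\n")
    parts.append(f"<|im_start|>user\n{user_msg}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")  # the model continues from here
    return "".join(parts)

prompt = build_prompt("You are BanterBot, master of sharp retorts.",
                      [("Hi there", "Oh, it's you again.")],
                      "Roast my procrastination.")
```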

ASR Usage Guide

$\quad$ The speech recognition module uses the open-source SenseVoice model, which has excellent multilingual speech recognition capabilities. The model supports multiple languages, including Chinese and English, with high accuracy and good ability to handle background noise. For specific deployment and usage instructions, please check the ASR_Usage document.

$\quad$ In practical applications, the ASR module is responsible for converting user voice input into text and passing it to the LLM for processing. We provide a streaming recognition interface and support real-time speech transcription.
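
$\quad$ Streaming boils down to feeding the recognizer fixed-length audio chunks instead of one long clip. The snippet below sketches only the client-side chunking (the 600 ms window is an illustrative choice, not a SenseVoice requirement); the model itself is loaded separately, e.g. via funasr.

```python
def stream_chunks(samples, chunk_ms=600, sample_rate=16000):
    """Split a sample buffer into fixed-length chunks for streaming
    recognition; chunk_ms is an illustrative window size."""
    chunk_len = sample_rate * chunk_ms // 1000
    for start in range(0, len(samples), chunk_len):
        yield samples[start:start + chunk_len]

# Two seconds of 16 kHz audio (silence here) -> 600 ms chunks
audio = [0.0] * 32000
chunks = list(stream_chunks(audio))
# Each chunk would be passed to the recognizer as it arrives
```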

TTS Usage Guide

$\quad$ The speech synthesis module uses the open-source ChatTTS model, which can generate natural and smooth speech. We use a bilingual model for Chinese and English, supporting multi-speaker synthesis and adjustable parameters like speed and timbre. For detailed configuration and usage methods, please refer to the TTS_Usage document.

$\quad$ The TTS module is mainly responsible for converting text generated by the LLM into speech output, supporting batch synthesis mode. We also provide an emotion control interface that can automatically adjust the tone and pitch according to the text content for more natural output.
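
$\quad$ Batch synthesis amounts to grouping short texts so that each synthesis call handles several at once. The following is a minimal greedy packer; the character budget is an illustrative parameter, not a ChatTTS limit.

```python
def batch_texts(texts, max_chars=120):
    """Greedily pack texts into batches so one synthesis call covers
    several short sentences; max_chars is an illustrative budget."""
    batches, current, used = [], [], 0
    for text in texts:
        if current and used + len(text) > max_chars:
            batches.append(current)  # budget exceeded: close this batch
            current, used = [], 0
        current.append(text)
        used += len(text)
    if current:
        batches.append(current)
    return batches

texts = [
    "Hello there!",
    "Welcome to FunGPT.",
    "This is a much longer passage that deserves a batch of its own " * 2,
]
batches = batch_texts(texts)  # the short texts share a batch; the long one is alone
```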

Model Fine-Tuning Guide

$\quad$ To meet the needs of specific scenarios, we provide a complete model fine-tuning process. We mainly use parameter-efficient fine-tuning methods like LoRA and QLoRA, which can be trained on consumer-grade graphics cards. The fine-tuning process uses the Xtuner tool, which offers friendly configuration templates and comprehensive training monitoring. For specific fine-tuning processes and parameter settings, please refer to the Xtuner_Usage document.

$\quad$ Fine-tuning supports various task types such as instruction alignment, multi-turn dialogue, and role-playing. We provide pre-processing scripts to convert data formats and also support incremental training for further optimization on existing models.
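
$\quad$ To see why LoRA/QLoRA fits on consumer-grade cards, count the trainable parameters: a rank-r adapter on a `d_in x d_out` linear layer trains only `r * (d_in + d_out)` weights while the full matrix stays frozen (QLoRA additionally stores the frozen weights in 4-bit). The layer size below is illustrative:

```python
def lora_params(d_in, d_out, rank):
    """Trainable weights a LoRA adapter adds to one linear layer:
    two low-rank factors, A (d_in x rank) and B (rank x d_out)."""
    return rank * (d_in + d_out)

# e.g. a 4096 x 4096 attention projection with rank 64
full_layer = 4096 * 4096               # 16,777,216 frozen weights
adapter = lora_params(4096, 4096, 64)  # 524,288 trainable weights
fraction = adapter / full_layer        # ~3.1% of the layer is trained
```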

Model Quantization Guide

$\quad$ Quantization is an important optimization method to deploy large models with limited computing resources. We use the LMDeploy tool for model quantization, supporting INT4 quantization to significantly reduce memory usage while maintaining model performance. For detailed quantization processes, refer to the LMDeploy_Usage document.

$\quad$ The quantization process supports weight quantization and activation value quantization, and provides tools to verify the accuracy of quantized models. LMDeploy also offers performance comparison data of different quantization strategies to help users choose the most suitable quantization plan.
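
$\quad$ The idea behind INT4 weight quantization can be seen in a toy symmetric per-tensor scheme: store small integers plus one float scale, cutting weight memory roughly 4x versus FP16. Real w4a16 schemes such as LMDeploy's AWQ quantize per-group with activation-aware scaling and keep activations in FP16, so treat this purely as an illustration.

```python
def quantize_int4(weights):
    """Toy symmetric per-tensor 4-bit quantization: map weights to
    integers in [-7, 7] plus one float scale."""
    scale = max(abs(w) for w in weights) / 7 or 1.0  # guard against an all-zero tensor
    return [round(w / scale) for w in weights], scale

def dequantize(quants, scale):
    return [q * scale for q in quants]

weights = [0.3, -0.7, 0.1, 0.56]
quants, scale = quantize_int4(weights)  # scale = 0.7 / 7 = 0.1
restored = dequantize(quants, scale)
# Per-weight reconstruction error is bounded by scale / 2
```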

🔮 Future Development Plans

    • Recording Project Video
    • Support GPT-Sovits
    • Support API access for large language models
    • Improve the data generation guide section
    • Enhance the large language model usage section
    • Refine the text-to-speech module introduction section
    • Improve the speech recognition usage section
    • Add Sensitive Words Module

🙏 Acknowledgments

Thanks to the following open-source tools and projects for their support:

| Tool | Description |
| --- | --- |
| InternLM-Tutorial | An active and open large model training camp |
| Xtuner | Tool for model training and fine-tuning |
| LMDeploy | Tool for model quantization and deployment |
| Streamlit | Tool for efficiently building AI applications |
| DeepSpeed | Tool for model training and inference acceleration |
| PyTorch | Widely used deep learning framework |

Related Projects

| Project | Description |
| --- | --- |
| InternLM | A series of advanced open-source large language models |
| ChatTTS | An open-source text-to-speech project |
| SenseVoice | An open-source speech recognition project by Alibaba |
| LangGPT | An open-source project about structured prompts |
| GangLLM | An open-source project about engaging in banter with users |
| Linly-Talker | An open-source project on artificial intelligence systems |
| Yanjie | An open-source project for enhancing English learning with an AI assistant |
| wulewule | An open-source project featuring an AI assistant themed on Black Myth: Wukong |
| ChatSensitiveWords | An open-source project for sensitive-word filtering |

Special Thanks

| Organization | Description |
| --- | --- |
| Shanghai Artificial Intelligence Laboratory | Thanks for the technical and platform support |

⚖️ Disclaimer

$\quad$ Thank you for your interest in and use of the FunGPT project. This project aims to provide users with an enjoyable and engaging interaction experience. Please be aware of the following points:

  • Research Purposes: The FunGPT project and its associated resources are intended solely for academic research purposes and are strictly prohibited from any commercial use. If any third-party code is involved, please adhere strictly to its respective open-source license.

  • Accuracy of Generated Content: Due to the influence of factors such as model algorithms, randomness, and quantization precision limitations, FunGPT cannot guarantee the accuracy or applicability of the generated content. Please exercise caution and independently assess the suitability of the content when using it.

  • Legal Responsibility: This project does not assume any responsibility for the legality of the model's output content and its consequences. Users should ensure that their behavior complies with relevant laws and regulations and are responsible for the outcomes of their use.

$\quad$ By using FunGPT, you indicate that you have accepted these disclaimer terms. We hope FunGPT can bring fun and vitality to your daily life. Thank you for your support and use.

Star History

Star History Chart
