Build an end-to-end chatbot with Transformer in TensorFlow 2. Checkout the my tutorial on blog.tensorflow.org.
- 8 Dec 2020: Updated support to TensorFlow 2.3.1 and TensorFlow Datasets 4.1.0
- 18 Jan 2020: Added notebook with Google Colab TPU support in TensorFlow 2.1.
- TensorFlow 2.4.0
- TensorFlow Datasets
- create new anaconda environment and initialize environment
chatbot
conda create -n chatbot python=3.8 conda activate chatbot
- run installation script
sh setup.sh
- Note: the script would install CUDA and cuDNN via conda if installing on a Linux system.
- We will use the conversations in movies and TV shows provided by Cornell Movie-Dialogs Corpus, which contains more than 220 thousands conversational exchanges between more than 10k pairs of movie characters, as our dataset.
- We pre-process our dataset in the following order:
- Extract
max_samples
conversation pairs into list ofquestions
andanswers
. - Pre-process each sentence by removing special characters in each sentence.
- Build tokenizer (map text to ID and ID to text) using TensorFlow Datasets SubwordTextEncoder.
- Tokenize each sentence and add
start_token
andend_token
to indicate the start and end of each sentence. - Filter out sentence that has more than
max_length
tokens. - Pad tokenized sentences to
max_length
- Extract
- Check dataset.py implementation.
- check model.py for the implementation of Multi-Headed Attention, Positional Encoding and Transformer.
- check all available flags and hyper-parameters
python main.py --help
python main.py --batch_size 256 --epochs 50 --max_samples 50000
input: where have you been?
output: i m not talking about that .
input: it's a trap!
output: no , it s not .