Abstractive Text Summarization

The Algorithm

Our implementation differs in that we fix the context and summary token embedding matrices.

- Both embedding matrices are initialised from GloVe.
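As a rough illustration of what fixing an embedding matrix can mean, the sketch below skips selected parameters during a plain SGD update. It assumes "fix" means "exclude from training". The names `sgd_step`, `context_emb`, `summary_emb`, and `output_w` are hypothetical and are not taken from this repository's code.

```python
def sgd_step(params, grads, frozen, lr=0.1):
    """Return updated parameters, leaving every name in `frozen` untouched."""
    updated = {}
    for name, row in params.items():
        if name in frozen:
            # Fixed parameter: keep its initial (e.g. GloVe) values.
            updated[name] = list(row)
        else:
            # Trainable parameter: ordinary gradient descent step.
            updated[name] = [w - lr * g for w, g in zip(row, grads[name])]
    return updated


# Toy one-row "matrices"; the names are illustrative only.
params = {
    "context_emb": [0.5, -0.2],
    "summary_emb": [0.1, 0.3],
    "output_w": [1.0, 2.0],
}
grads = {name: [1.0, 1.0] for name in params}
new_params = sgd_step(params, grads, frozen={"context_emb", "summary_emb"})
```

After the step, only `output_w` has moved; both embedding entries keep their initial values.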

Help text:

python3 main.py -h

The training datasets are under data/. Each JSON file contains three fields: title, full_text, and summary. The datasets are downloaded with the scripts in download_data/.
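A minimal sketch of reading such records follows. It assumes each file holds either a single JSON object or a JSON array of objects; the exact layout is not stated here, so check it against the real files. The helper name `parse_records` is hypothetical.

```python
import json

REQUIRED_FIELDS = ("title", "full_text", "summary")


def parse_records(text):
    """Parse JSON text and return the records that carry all three fields."""
    data = json.loads(text)
    # Assumption: a file is either one object or a list of objects.
    records = data if isinstance(data, list) else [data]
    return [
        r for r in records
        if isinstance(r, dict) and all(f in r for f in REQUIRED_FIELDS)
    ]


# In-memory stand-in for the contents of one file under data/.
sample = json.dumps([
    {"title": "t1", "full_text": "a long article body", "summary": "short"},
    {"title": "t2", "full_text": "missing its summary"},
])
records = parse_records(sample)
```

Records missing any of the three fields are dropped rather than raising, which keeps a partially downloaded dataset usable.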

GloVe data needs to be downloaded and unzipped under glove/. By default, the code uses the 10k most frequent tokens. To generate the embeddings file for them:

cd glove
head -n 10000 glove.6B.300d.txt >glove.10k.300d.txt
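The truncated file keeps the GloVe text format: one token per line, followed by its space-separated vector components. The sketch below shows one way to read it into a vocabulary and a list of vectors. `load_glove` is a hypothetical helper, not necessarily what embeddings.py does.

```python
import io


def load_glove(lines, limit=10000):
    """Read up to `limit` GloVe lines into a vocabulary and a list of vectors."""
    vocab, vectors = {}, []
    for line in lines:
        if len(vectors) >= limit:
            break
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vocab[parts[0]] = len(vectors)  # token -> row index
        vectors.append([float(x) for x in parts[1:]])
    return vocab, vectors


# Tiny in-memory stand-in; real use would pass
# open("glove/glove.10k.300d.txt", encoding="utf-8") instead.
sample = io.StringIO("the 0.1 0.2 0.3\n, 0.4 0.5 0.6\nof 0.7 0.8 0.9\n")
vocab, vectors = load_glove(sample, limit=2)
```

The row index stored in `vocab` doubles as the token id, so the vector list can be used directly as the initial embedding matrix.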


Ganeshpadmanaban/Neural-Attention-Model-Abstractive-Summarization
