Extending Decoders with an Integrated Encoder
This repo holds the code for training encoders that embed the final hidden state of large decoder models. To our knowledge, CoTrEnD is the first architecture to leverage a contrastive loss to train an encoder from a decoder. It was developed as part of the 24-hour Meta LLAMA-3 hackathon in May 2024 by Abhishek Singh, Arthur Böök, and Wian Stipp.
The motivation behind the CoTrEnD project is to utilize the rich hidden states that are generated within large decoders. Rather than separating the embedder from the decoder, as one typically would in a RAG approach, CoTrEnD integrates the encoder on top of the decoder. This allows the encoder to leverage the semantic information already captured within the decoder's hidden states.
The CoTrEnD architecture is a simple extension of a decoder-only model: an encoder head trained to embed the final hidden state of the decoder. Training uses a contrastive loss, which pulls the embeddings of similar inputs together and pushes the embeddings of dissimilar inputs apart.
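A minimal sketch of this idea in PyTorch follows. The class name, the mean-pooling choice, and the in-batch-negatives (InfoNCE-style) loss are illustrative assumptions, not the repo's actual API:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CoTrEnDEncoder(nn.Module):
    """Illustrative encoder head on top of a decoder's final hidden states.

    The decoder itself stays fixed; only this head is trained.
    """

    def __init__(self, hidden_dim: int, embed_dim: int):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim),
            nn.GELU(),
            nn.Linear(hidden_dim, embed_dim),
        )

    def forward(self, last_hidden: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
        # last_hidden: (batch, seq, hidden); mask: (batch, seq) with 1 for real tokens.
        # Mean-pool the final-layer hidden states over non-padding tokens.
        mask = mask.unsqueeze(-1).float()
        pooled = (last_hidden * mask).sum(1) / mask.sum(1).clamp(min=1e-6)
        return F.normalize(self.proj(pooled), dim=-1)

def contrastive_loss(query_emb: torch.Tensor, doc_emb: torch.Tensor,
                     temperature: float = 0.05) -> torch.Tensor:
    """InfoNCE-style loss: matching (query, document) pairs are pulled
    together; the other documents in the batch act as negatives."""
    logits = query_emb @ doc_emb.T / temperature
    labels = torch.arange(logits.size(0), device=logits.device)
    return F.cross_entropy(logits, labels)
```

With this setup, row i of `query_emb` and row i of `doc_emb` form a positive pair, so the loss is simply cross-entropy over the batch-similarity matrix.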
The CoTrEnD project includes a user interface that allows users to interact with the model. The user interface is built using Streamlit with two modes of operation, sketched below:

- Question answering: the user can ask anything in the question field, and the CoTrEnD model will run an embedding search over the vectorstore to augment the generated answer.
- Entity lookup: the user can enter a medical entity in the entity field, and the CoTrEnD model will return the most similar document from the vectorstore.
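A hypothetical sketch of how such a two-mode Streamlit app could be wired up; `embed`, `search_vectorstore`, and `generate_answer` are placeholder hooks, not the repo's actual functions:

```python
import streamlit as st

# Placeholder hooks; a real app would wire these to the CoTrEnD
# encoder, its vectorstore, and the decoder (names are hypothetical).
def embed(text: str):
    raise NotImplementedError("plug in the CoTrEnD encoder here")

def search_vectorstore(vector, k: int):
    raise NotImplementedError("plug in the vectorstore here")

def generate_answer(question: str, docs):
    raise NotImplementedError("plug in the decoder here")

mode = st.sidebar.radio("Mode", ["Question answering", "Entity lookup"])

if mode == "Question answering":
    question = st.text_input("Question")
    if question:
        # Embed the question, retrieve supporting documents, and
        # condition the generated answer on the retrieved context.
        docs = search_vectorstore(embed(question), k=3)
        st.write(generate_answer(question, docs))
else:
    entity = st.text_input("Entity")
    if entity:
        # Return the single most similar document to the entity embedding.
        best = search_vectorstore(embed(entity), k=1)[0]
        st.write(best)
```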