Use GatortronGPT for Retrieval Augmented Generation
- check MTEB leader board for the latest rank https://huggingface.co/spaces/mteb/leaderboard
- models for testing
- sionic-ai/sionic-ai-v2 (TBD)
- BAAI/bge-base-en-v1.5
- Instructor-large
- TODO: train gatortron BERT to perform embedding task https://medium.com/llamaindex-blog/fine-tuning-embeddings-for-rag-with-synthetic-data-e534409a3971
- current elasticsearch-py release has several features not supported (8.8 release)
- To run most advanced feature we recommand install elasticsearch-py from github main branch as
pip install git+https://github.com/elastic/elasticsearch-py.git
https://www.elastic.co/guide/en/elasticsearch/reference/current/docker.html https://hub.docker.com/_/elasticsearch
- install docker on your local (check docker website)
- get elasticsearch images
docker pull elasticsearch:8.10.2
- create a new elasticsearch single node network
docker network create elastic
- then we can create a tmux env to run docker elasticsearch service
- inside tmux
docker run --name es-node01 --net elastic -p 9200:9200 -p 9300:9300 -t elasticsearch:8.10.2
- note: you need to wait until the service is up where there will be a password generated, you have to save it in a separate file which we need to use later for access via python client API
- use a separate tmux env
docker pull docker.elastic.co/kibana/kibana:8.10.2
docker run --name kib-01 --net elastic -p 5601:5601 docker.elastic.co/kibana/kibana:8.10.2
- see exmaple: notebook/demo_hybrid_search_elasticsearch.ipynb