Do you have any plan to expand the scope to CPU? #1118
mengniwang95 started this conversation in General
Replies: 1 comment
-
Any updates on this? I'm looking for one-click installers to connect multiple LLM models for testing on an Intel Iris Xe GPU in a Lenovo i7 laptop; it seems impossible to get anything running at even 50% of what NVIDIA hardware can handle. I just got this machine as a gift, otherwise I would replace it. Thanks in advance
-
Hi, I find this repo mainly focuses on LLM inference on GPUs currently. Do you have any plans to expand the scope to CPUs?
Our team develops Intel® Extension for Transformers, an innovative toolkit to accelerate Transformer-based models on Intel platforms, particularly effective on 4th Gen Intel® Xeon® Scalable processors (codenamed Sapphire Rapids). The toolkit provides the following key features:
Seamless user experience for model compression (including RTN, AWQ, GPTQ, bitsandbytes, and more of our own algorithms for weight-only quantization in the future) on Transformer-based models by extending Hugging Face transformers APIs and leveraging Intel® Neural Compressor
Advanced software optimizations and unique compression-aware runtime.
Optimized Transformer-based model packages.
NeuralChat, a customizable chatbot framework to create your own chatbot within minutes by leveraging a rich set of plugins and SOTA optimizations.
Inference of Large Language Model (LLM) in pure C/C++ with weight-only quantization kernels.
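For readers unfamiliar with the first item above, here is a minimal, illustrative sketch of what RTN (round-to-nearest) weight-only quantization does, using plain NumPy with per-group scales. The helper names are hypothetical and this is not the toolkit's actual API; the real implementations (via Intel® Neural Compressor) add many refinements beyond this.

```python
# Illustrative RTN weight-only quantization sketch (hypothetical helpers,
# NOT the Intel Extension for Transformers API): weights are rounded to
# signed low-bit integers with one floating-point scale per group.
import numpy as np

def rtn_quantize(weights: np.ndarray, n_bits: int = 4, group_size: int = 32):
    """Quantize a 2-D weight matrix to n_bits integers with per-group scales."""
    qmax = 2 ** (n_bits - 1) - 1                     # e.g. 7 for 4-bit
    rows, cols = weights.shape
    assert cols % group_size == 0, "columns must divide evenly into groups"
    grouped = weights.reshape(rows, cols // group_size, group_size)
    scales = np.abs(grouped).max(axis=-1, keepdims=True) / qmax
    scales = np.where(scales == 0, 1.0, scales)      # guard all-zero groups
    q = np.clip(np.round(grouped / scales), -qmax - 1, qmax).astype(np.int8)
    return q, scales

def rtn_dequantize(q: np.ndarray, scales: np.ndarray) -> np.ndarray:
    """Reconstruct an approximate float matrix from integers and scales."""
    rows, n_groups, group_size = q.shape
    return (q.astype(np.float32) * scales).reshape(rows, n_groups * group_size)

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 64)).astype(np.float32)
q, s = rtn_quantize(w, n_bits=4, group_size=32)
w_hat = rtn_dequantize(q, s)
print("max reconstruction error:", np.abs(w - w_hat).max())
```

The design point this sketch shows is the memory/accuracy trade: only the int4 codes and a small number of scales need to be stored, while activations and the matmul stay in floating point, which is what makes weight-only quantization attractive for LLM inference.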
We sincerely want to contribute to the LLM ecosystem, and TGI is a really popular project. So, is there any chance to integrate part of our work into TGI?
Thanks