
Add Support for Embeddings to Reduce Token Usage And reduce the risk of abnormal stop. #246

Open
Rain-Of-Stars opened this issue Dec 30, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@Rain-Of-Stars

Which API Provider are you using?

OpenAI Compatible

Which Model are you using?

deepseek

What happened?

I propose adding a feature that uses an embedding system to reduce the number of tokens sent in each request. This could help manage and optimize the daily token budget. Leveraging embeddings may maintain or even enhance functionality while reducing token usage and cost.
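One possible shape for this (a minimal sketch, not an existing feature of this project): embed each past message once, then on every new request select only the few messages most similar to the current query instead of resending the whole conversation. The `embed` function below is a toy bag-of-words hash standing in for a real embedding model (a real implementation would call an embeddings endpoint such as an OpenAI-compatible `/v1/embeddings`); all names here are illustrative.

```python
import hashlib
import math
import re
from collections import Counter

def embed(text: str, dim: int = 256) -> list[float]:
    # Toy stand-in embedding: hash each word into a fixed-size,
    # unit-normalized vector. A real implementation would call an
    # embedding model instead.
    vec = [0.0] * dim
    for word, count in Counter(re.findall(r"\w+", text.lower())).items():
        idx = int.from_bytes(hashlib.md5(word.encode()).digest()[:4], "big") % dim
        vec[idx] += count
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(a: list[float], b: list[float]) -> float:
    # Vectors are already unit-normalized, so the dot product is the cosine.
    return sum(x * y for x, y in zip(a, b))

def select_context(history: list[str], query: str, top_k: int = 2) -> list[str]:
    # Send only the top_k most relevant past messages with the new query,
    # instead of resending the entire conversation every round.
    q = embed(query)
    return sorted(history, key=lambda m: cosine(embed(m), q), reverse=True)[:top_k]

history = [
    "We discussed the database schema for users.",
    "Here is a recipe for banana bread.",
    "The users table needs an index on email.",
]
print(select_context(history, "How should we index the users table?"))
```

With this approach the prompt size stays roughly constant as the conversation grows, which would also reduce the risk of hitting the context limit mid-response (the "abnormal stop" in the title).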

Steps to reproduce

Currently, context consumption is too large; after multiple rounds of dialogue, the conversation overflows the token limit.

Relevant API REQUEST output

No response

Additional context

No response

@Rain-Of-Stars Rain-Of-Stars added the bug Something isn't working label Dec 30, 2024
@Rain-Of-Stars Rain-Of-Stars changed the title Add Support for Embeddings to Reduce Token Usage Add Support for Embeddings to Reduce Token Usage And reduce the risk of abnormal stop. Dec 30, 2024
@mrubens
Collaborator

mrubens commented Jan 9, 2025

@Rain-Of-Stars Sounds great - any thoughts on the best way to do it?
