Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Simple Agentic framework with batch generation #2830

Open
August-murr opened this issue Feb 11, 2025 · 0 comments
Open

Simple Agentic framework with batch generation #2830

August-murr opened this issue Feb 11, 2025 · 0 comments
Labels
✨ enhancement New feature or request 🏋 GRPO Related to GRPO

Comments

@August-murr
Copy link
Collaborator

we need a simple agentic framework that can handle tool calls, run them, and then provide the output back in the chat for follow-up responses, especially focusing on Batch generations to maximize GPU utility.

For the initial version, we'll use a Transformers model, and later on, we'll switch to a VLLM version for better efficiency.

I’ve been avoiding Langchain and smolagents since they rely heavily on pre-written prompts, which overcomplicates and limits training, plus they aren’t designed for batch generation.

If there’s a way to achieve this with existing libraries like Langchain or smolagents, I’d love to hear your thoughts!

@github-actions github-actions bot added the ✨ enhancement New feature or request label Feb 11, 2025
@August-murr August-murr added the 🏋 GRPO Related to GRPO label Feb 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
✨ enhancement New feature or request 🏋 GRPO Related to GRPO
Projects
None yet
Development

No branches or pull requests

1 participant