Skip to content

v2.6.11

Latest
Compare
Choose a tag to compare
@DefTruth DefTruth released this 31 Jan 06:54
d7914c0

What's Changed

  • add MiniMax-01 in Trending LLM/VLM Topics and Long Context Attention by @shaoyuyoung in #112
  • [feat] add deepseek-r1 by @shaoyuyoung in #113
  • 🔥🔥[DistServe] DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving by @DefTruth in #114
  • 🔥🔥[KVDirect] KVDirect: Distributed Disaggregated LLM Inference by @DefTruth in #115
  • 🔥🔥[DeServe] DESERVE: TOWARDS AFFORDABLE OFFLINE LLM INFERENCE VIA DECENTRALIZATION by @DefTruth in #116
  • 🔥🔥[Mooncake] Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving by @DefTruth in #117

New Contributors

Full Changelog: v2.6.10...v2.6.11