What's Changed
- add
MiniMax-01
in Trending LLM/VLM Topics and Long Context Attention by @shaoyuyoung in #112 - [feat] add deepseek-r1 by @shaoyuyoung in #113
- 🔥🔥[DistServe] DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving by @DefTruth in #114
- 🔥🔥[KVDirect] KVDirect: Distributed Disaggregated LLM Inference by @DefTruth in #115
- 🔥🔥[DeServe] DESERVE: TOWARDS AFFORDABLE OFFLINE LLM INFERENCE VIA DECENTRALIZATION by @DefTruth in #116
- 🔥🔥[Mooncake] Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving by @DefTruth in #117
New Contributors
- @shaoyuyoung made their first contribution in #112
Full Changelog: v2.6.10...v2.6.11