Pull requests: nod-ai/shark-ai

Pull requests list

Refactor the SystemManager across shortfin apps
#1012 opened Feb 27, 2025 by KyleHerndon
Shortfin llm beam search
#1011 opened Feb 27, 2025 by stbaione Draft
Bump IREE requirement pins to 3.3.0rc20250227
#1003 opened Feb 25, 2025 by shark-pr-automator bot
Sharded integration tests
#995 opened Feb 24, 2025 by renxida Draft
Test CI
#981 opened Feb 19, 2025 by stellaraccident Draft
Speed-up all_gather by reducing buffer copies.
#979 opened Feb 18, 2025 by Alex-Vasile
Add metrics to sdxl phases (#948)
#961 opened Feb 12, 2025 by daveliddell
Update random generation for fp8 quantization
#945 opened Feb 10, 2025 by rsuderman
Add sharding support for latent attention block
#935 opened Feb 7, 2025 by rsuderman
Shard input_mask for Llama
#905 opened Feb 3, 2025 by stbaione Draft
[Llama] Kv cache new layout
#858 opened Jan 22, 2025 by Groverkss Draft
Added experimental mm padding for cache behavior
#825 opened Jan 14, 2025 by rsuderman