-
Notifications
You must be signed in to change notification settings - Fork 36
Pull requests: nod-ai/shark-ai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Make shortfin LLM integration tests run on matrix: cpu, gpu
#890
opened Jan 30, 2025 by
renxida
Loading…
[sharktank][llama] enable quark parity test on mi300x
#886
opened Jan 29, 2025 by
dan-garvey
Loading…
Bump IREE requirement pins to their latest versions.
#879
opened Jan 29, 2025 by
shark-pr-automator
bot
Loading…
[Llama] Do not allow configurable partitions for KVCache
#856
opened Jan 21, 2025 by
Groverkss
Loading…
Bump iree dependencies forward to include barrier changes
#834
opened Jan 16, 2025 by
rsuderman
Loading…
Fixed mixed precision contraction semantics for mmt_block_scaled_offset_q4_unsigned
#720
opened Dec 20, 2024 by
Groverkss
Loading…
Enable tokenizers in shortfin packages on Linux x86_64.
#688
opened Dec 12, 2024 by
ScottTodd
Loading…
Expanded sharded support for alternative sharding mechanisms
#680
opened Dec 12, 2024 by
rsuderman
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.