MPS support #48

BlackSamorez · 2024-03-10T16:17:17Z

No description provided.

BlackSamorez · 2024-04-21T14:10:00Z

Note to self: matmul doesn't work for more than one vector at a time for some reason

BuildBackBuehler · 2024-05-09T02:52:26Z

Yessss!!! I'm excited about this. I was just looking into what steps would entail. I'm too green (not enough exp./knowledge) to know the ins and outs of building from source/the granular ala C++...but I'm wondering how much could be lifted from other MPS backends.

TVM has MPS support and its own matmul definition. I'd also prefer/suggest if possible as little Torch as possible. Tends to be slow. TVM has its own nn, ops and most of the important formulae like Conv1 + 2 + 3D.

Have you been able to install aqlm[gpu,cpu] or just aqlm[cpu]? Right now Triton's extra module "GPU" is holding me back from the whole shebang. So I was figuring if I can, work on building out the Triton MPS backend. It has support for adding your own backend, but once again, no clue in hell what I'm doing.

Personally, depending on what is gained/lossed, I'll probably just use the aqlm models with MLC-LLM but that also requires some additional mods to get it working.

BlackSamorez added 7 commits March 10, 2024 14:03

1.1.2dev

ee0dd7e

mps kernel init

2cf6443

prob_m/k

8c2ce5b

Minor fixes and routing

e496726

Basic impl

9828c93

float impl (for testing)

5bf50ff

uint16_t

ba0832d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MPS support #48

MPS support #48

BlackSamorez commented Mar 10, 2024

BlackSamorez commented Apr 21, 2024

BuildBackBuehler commented May 9, 2024

MPS support #48

Are you sure you want to change the base?

MPS support #48

Conversation

BlackSamorez commented Mar 10, 2024

BlackSamorez commented Apr 21, 2024

BuildBackBuehler commented May 9, 2024