Could you add back the earlier benchmark showing the requirements for running Intern with 1M context sizes, etc.? It would help in understanding how much VRAM and RAM other models need.
I was wondering about pairing a powerful 32 GB 5090 GPU with a Dell R930 that has 1.5 TB of RAM. Does the RAM type (DDR3 vs. DDR4) matter for inference speed?
How fast or slow is inference in this kind of configuration?