Repositories list
61 repositories
nm-vllm-certs (Public)
vllm-flash-attention (Public)
vllm (Public)
compressed-tensors (Public)
yolov5 (Public)
upstream-transformers (Public)
axolotl (Public)
nm-actions (Public)
guidellm (Public)
quant_kernel_benchmarks (Public)
lm-evaluation-harness (Public)
docs (Public)
flash-attention (Public)
mistral-evals (Public)
evalplus (Public)
graphs (Public)
temp-llm-compressor (Public)
alpaca_eval (Public)
upstream-llm-foundry (Public)
nm-vllm (Public archive)
mteb (Public)
transformers (Public)
AutoFP8 (Public)
OmniQuant (Public)
temp-AutoGPTQ (Public)
upstream-composer (Public)
MixEval (Public)
mamba (Public)
causal-conv1d (Public)
sparseml (Public): Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models