Skip to content
@usyd-fsalab

FSA

Popular repositories Loading

  1. fp6_llm fp6_llm Public

    An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

    Cuda 211 15

  2. NeuralNetworkRandomness NeuralNetworkRandomness Public

    Python 14

  3. ReadingList ReadingList Public

    12

  4. FSA FSA Public

    Webpage for FSA

    HTML 2

  5. flash-llm flash-llm Public

    Forked from AlibabaResearch/flash-llm

    Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

    Cuda 2

  6. ConferenceTalk ConferenceTalk Public

    Conference talks given by FSA Lab, University of Sydney

    1

Repositories

Showing 7 of 7 repositories
  • fp6_llm Public

    An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

    usyd-fsalab/fp6_llm’s past year of commit activity
    Cuda 211 Apache-2.0 15 2 0 Updated Oct 28, 2024
  • blog Public Forked from huggingface/blog

    Public repo for HF blog posts

    usyd-fsalab/blog’s past year of commit activity
    Jupyter Notebook 0 765 0 0 Updated Oct 25, 2023
  • FSA Public

    Webpage for FSA

    usyd-fsalab/FSA’s past year of commit activity
    HTML 2 0 0 0 Updated Oct 3, 2023
  • flash-llm Public Forked from AlibabaResearch/flash-llm

    Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

    usyd-fsalab/flash-llm’s past year of commit activity
    Cuda 2 Apache-2.0 16 0 0 Updated Sep 24, 2023
  • usyd-fsalab/ReadingList’s past year of commit activity
    12 0 0 0 Updated Apr 27, 2022
  • usyd-fsalab/NeuralNetworkRandomness’s past year of commit activity
    Python 14 MIT 0 0 0 Updated Mar 18, 2022
  • ConferenceTalk Public

    Conference talks given by FSA Lab, University of Sydney

    usyd-fsalab/ConferenceTalk’s past year of commit activity
    1 0 0 0 Updated Jul 28, 2021

Top languages

Loading…

Most used topics

Loading…