Skip to content
@ModelTC

ModelTC

Model Infra

Pinned Loading

  1. MQBench MQBench Public

    Model Quantization Benchmark

    Shell 770 140

  2. United-Perception United-Perception Public

    United Perception

    Python 429 65

  3. NNLQP NNLQP Public

    Python 34 3

  4. Dipoorlet Dipoorlet Public

    Offline Quantization Tools for Deploy.

    Python 116 16

  5. lightllm lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python 2.7k 215

Repositories

Showing 10 of 40 repositories
  • llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    ModelTC/llmc’s past year of commit activity
    Python 341 Apache-2.0 37 5 0 Updated Dec 11, 2024
  • quant_horizon Public
    ModelTC/quant_horizon’s past year of commit activity
    Cuda 5 Apache-2.0 2 0 0 Updated Dec 11, 2024
  • lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    ModelTC/lightllm’s past year of commit activity
    Python 2,674 Apache-2.0 215 63 9 Updated Dec 11, 2024
  • general-sam-py Public

    Python bindings for general-sam and some utilities

    ModelTC/general-sam-py’s past year of commit activity
    Python 3 Apache-2.0 0 0 2 Updated Dec 10, 2024
  • mtc-token-healing Public

    Token healing implementation in Rust

    ModelTC/mtc-token-healing’s past year of commit activity
    Rust 3 Apache-2.0 0 0 4 Updated Dec 9, 2024
  • general-sam Public

    A general suffix automaton implementation in Rust with Python bindings

    ModelTC/general-sam’s past year of commit activity
    Rust 4 Apache-2.0 0 0 1 Updated Oct 18, 2024
  • EasyLLM Public

    Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.

    ModelTC/EasyLLM’s past year of commit activity
    Python 42 Apache-2.0 7 0 0 Updated Sep 18, 2024
  • DeepSpeed Public Forked from microsoft/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    ModelTC/DeepSpeed’s past year of commit activity
    Python 0 Apache-2.0 4,324 0 0 Updated Sep 13, 2024
  • opencompass Public Forked from open-compass/opencompass

    OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

    ModelTC/opencompass’s past year of commit activity
    Python 1 Apache-2.0 454 0 0 Updated Sep 6, 2024
  • xtuner Public Forked from InternLM/xtuner

    An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

    ModelTC/xtuner’s past year of commit activity
    Python 0 Apache-2.0 319 0 0 Updated Aug 22, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.