Skip to content
@SqueezeAILab

SqueezeAILab

SqueezeAI is part of Berkeley AI Research Lab at UC Berkeley focused on AI Systems research.

Popular repositories Loading

  1. LLMCompiler LLMCompiler Public

    [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    Python 1.6k 120

  2. SqueezeLLM SqueezeLLM Public

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Python 677 43

  3. TinyAgent TinyAgent Public

    [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!

    Python 365 57

  4. KVQuant KVQuant Public

    [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

    Python 332 30

  5. LLM2LLM LLM2LLM Public

    [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

    Python 175 12

  6. SqueezedAttention SqueezedAttention Public

    SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference

    Python 41 4

Repositories

Showing 9 of 9 repositories
  • ETS Public

    ETS: Efficient Tree Search for Inference-Time Scaling

    SqueezeAILab/ETS’s past year of commit activity
    0 0 1 0 Updated Feb 19, 2025
  • SqueezedAttention Public

    SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference

    SqueezeAILab/SqueezedAttention’s past year of commit activity
    Python 41 4 2 0 Updated Nov 20, 2024
  • Tool2Vec Public

    Efficient and Scalable Estimation of Tool Representations in Vector Space

    SqueezeAILab/Tool2Vec’s past year of commit activity
    Python 18 MIT 0 1 0 Updated Sep 5, 2024
  • TinyAgent Public

    [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!

    SqueezeAILab/TinyAgent’s past year of commit activity
    Python 365 MIT 57 7 1 Updated Sep 4, 2024
  • KVQuant Public

    [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

    SqueezeAILab/KVQuant’s past year of commit activity
    Python 332 30 14 1 Updated Aug 13, 2024
  • SqueezeLLM Public

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    SqueezeAILab/SqueezeLLM’s past year of commit activity
    Python 677 MIT 43 16 3 Updated Aug 13, 2024
  • LLMCompiler Public

    [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    SqueezeAILab/LLMCompiler’s past year of commit activity
    Python 1,612 MIT 120 4 1 Updated Jul 10, 2024
  • LLM2LLM Public

    [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

    SqueezeAILab/LLM2LLM’s past year of commit activity
    Python 175 MIT 12 2 0 Updated Mar 25, 2024
  • open_source_projects Public

    Open Source Projects from Pallas Lab

    SqueezeAILab/open_source_projects’s past year of commit activity
    20 MIT 2 0 0 Updated Oct 10, 2021

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Python