Skip to content
View tairenpiao's full-sized avatar

Organizations

@nota-github @Nota-NetsPresso

Block or report tairenpiao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Visual Studio Code

TypeScript 168,015 30,785 Updated Mar 3, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 69,411 7,459 Updated Mar 3, 2025

Brevitas: neural network quantization in PyTorch

Python 1,264 206 Updated Feb 28, 2025

Advanced Quantization Algorithm for LLMs/VLMs.

Python 380 30 Updated Mar 3, 2025

A pytorch quantization backend for optimum

Python 892 70 Updated Mar 3, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,723 10,709 Updated Mar 3, 2025

A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deploym…

Python 756 55 Updated Mar 3, 2025

The official Meta Llama 3 GitHub site

Python 28,427 3,296 Updated Jan 26, 2025

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

Python 68 7 Updated Feb 12, 2025

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 26,347 3,344 Updated Dec 30, 2024

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

JavaScript 1,437 176 Updated Feb 25, 2025

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,823 3,074 Updated Mar 3, 2025

Simplify your onnx model

C++ 3,993 389 Updated Sep 3, 2024

a fast, scalable, multi-language and extensible build system

Java 23,725 4,143 Updated Mar 3, 2025

The Web framework for perfectionists with deadlines.

Python 82,560 32,319 Updated Mar 3, 2025

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,234 395 Updated Mar 3, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,089 1,675 Updated Mar 3, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,587 1,125 Updated Mar 3, 2025

Simple tool for partial optimization of ONNX. Further optimize some models that cannot be optimized with onnx-optimizer and onnxsim by several tens of percent. In particular, models containing Eins…

Python 19 Updated May 7, 2024

Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-t…

Python 756 75 Updated Feb 17, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,045 4,209 Updated Mar 3, 2025

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,532 2,854 Updated Mar 2, 2025

Empowering everyone to build reliable and efficient software.

Rust 101,646 13,158 Updated Mar 3, 2025

An Open Source Machine Learning Framework for Everyone

C++ 188,350 74,560 Updated Mar 3, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,062 3,521 Updated Mar 3, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 6,929 1,133 Updated Feb 28, 2025

Transformer related optimization, including BERT, GPT

C++ 6,056 900 Updated Mar 27, 2024

Development repository for the Triton language and compiler

MLIR 14,681 1,829 Updated Mar 3, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,516 758 Updated Aug 12, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,119 5,797 Updated Sep 18, 2024
Next