Stars
📚FFPA(Split-D): Yet another Faster Flash Prefill Attention with O(1) GPU SRAM complexity for headdim > 256, ~2x↑🎉vs SDPA EA.
DeepEP: an efficient expert-parallel communication library
thunlp / Seq1F1B
Forked from NVIDIA/Megatron-LM. Sequence-level 1F1B schedule for LLMs.
Zotero completion source for nvim-cmp using zotcite as backend.
Efficient Training (including pre-training and fine-tuning) for Big Models
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Zero Bubble Pipeline Parallelism
MayDomine / Seq1F1B
Forked from NVIDIA/Megatron-LM. Sequence-level 1F1B schedule for LLMs.
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
Automated authentication script for the BUPT (Beijing University of Posts and Telecommunications) campus network gateway. Supports wired and wireless networks, Portal authentication with parameters, AC redirect, and automatic reconnection after dropouts. Cross-platform. BUPT Network Login.
Development repository for the Triton language and compiler
Tool Learning for Big Models, Open-Source Solutions for ChatGPT-Plugins
MayDomine / flash-attention
Forked from Dao-AILab/flash-attention. Add attention mask for flash attention.
MayDomine / BMTrain
Forked from OpenBMB/BMTrain. Efficient Training (including pre-training and fine-tuning) for Big Models.
Elegant and Powerful. Powered by OpenAI and Vercel.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Real Transformer TeraFLOPS on various GPUs