Skip to content
View MayDomine's full-sized avatar

Block or report MayDomine

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FR-Spec: Frequency-Ranked Speculative Sampling

C++ 12 1 Updated Mar 20, 2025

📚FFPA(Split-D): Yet another Faster Flash Prefill Attention with O(1) GPU SRAM complexity for headdim > 256, ~2x↑🎉vs SDPA EA.

Cuda 156 7 Updated Mar 25, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,314 677 Updated Mar 27, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,379 810 Updated Mar 1, 2025

🚀 Resume Builder 在线简历生成工具

TypeScript 1,071 98 Updated Feb 27, 2025

Sequence-level 1F1B schedule for LLMs.

Python 16 1 Updated Dec 24, 2024

Zotero completion source for nvim-cmp using zotcite as backend.

Lua 24 2 Updated Jan 27, 2025

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 579 80 Updated Jul 22, 2024

🍿 A collection of QoL plugins for Neovim

Lua 4,324 195 Updated Mar 1, 2025

jellyfin电影元数据插件

C# 1,664 79 Updated Mar 9, 2025

Like the repo-name said.

Shell 2 Updated Nov 11, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,305 388 Updated Mar 26, 2025

Zero Bubble Pipeline Parallelism

Python 375 22 Updated Mar 4, 2025

Sequence-level 1F1B schedule for LLMs.

Python 17 3 Updated Jun 4, 2024

Distributed IO-aware Attention algorithm

Python 18 Updated Aug 22, 2024

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 2,949 574 Updated Feb 24, 2025

北邮北京邮电大学校园网网关自动化认证脚本。支持有线网和无线网。支持带参数 Portal 认证、AC 跳转、掉线重连。跨平台。 BUPT Network Login.

Shell 75 1 Updated Oct 13, 2024

Live Training for Open-source Big Models

Python 507 40 Updated May 30, 2023

↔️ Translate subtitle using ChatGPT

1,639 97 Updated Apr 17, 2024

Development repository for the Triton language and compiler

MLIR 15,002 1,889 Updated Mar 27, 2025

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,776 258 Updated Dec 5, 2023

add attention mask for flash attention

Python 3 Updated Aug 19, 2024

new guy

PHP 1 1 Updated Jun 11, 2023

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 1 Updated Jul 15, 2024

Elegant and Powerfull. Powered by OpenAI and Vercel.

TypeScript 3,207 2,991 Updated Oct 16, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 88,288 23,702 Updated Mar 27, 2025

用文本编辑器剪视频

Python 7,086 730 Updated Oct 5, 2024

real Transformer TeraFLOPS on various GPUs

Jupyter Notebook 898 114 Updated Jan 9, 2024

中文独立博客列表

Python 21,361 2,539 Updated Mar 24, 2025
Next