Skip to content
View somepago's full-sized avatar

Block or report somepago

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 6,900 524 Updated Dec 31, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,190 150 Updated Dec 23, 2024

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

10,038 1,619 Updated Aug 31, 2023

Using FlexAttention to compute attention with different masking patterns

Python 40 Updated Sep 22, 2024

Efficient Triton Kernels for LLM Training

Python 4,080 233 Updated Jan 2, 2025

Megatron's multi-modal data loader

Python 153 13 Updated Dec 18, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 πŸ“ and reasoning techniques.

6,014 328 Updated Jan 2, 2025

NumPy tutorials & educational content in notebook format

Python 505 190 Updated Jan 2, 2025

γ€Žγ‚Όγƒ­γ‹γ‚‰δ½œγ‚‹ Deep Learning ❸』(O'Reilly Japan, 2020)

Python 756 298 Updated May 27, 2024

LLM related research papers curated by LLMs themselves

Python 16 10 Updated Jan 1, 2025

Neural Networks: Zero to Hero

Jupyter Notebook 12,548 1,693 Updated Aug 18, 2024

LLM101n: Let's build a Storyteller

30,805 1,682 Updated Aug 1, 2024

Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).

Python 107 11 Updated Aug 21, 2024

Official implementation of AnimateDiff.

Python 10,799 878 Updated Jul 31, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,113 88 Updated Aug 6, 2024

πŸ“Ί An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,644 118 Updated Dec 27, 2024

Implementation of πŸ’ Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 485 29 Updated Oct 25, 2024

Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. πŸ”₯

628 66 Updated Dec 31, 2024

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,476 457 Updated Sep 9, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,725 100 Updated Dec 24, 2024

Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)

Python 181 12 Updated May 28, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,822 1,040 Updated Dec 31, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,722 211 Updated Dec 29, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,664 114 Updated Dec 6, 2024

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 445 43 Updated Feb 29, 2024

[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

Python 25 Updated Jul 12, 2024

Simple AI agents / assistants

Python 40 4 Updated Oct 8, 2024

A list of AI autonomous agents

12,554 935 Updated Nov 19, 2024

When do we not need larger vision models?

Python 349 11 Updated Dec 4, 2024

Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024

Python 49 Updated Oct 2, 2024
Next