Discrete Flow Matching implemented in PyTorch

Implementation of Discrete Flow Matching [1][2], which is a generative model for generating discrete things such as text with flow matching. The code is implemented in PyTorch.

Step 0 of 128 (input)	Step 64 of 128	Step 128 of 128 (output)

How to run

Environment setup

Install uv for package management, e.g. pip install uv
Make sure Python 3.12 is installed: uv python install 3.12
Install the dependencies: uv sync --group jupyter

Run python -m discrete_flow_matching_pytorch.train --config configs/conv-8.yaml to start training a text generation model logging to wandb.

The sample notebook demonstrates the sampling process.

Note: Instead of using uv, it is also possible to install the dependencies in pyproject.toml with pip.

Summary of discrete flow matching compared to continuous flow matching

During training, we mask out text tokens according to the timestep
The model is trained to predict the original unmasked tokens with cross entropy loss
In sampling, we unmask text gradually with the sampled tokens

References

[1] Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design (YouTube presentation): Combines discrete and continuous flow matching. Originally introduced Discrete Flow Matching. Appendix F was very useful for the implementation
[2] Discrete Flow Matching: Builds on Multiflow's Discrete Flow Matching

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
configs		configs
media		media
notebooks		notebooks
src/discrete_flow_matching_pytorch		src/discrete_flow_matching_pytorch
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Discrete Flow Matching implemented in PyTorch

How to run

Environment setup

Summary of discrete flow matching compared to continuous flow matching

References

About

Languages

License

RobinKa/discrete-flow-matching-pytorch

Folders and files

Latest commit

History

Repository files navigation

Discrete Flow Matching implemented in PyTorch

How to run

Environment setup

Summary of discrete flow matching compared to continuous flow matching

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages