mlx 0.21.0 #198680

Merged 2 commits on Nov 22, 2024

Conversation

BrewTestBot (Member) commented Nov 22, 2024

Created with brew bump-formula-pr.

  • resource blocks have been checked for updates.
Release notes

Highlights

  • Support 3- and 6-bit quantization: benchmarks at https://github.com/ml-explore/mlx/pull/1613
  • Much faster memory-efficient attention for head dimensions 64 and 80: benchmarks at https://github.com/ml-explore/mlx/pull/1610
  • Much faster SDPA inference kernel for longer sequences: benchmarks at https://github.com/ml-explore/mlx/pull/1597
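For readers unfamiliar with why 3- and 6-bit widths are notable: neither divides a byte evenly, so packed codes straddle byte boundaries. The following is a minimal pure-Python sketch of group-wise affine quantization with bit packing. It is an illustration only and is not MLX's kernel or packing layout, which the release notes do not describe; the function names (`quantize_group`, `pack_codes`, and so on) are hypothetical.

```python
def quantize_group(values, bits):
    """Affine-quantize one group of floats to unsigned `bits`-bit codes.

    Returns (codes, scale, bias) such that value ~= code * scale + bias.
    """
    levels = (1 << bits) - 1
    lo, hi = min(values), max(values)
    scale = (hi - lo) / levels if hi > lo else 1.0
    # Clamp so floating-point noise can never push a code out of range.
    codes = [min(levels, max(0, round((v - lo) / scale))) for v in values]
    return codes, scale, lo


def dequantize_group(codes, scale, bias):
    """Map integer codes back to approximate float values."""
    return [c * scale + bias for c in codes]


def pack_codes(codes, bits):
    """Pack codes back to back into bytes (little-endian bit order)."""
    acc = 0
    for i, c in enumerate(codes):
        acc |= c << (i * bits)
    nbytes = (len(codes) * bits + 7) // 8
    return acc.to_bytes(nbytes, "little")


def unpack_codes(data, bits, count):
    """Inverse of pack_codes: recover `count` codes of width `bits`."""
    acc = int.from_bytes(data, "little")
    mask = (1 << bits) - 1
    return [(acc >> (i * bits)) & mask for i in range(count)]


if __name__ == "__main__":
    group = [i / 10 for i in range(-4, 4)]  # 8 sample weights
    for bits in (3, 6):
        codes, scale, bias = quantize_group(group, bits)
        packed = pack_codes(codes, bits)
        restored = dequantize_group(unpack_codes(packed, bits, len(codes)), scale, bias)
        worst = max(abs(a - b) for a, b in zip(group, restored))
        print(bits, "bits:", len(packed), "bytes, max error", worst)
```

In this sketch eight 3-bit codes occupy exactly 3 bytes and four 6-bit codes also occupy 3 bytes, which is why such widths need packing logic that 4- and 8-bit codes do not.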

Core

  • contiguous op (C++ only) + primitive
  • BFS width limit to reduce memory consumption during eval
  • Fast CPU quantization
  • Faster indexing math in several kernels:
    • unary, binary, ternary, copy, compiled, reduce
  • Improve dispatch threads for a few kernels:
    • conv, gemm splitk, custom kernels
  • More buffer donation with no-ops to reduce memory use
  • Use CMAKE_OSX_DEPLOYMENT_TARGET to pick Metal version
  • Dispatch Metal bf16 type at runtime when using the JIT

NN

  • nn.AvgPool3d and nn.MaxPool3d
  • Support groups in nn.Conv2d
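As background on what a `groups` argument means for a 2D convolution in general, here is a small pure-Python sketch of the bookkeeping: each output channel connects only to the input channels of its own group, which shrinks the weight count by a factor of `groups`. The helper `grouped_conv2d_layout` is hypothetical, it follows the conventional contiguous channel-to-group assignment, and it makes no claim about how MLX lays out or computes grouped convolutions internally.

```python
def grouped_conv2d_layout(in_channels, out_channels, kernel_size, groups=1):
    """Return (weight_count, mapping) for a grouped 2D convolution.

    mapping[o] is the range of input channels that output channel `o`
    reads from, assuming channels are split into contiguous groups.
    Bias terms are not counted.
    """
    if in_channels % groups or out_channels % groups:
        raise ValueError("in_channels and out_channels must be divisible by groups")
    kh, kw = kernel_size
    in_per_group = in_channels // groups
    out_per_group = out_channels // groups
    # Each output channel has one kh x kw filter per input channel in its group.
    weight_count = out_channels * in_per_group * kh * kw
    mapping = {}
    for o in range(out_channels):
        g = o // out_per_group
        mapping[o] = range(g * in_per_group, (g + 1) * in_per_group)
    return weight_count, mapping


if __name__ == "__main__":
    for groups in (1, 4, 8):
        count, mapping = grouped_conv2d_layout(8, 16, (3, 3), groups)
        print(f"groups={groups}: {count} weights, "
              f"output 15 reads inputs {list(mapping[15])}")
```

With `groups` equal to `in_channels`, every output channel reads a single input channel, which is the depthwise case.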

Bug fixes

  • Fix per-example mask + docs in sdpa
  • Fix FFT synchronization bug (use dispatch method everywhere)
  • Throw for invalid *fft{2,n} cases
  • Fix OOB access in qmv
  • Fix donation in sdpa to reduce memory use
  • Allocate safetensors header on the heap to avoid stack overflow
  • Fix sibling memory leak
  • Fix view segfault for scalars input
  • Fix concatenate vmap

github-actions bot added the python, bump-formula-pr, and macos-only labels Nov 22, 2024

🤖 An automated task has requested bottles to be published to this PR.

github-actions bot added the CI-published-bottle-commits label Nov 22, 2024
BrewTestBot added this pull request to the merge queue Nov 22, 2024
Merged via the queue into master with commit 3e3ff6c Nov 22, 2024
15 checks passed
BrewTestBot deleted the bump-mlx-0.21.0 branch November 22, 2024 22:47
Labels

  • bump-formula-pr: PR was created using `brew bump-formula-pr`
  • CI-published-bottle-commits: The commits for the built bottles have been pushed to the PR branch.
  • macos-only: Formula depends on macOS
  • python: Python use is a significant feature of the PR or issue
2 participants