v1.2.0
Metal v1.2.0
Merged pull requests:
- Avoid constructing
MulAddMul
s on Julia v1.12+ (#295) (@dkarrasch) - Trigger the runtime profiler when a test times out. (#330) (@maleadt)
- Add MPSMatrixSoftMax (#333) (@christiangnrd)
- Reorganize and add some MPS tests (#335) (@christiangnrd)
- Typo fix (#336) (#337) (@101001000)
- Add error message for running Metal.jl under Rosetta (#339) (@tgymnich)
- Add
MPSCommandBuffer
(#340) (@christiangnrd) - Bump julia-actions/setup-julia from 1 to 2 (#341) (@dependabot[bot])
- Revert error message for Rosetta (#342) (@tgymnich)
- Update to ObjectiveC.jl v3. (#343) (@maleadt)
- Add autoreleasepools to MPS interface methods. (#344) (@maleadt)
- Don't redundantly return the cmdbuf from commit methods. (#345) (@maleadt)
- Whitespace fixes (#346) (@christiangnrd)
- CompatHelper: bump compat for LLVM to 7, (keep existing compat) (#347) (@github-actions[bot])
- CompatHelper: add new compat entry for SpecialFunctions in [weakdeps] at version 2, (keep existing compat) (#352) (@github-actions[bot])
- [NFC] Fix indentation (#353) (@christiangnrd)
- Bump LLVM downgrader (#354) (@maleadt)
- Don't export non-existent
contents
(#356) (@christiangnrd) - Remove/fix unused exports (#357) (@christiangnrd)
- Unexport
SimpleVersion
andAS
(#360) (@christiangnrd) - Add support for opaque pointers (#361) (@maleadt)
- Docstrings (#362) (@christiangnrd)
- Initial MacOS 15 support (#365) (@christiangnrd)
- Replace
current_device()
withdevice()
(#366) (@christiangnrd) - Support reading metallib v1.2.8 files from macOS 15. (#367) (@maleadt)
- Add metallib (dis)assembly helper scripts. (#368) (@maleadt)
- Simplify testing of examples. (#369) (@maleadt)
- Temporarily allow 1.11 to fail. (#371) (@maleadt)
- CompatHelper: add new compat entry for PrecompileTools at version 1, (keep existing compat) (#372) (@github-actions[bot])
- Define complex sqrt (#374) (@mtfishman)
- Check the macOS version during initialization. (#375) (@maleadt)
- CompatHelper: bump compat for LLVM to 8, (keep existing compat) (#376) (@github-actions[bot])
- Add
accumulate
implementation (#377) (@chengchingwen) - fix derived device array (#378) (@chengchingwen)
- avoid ReshapedArray using Int128 in metal kernel (#379) (@chengchingwen)
- improve type stability of derived array (#380) (@chengchingwen)
- add
findall
implementation (#382) (@zhenwu0728) - Bump version (#383) (@christiangnrd)
Closed issues:
- Tests sporadically timing out on 1.11 (#329)
- ReshapedArray indexing broken because of Int128 operation (#332)
- KernelAbstractions copyto! typo (#336)
- Segmentation Faults (#338)
- Port
accmulate!
andfindall
from CUDA.jl (#348) - Tests failing with
GPUCompiler
v0.26.5 andLLVM
v7.1 (#350) - downgrades LLVM (#355)
- sqrt(::Complex) unsupported due to conversion exceptions (#364)