Releases: EnzymeAD/Reactant.jl
Releases · EnzymeAD/Reactant.jl
v0.2.17
Reactant v0.2.17
Merged pull requests:
- gpu, llvm and nvvm dialect bindings (#315) (@jumerckx)
- fix: recursion in AbsInt working (#483) (@avik-pal)
- add test for #226 (#486) (@jumerckx)
- fix: missing scalar indexing check for setindex (#491) (@avik-pal)
- fix: incorrect IR for traced RNGs (#494) (@avik-pal)
Closed issues:
v0.2.16
Reactant v0.2.16
Merged pull requests:
- CuArray tracing (#475) (@wsmoses)
- Improve error message on broadcast (#476) (@wsmoses)
- Fix traced rnumber promotion (#477) (@wsmoses)
- Fix type unstable stack (#478) (@wsmoses)
- Rewrite apply iterate (#479) (@wsmoses)
- Improve cuda error (#480) (@wsmoses)
- Format code of branch "main" (#481) (@github-actions[bot])
v0.2.15
Reactant v0.2.15
Merged pull requests:
- precompile first interpretation (#353) (@glou-nes)
- [GHA] Set author of PRs created by scheduled jobs to Enzyme bot (#455) (@giordano)
- feat: partial support for boolean indexing (#457) (@avik-pal)
- feat:
no_nan
mode and more passes (#458) (@avik-pal) - Fix cuda isdefined error (#459) (@wsmoses)
- feat: more trigonometric functions (#462) (@avik-pal)
- fix: check for name and module in function (#463) (@avik-pal)
- Fix offsetarrays support (#464) (@wsmoses)
- Fix opaqueclosure Worldage issue (#465) (@wsmoses)
- Format code of branch "main" (#466) (@github-actions[bot])
- CompatHelper: add new compat entry for OffsetArrays in [weakdeps] at version 1 for package ., (keep existing compat) (#467) (@github-actions[bot])
- CompatHelper: add new compat entry for OffsetArrays at version 1 for package test, (keep existing compat) (#468) (@github-actions[bot])
- Update index.md (#469) (@x66ccff)
- Format code of branch "main" (#471) (@github-actions[bot])
- feat: missing mean(f, ...) dispatches (#472) (@avik-pal)
- Update index.md, delete the
- Provide better error message if calling thunk with wrong types (#474) (@wsmoses)
Closed issues:
v0.2.14
Reactant v0.2.14
Merged pull requests:
- More mul overloads (#446) (@wsmoses)
- fix: use pip for jax (#450) (@avik-pal)
- Pipeline for nested enzyme differentiation (#452) (@wsmoses)
- Revert modularize bazel build to unbreak yggy (#453) (@wsmoses)
- CompatHelper: add new compat entry for SpecialFunctions in [weakdeps] at version 2 for package ., (keep existing compat) (#454) (@github-actions[bot])
Closed issues:
v0.2.13
Reactant v0.2.13
Merged pull requests:
SpecialFunctions
simple functions (#384) (@glou-nes)- Modularize Bazel build (#421) (@mofeing)
- feat: support Base.stack (#433) (@avik-pal)
- Fix mul overload (#440) (@wsmoses)
- fix: handle aos for mul (#441) (@avik-pal)
- feat: missing BF16 dispatches (#443) (@avik-pal)
- refactor: rework TracedRNG to be similar to other types (#448) (@avik-pal)
Closed issues:
- 2nd order AD fails (#298)
v0.2.12
Reactant v0.2.12
Merged pull requests:
- feat: indexing using traced values (#434) (@avik-pal)
- Regenerate MLIR Bindings (#435) (@github-actions[bot])
- fix: try building with cudnn 9.4 (#436) (@avik-pal)
- chore: bump jll (#437) (@avik-pal)
Closed issues:
- Scalar indexing triggered when requesting a view of a traced array (#408)
v0.2.11
Reactant v0.2.11
Merged pull requests:
- CompatHelper: add new compat entry for YaoBlocks in [weakdeps] at version 0.13, (keep existing compat) (#312) (@github-actions[bot])
- WIP: kernels (#314) (@wsmoses)
- test: unbreak CUDA CI (#337) (@avik-pal)
- feat: correct handling of wrapped arrays functionalities (#342) (@avik-pal)
- reduce codegen on
YaoBlocks.mat
methods (#343) (@mofeing) - refactor: use Ops instead of direct stablehlo calls (#347) (@avik-pal)
- Run CI on aarch64 (#350) (@mofeing)
- add missing @test (#352) (@glou-nes)
- Add Ops.hlo_call(::String, args...) (#358) (@Pangoraw)
- Kernels: support Julia 1.11 (#359) (@jumerckx)
- Define outer
repeat
method forTracedRArray
(#361) (@AntonOresten) - docs: add Reactant.Ops docs to the website (#362) (@avik-pal)
- feat: tracing Random.jl functionality correctly (#363) (@avik-pal)
- Interp2 (#365) (@wsmoses)
- fix: ensure printing of wrapped ConcreteRArrays goes through our show (#367) (@avik-pal)
- feat: add support for the remaining wrapper types (#369) (@avik-pal)
- Format code of branch "main" (#371) (@github-actions[bot])
stablehlo.sort
Ops (#374) (@glou-nes)- No Transpose emission for 0 rank tensor (#375) (@glou-nes)
- fix: include files if they end with .jl (#377) (@avik-pal)
- refactor: move overrides into a separate file (#379) (@avik-pal)
- CompatHelper: bump compat for Flux to 0.16 for package test, (keep existing compat) (#383) (@github-actions[bot])
- Fix traced_getfield (#385) (@wsmoses)
- fix
backtrace
calls on debug mode (#387) (@mofeing) - Kernel-supporting jll (#389) (@wsmoses)
- fix: override to overlay (#390) (@avik-pal)
- CompatHelper: add new compat entry for CUDA in [weakdeps] at version 5 for package ., (keep existing compat) (#391) (@github-actions[bot])
- CompatHelper: add new compat entry for CUDA at version 5 for package test, (keep existing compat) (#392) (@github-actions[bot])
- fix: missing interpolation in compile (#394) (@avik-pal)
- CompatHelper: add new compat entry for Random123 at version 1 for package test, (keep existing compat) (#395) (@github-actions[bot])
- CompatHelper: add new compat entry for StatsBase at version 0.34 for package test, (keep existing compat) (#396) (@github-actions[bot])
- CompatHelper: add new compat entry for StableRNGs at version 1 for package test, (keep existing compat) (#397) (@github-actions[bot])
- CompatHelper: add new compat entry for Distributions at version 0.25 for package test, (keep existing compat) (#398) (@github-actions[bot])
- CompatHelper: add new compat entry for HypothesisTests at version 0.11 for package test, (keep existing compat) (#399) (@github-actions[bot])
- feat: bump jll version and add opt pass (#400) (@avik-pal)
- ci: disable non-functional CI (#401) (@avik-pal)
- feat: add a no_enzyme option (#402) (@avik-pal)
- bump reactant jll (#404) (@Pangoraw)
- Inference cache (#405) (@jumerckx)
- Pythoncall (#407) (@wsmoses)
- JLL: enable throwing better errors (#409) (@wsmoses)
- Regenerate MLIR Bindings (#410) (@github-actions[bot])
- Format code of branch "main" (#411) (@github-actions[bot])
- CompatHelper: add new compat entry for PythonCall in [weakdeps] at version 0.9 for package ., (keep existing compat) (#412) (@github-actions[bot])
- CompatHelper: add new compat entry for PythonCall at version 0.9 for package test, (keep existing compat) (#413) (@github-actions[bot])
- generate EnzymeXLA dialect bindings (#414) (@mofeing)
- refactor: move python testing to integration (#415) (@avik-pal)
- fix: handle traced array returns inside objects (#417) (@avik-pal)
- Improve reactant error messages (#418) (@wsmoses)
- Fix ReactantPythonCallExt.jl (#419) (@wsmoses)
- feat: add zero and fill! for ConcreteRArray (#420) (@avik-pal)
- Fix error on global (#422) (@wsmoses)
- Cuv2 (#423) (@wsmoses)
- disable absint of absint (#424) (@wsmoses)
- fix: make eltype of Traced/Concrete Arrays to be respective RNumbers (#426) (@avik-pal)
- CUDA kernels take 3 (#427) (@wsmoses)
- Format code of branch "main" (#432) (@github-actions[bot])
Closed issues:
getindex
assumes contiguous indexing (#242)- Incorrect stableHLO for grouped convolution gradient (#331)
stablehlo.cholesky
inconsistent on GPU vs CPU (#338)- Incorrect traced code for upsampling functions (#339)
- Handling Wrapped Arrays Correctly (#345)
dynamic_slice
(#349)- Handling random numbers correctly (#356)
- Cannot compile function with Module as parameter (#357)
- Cannot repeat >2-dimensional arrays (#360)
- Segfault on Integer Overflow (#368)
- [BUG]: Segfault with large amount of data (#380)
- Compile time scales with MPI ranks (and it shouldn't) (#386)
- Graceful termination on out of memory (#406)
- Autodiff code generation fails when caching values (#416)
v0.2.10
v0.2.9
v0.2.8
Reactant v0.2.8
Merged pull requests:
- Add IFRT C-API bindings (#178) (@mofeing)
- Reactant interpreter overload (#259) (@wsmoses)
- First batch of shortcuts to HLO ops (#267) (@mofeing)
- fix attribute for arrray of bools (#279) (@Pangoraw)
- enable new optimizations (#280) (@Pangoraw)
- gensym in
@code_hlo
(#281) (@jumerckx) - test for traced_type where array isn't captured by type param (#282) (@jumerckx)
- fix: don't print "Module" (#283) (@avik-pal)
- docs: nicer documentation setup (#284) (@avik-pal)
- add check for null in
show(::Operation)
. (#285) (@jumerckx) - build(deps): bump codecov/codecov-action from 4 to 5 (#286) (@dependabot[bot])
- Format code of branch "main" (#289) (@github-actions[bot])
- CompatHelper: add new compat entry for EnzymeCore at version 0.8, (keep existing compat) (#290) (@github-actions[bot])
- chore: specify sources for local dev (#292) (@avik-pal)
- feat: define aos_to_soa (#294) (@avik-pal)
- add fill to TracedRNumber (#295) (@glou-nes)
- Format code of branch "main" (#296) (@github-actions[bot])
- fix: vararg compilation (#299) (@avik-pal)
- ci: add downgrade testing (#300) (@avik-pal)
- Fix tracedarray (#302) (@wsmoses)
- fix
create_result(::Array)
for non-vectors (#304) (@mofeing) - implement
collect
forTracedRArray
(#305) (@mofeing) - fix
YaoBlocks.mat
for rotation gates withTracedRNumber
parameters (#306) (@mofeing) - use row major when building attributes (#307) (@Pangoraw)
- Format code of branch "main" (#309) (@github-actions[bot])
- Speed up invoke usage (#313) (@wsmoses)
- specify aarch64-linux toolchain for Yggdrasil build (#316) (@mofeing)
- add optional debug info to Ops calls (#317) (@mofeing)
- Implement NNlib.∇conv_data! and NNlib.∇conv_filter! (#318) (@Pangoraw)
- Ops remaining basic numerical test (#320) (@glou-nes)
- fix: preserve IOContext for printing (#322) (@avik-pal)
- fix: #196 (#323) (@glou-nes)
- [JLL] Add conversion from llvm to mlir (#324) (@wsmoses)
Closed issues:
- IFRT tracking issue (#176)
- Abstract interpretation over
ReactantInterpreter
sometimes skips overriden methods (#189) - crash with parameter less function (#196)
- Mismatch computing
Enzyme.gradient
between regular array andConcreteRArray
(#236) - Add downgrade testing CI (#271)
- Inconsistent result for
iszero
for booleans (#277) - Implement
fill
forTracedRNumber
(#288) - compilation failure of getindex with a type-assert (#293)