Decouple dispatch and specialization #11339
Comments
Would this potentially require type parameters to be mandatory for good performance in many more situations than they are now? I worry whether this would make Julia's performance characteristics even more complicated to understand for new users. Or maybe this would somehow end up becoming more intuitive in the end? Not sure. (Is this what you meant by "performance could get finicky"?)
Yes, that was my concern. I edited the rules above so that I think we could maintain Julia's current behavior but add new ones.
I think specialization has to be a property of functions, not the type itself. The better version of this is to automatically figure out which parameters the function doesn't depend on, and avoid specializing on those (in other words, discover "parametricity" in the technical sense).
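A sketch of that idea in current syntax (the function is illustrative, and the skipped specialization is the hypothetical compiler feature being described, not something Julia does automatically today):

```julia
# This body only reads the array's length and never touches its elements,
# so the generated code is identical for every element type T. A compiler
# that discovered this "parametricity" could reuse one compiled instance
# instead of specializing per T.
half_length(v::Vector{T}) where {T} = length(v) ÷ 2
```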
I just spent a lovely hour+ re-reading JuliaLang/LinearAlgebra.jl#42. What bliss. Surprisingly, I found myself back at this issue, which I'd forgotten about entirely. I think @JeffBezanson's comment above is potentially important, and I'm just checking two things: …

For the latter, consider a simple example:

```julia
function fill!{T}(A::AbstractArray{T}, val)
    valT = convert(T, val)
    for i in eachindex(A)
        A[i] = valT
    end
    A
end
```

For all … In other words, for those of us not working on the compiler, is this worth being concerned about as we contemplate different paths forward? I'm guessing the answer is no, but it would be interesting to hear reflections on this. My reasoning is that presumably this has to be solved not at the level of the method, but at the level of individual cached (compiled) instantiations. In that case one can specialize for certain types of inputs but use more general code for others, and that solves the problem I'm worrying about above.
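Today's `@nospecialize` gives a manual version of part of this; a minimal sketch under current semantics (`fill2!` is an illustrative name, not proposed API):

```julia
# Dispatch still sees `val`, but the compiler is asked not to generate a
# fresh specialization of the method body for each concrete type of `val`.
function fill2!(A::AbstractArray{T}, @nospecialize(val)) where {T}
    valT = convert(T, val)::T  # concrete from here on, so the loop stays fast
    for i in eachindex(A)
        A[i] = valT
    end
    return A
end
```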
Thinking just a little bit more: presumably the ideal way to implement this corresponds to generating a method for …
I think a good item for 1.0 here is to deprecate `ANY` in favor of `@nospecialize`.
Would … ?
Eventually, yes. |
I have to say, it would be pretty amazing to have built-in static arrays like this. We'd need non-resizable … One problem to solve is that you'd only ever want to dispatch to a static-sized method if the size typevar was known to the compiler. To avoid dynamic dispatch, you'd need another annotation on methods so that the method is only considered when inference can infer the relevant information. The annotation would imply a promise to the compiler that the method is functionally identical to the one it specializes, or else we'd have determinism problems. We'd also need to store such abstract types unboxed, for performance. This could solve a range of problems, such as a …
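For comparison, the package ecosystem already does size-in-the-type dispatch; a small sketch using StaticArrays.jl (the package's real API, though the `weight` function is illustrative):

```julia
using StaticArrays

# The length is a type parameter of SVector, so this method only matches
# 3-vectors: the "size check" is resolved by dispatch, not at run time.
weight(v::SVector{3,T}) where {T} = v[1] + 2v[2] + 3v[3]

weight(SVector(1.0, 2.0, 3.0))   # 14.0
# weight(SVector(1.0, 2.0))      # would raise a MethodError
```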
The 1.0 part of this is done.
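Concretely, 1.0 deprecated the old `ANY` annotation in favor of `@nospecialize`; a minimal before/after sketch (`summarize` is an illustrative name):

```julia
# Pre-1.0 spelling:   summarize(x::ANY) = ...   # dispatch on x, never specialize
# 1.0 and later:
summarize(@nospecialize(x)) = string(typeof(x))

summarize(1)      # "Int64"
summarize("hi")   # "String"
```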
This would be very useful for shape-checking ML programs, where runtime shape errors are notoriously vexing. See aviatesk/JET.jl#82 (comment) for an example where it "just works" with JET.jl and sized arrays, though again compilation cost is an issue. It would also be helpful for IPO escape analysis, which according to the docs would do better with propagated array shape info, and for this issue: #43642. Is it possible to make this opt-in, so that it's not a breaking change and can be available during 1.x?
Is it possible to specialize on just specific type parameters? Edit: What if I want to include the whole shape of an array in the type, but specialize only on the rank and element type? I'm not sure whether that even makes sense in principle, though.
I don't think this should be closed, since there could still be much more control over what to specialize on.
aka "Generalize the ANY mechanism" aka "Completely destroy Jeff's life forever"
(continuing the heresy in JuliaLang/LinearAlgebra.jl#42 in a direction that might be more profitable, but much harder)
What if we could encode an Array's size among its type parameters, but not pay the compilation price of generating specialized code for each different size?
Here's a candidate set of rules (EDITED):
- `type Bar{T,K*}` means that, by default, functions are specialized on `T` but not on `K`. Array might be defined as `type Array{T,N,SZ*<:NTuple{N,Int}}` (in a future world of triangular dispatch). `foo(A::Array)` specializes the element type and the dimensionality, but not the size. `foo{T,K}(b::Bar{T,K})` would specialize on `K`, for example.
- Conversely, putting `*` after a type parameter in a method signature causes the parameter to be used in dispatch (subtyping and intersection) but prevents specialization. This is much like the ANY hack now. `matmul{S,T,m*,k*,n*}(A::Array{S,2,(m,k)}, B::Array{T,2,(k,n)})` would perform the size check at compile time but not specialize on those arguments.

I'm aware that this is more than a little crazy and has some serious downsides (among others, it feels like performance could get finicky). But to go even further out on a limb, one might additionally have `matmul{S,T,m<=4,k<=4,n<=4}(...)` to generate specialized algorithms (note the absence of `*`s) for small sizes and use the unspecialized fallbacks for larger sizes. This way one wouldn't need a specialized `FixedSizeArray` type, and could just implement such algorithms on `Array`.

A major downside to this proposal is that calls in functions that lack complete specialization might have to be expanded like this: …
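One hypothetical shape for such an expansion, with illustrative names (not from the original proposal): a caller compiled without full specialization guards the call with a runtime type test, keeping a fast path for one concrete case and falling back to dynamic dispatch otherwise.

```julia
kernel(A) = sum(A)   # stand-in for the real computational kernel

function caller(A)
    # Without specialization on A's concrete type, `kernel(A)` can't be
    # statically resolved; the expansion keeps a fast path for a common
    # concrete type and dynamically dispatches for everything else.
    if A isa Matrix{Float64}
        return kernel(A::Matrix{Float64})
    else
        return kernel(A)
    end
end
```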