-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bump IR downgrader #489
Merged
Merged
Bump IR downgrader #489
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Storage servers are stuck; JuliaRegistries/General#120562 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Metal Benchmarks
Benchmark suite | Current: 2ff598b | Previous: 3fd6c06 | Ratio |
---|---|---|---|
private array/construct |
26744 ns |
27095.285714285714 ns |
0.99 |
private array/broadcast |
463958.5 ns |
465000 ns |
1.00 |
private array/random/randn/Float32 |
804417 ns |
789812.5 ns |
1.02 |
private array/random/randn!/Float32 |
661333 ns |
660666.5 ns |
1.00 |
private array/random/rand!/Int64 |
553292 ns |
572250 ns |
0.97 |
private array/random/rand!/Float32 |
607645.5 ns |
597625 ns |
1.02 |
private array/random/rand/Int64 |
755459 ns |
736833 ns |
1.03 |
private array/random/rand/Float32 |
593250 ns |
631958.5 ns |
0.94 |
private array/copyto!/gpu_to_gpu |
652542 ns |
653750 ns |
1.00 |
private array/copyto!/cpu_to_gpu |
671291.5 ns |
817417 ns |
0.82 |
private array/copyto!/gpu_to_cpu |
831542 ns |
634167 ns |
1.31 |
private array/accumulate/1d |
1326916 ns |
1339791.5 ns |
0.99 |
private array/accumulate/2d |
1387750 ns |
1393417 ns |
1.00 |
private array/iteration/findall/int |
2064791.5 ns |
2094937.5 ns |
0.99 |
private array/iteration/findall/bool |
1823937.5 ns |
1826875 ns |
1.00 |
private array/iteration/findfirst/int |
1739041 ns |
1686499.5 ns |
1.03 |
private array/iteration/findfirst/bool |
1661875 ns |
1657166.5 ns |
1.00 |
private array/iteration/scalar |
3764458.5 ns |
3913229 ns |
0.96 |
private array/iteration/logical |
3194458.5 ns |
3178375.5 ns |
1.01 |
private array/iteration/findmin/1d |
1757334 ns |
1756417 ns |
1.00 |
private array/iteration/findmin/2d |
1350709 ns |
1353625 ns |
1.00 |
private array/reductions/reduce/1d |
1036875 ns |
1039812 ns |
1.00 |
private array/reductions/reduce/2d |
661729.5 ns |
661333 ns |
1.00 |
private array/reductions/mapreduce/1d |
1039895.5 ns |
1046500.5 ns |
0.99 |
private array/reductions/mapreduce/2d |
665042 ns |
658292 ns |
1.01 |
private array/permutedims/4d |
2703646 ns |
2708916 ns |
1.00 |
private array/permutedims/2d |
1004417 ns |
1008083 ns |
1.00 |
private array/permutedims/3d |
1572520.5 ns |
1583375 ns |
0.99 |
private array/copy |
595291 ns |
579458 ns |
1.03 |
latency/precompile |
5116241166 ns |
6399118167 ns |
0.80 |
latency/ttfp |
6708516354 ns |
6579444208.5 ns |
1.02 |
latency/import |
1151160667 ns |
1130024792 ns |
1.02 |
integration/metaldevrt |
707375 ns |
710542 ns |
1.00 |
integration/byval/slices=1 |
1648166.5 ns |
1546084 ns |
1.07 |
integration/byval/slices=3 |
11217625 ns |
9416354 ns |
1.19 |
integration/byval/reference |
1632416 ns |
1601166.5 ns |
1.02 |
integration/byval/slices=2 |
2610250 ns |
2699792 ns |
0.97 |
kernel/indexing |
494375 ns |
457917 ns |
1.08 |
kernel/indexing_checked |
483709 ns |
449833 ns |
1.08 |
kernel/launch |
8417 ns |
10034.833333333332 ns |
0.84 |
metal/synchronization/stream |
14250 ns |
14542 ns |
0.98 |
metal/synchronization/context |
15083 ns |
14875 ns |
1.01 |
shared array/construct |
25572.916666666664 ns |
25840.333333333332 ns |
0.99 |
shared array/broadcast |
474020.5 ns |
457229.5 ns |
1.04 |
shared array/random/randn/Float32 |
799375 ns |
796187.5 ns |
1.00 |
shared array/random/randn!/Float32 |
648542 ns |
647958 ns |
1.00 |
shared array/random/rand!/Int64 |
555708.5 ns |
563875 ns |
0.99 |
shared array/random/rand!/Float32 |
593625 ns |
601416.5 ns |
0.99 |
shared array/random/rand/Int64 |
784458 ns |
748833 ns |
1.05 |
shared array/random/rand/Float32 |
622208 ns |
603916 ns |
1.03 |
shared array/copyto!/gpu_to_gpu |
88041 ns |
86542 ns |
1.02 |
shared array/copyto!/cpu_to_gpu |
88541 ns |
88375 ns |
1.00 |
shared array/copyto!/gpu_to_cpu |
77083 ns |
78375 ns |
0.98 |
shared array/accumulate/1d |
1353750 ns |
1350249.5 ns |
1.00 |
shared array/accumulate/2d |
1394792 ns |
1390125 ns |
1.00 |
shared array/iteration/findall/int |
1827125 ns |
1793917 ns |
1.02 |
shared array/iteration/findall/bool |
1598291 ns |
1592959 ns |
1.00 |
shared array/iteration/findfirst/int |
1394083.5 ns |
1401625 ns |
0.99 |
shared array/iteration/findfirst/bool |
1361459 ns |
1363333 ns |
1.00 |
shared array/iteration/scalar |
154459 ns |
156875 ns |
0.98 |
shared array/iteration/logical |
2993875 ns |
2962417 ns |
1.01 |
shared array/iteration/findmin/1d |
1467146 ns |
1463291 ns |
1.00 |
shared array/iteration/findmin/2d |
1366750 ns |
1365959 ns |
1.00 |
shared array/reductions/reduce/1d |
720937.5 ns |
730375 ns |
0.99 |
shared array/reductions/reduce/2d |
668729.5 ns |
661375 ns |
1.01 |
shared array/reductions/mapreduce/1d |
742000 ns |
740833 ns |
1.00 |
shared array/reductions/mapreduce/2d |
663667 ns |
662354 ns |
1.00 |
shared array/permutedims/4d |
2722646 ns |
2838125 ns |
0.96 |
shared array/permutedims/2d |
1016125 ns |
1005479 ns |
1.01 |
shared array/permutedims/3d |
1581042 ns |
1562084 ns |
1.01 |
shared array/copy |
235417 ns |
247562.5 ns |
0.95 |
This comment was automatically generated by workflow using github-action-benchmark.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Should fix #488.